fig|1040638.4.peg.4480
Escherichia coli O104:H4 str. LB226692
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
L
AT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|6666666.5357.peg.1734
Escherichia coli TY-2482
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
L
AT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585055.8.peg.1292
Escherichia coli 55989
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
L
AT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585055.6.peg.1293
Escherichia coli 55989 (94-769/769)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
L
AT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340186.5.peg.5177
Escherichia coli E110019
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|566546.3.peg.343
Escherichia coli W
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|566546.4.peg.1251
Escherichia coli W
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585034.4.peg.1177
Escherichia coli IAI1
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
I
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585034.5.peg.1173
Escherichia coli IAI1
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
I
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340184.6.peg.4459
Escherichia coli B7A
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
ID
GNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|550672.3.peg.1469
Escherichia coli B088 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|562.375.peg.4059
Escherichia coli EC4100B (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|679207.4.peg.1866
Escherichia coli MS 107-1 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|409438.11.peg.1350
Escherichia coli SE11 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YA
I
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340186.3.peg.4906
Escherichia coli E110019 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340184.3.peg.4258
Escherichia coli B7A (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
ID
GNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|679204.3.peg.3591
Escherichia coli MS 145-7 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
ID
GNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|344601.3.peg.3881
Escherichia coli B171 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|344601.5.peg.4070
Escherichia coli B171 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340185.3.peg.4620
Escherichia coli E22 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|340185.4.peg.4869
Escherichia coli E22 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|595495.4.peg.1780
Escherichia coli KO11 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585395.4.peg.1333
Escherichia coli O103:H2 str. 12009 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|749545.3.peg.871
Escherichia coli MS 182-1 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|749532.3.peg.3225
Escherichia coli MS 78-1 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|331111.12.peg.1602
Escherichia coli E24377A
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSL
G
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
F
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|656408.3.peg.1231
Escherichia coli H591 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDV
N
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
F
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|679206.4.peg.837
Escherichia coli MS 119-7 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDV
N
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
F
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|573235.3.peg.1705
Escherichia coli O26:H11 str. 11368 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSL
G
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
L
FQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|331111.3.peg.3776
Escherichia coli E24377A (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGDTLDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
T
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
S
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSL
G
S
------------
ANEI
L
Y
H
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
NSKI
TG
SA
N
I
STD
V
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
DVE
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
L
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
M
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
C
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
F
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|679205.4.peg.4540
Escherichia coli MS 124-1 (217-892/892)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGD
K
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
V
VL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
D
SKI
TG
SA
N
L
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
I
VA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
L
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|749533.3.peg.4628
Escherichia coli MS 84-1 (217-892/892)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAGD
K
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
Y
TTGPDAA
I
AKIYNGG
T
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
V
VL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
T
V
DA
T
D
SKI
TG
SA
N
L
STD
D
-----------
NTHTYLSLSDNSTWDIKA
D
S
TV
S
----
-
NLT
VDNSTVYI
S
R
A
D
G
R
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
I
VA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
L
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|656444.3.peg.1863
Escherichia coli TA280 (207-882/882)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
T
F
AVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
A
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FR
S
FI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|656437.3.peg.1297
Escherichia coli TA143 (207-882/882)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKA
D
D
K
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
V
NEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
S
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIKA
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
V
G
E
T
RYIDPVTEQE
R
S
N
R
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
L
A
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
M
MVGLKY
K
F
fig|749531.3.peg.3701
Escherichia coli MS 69-1 (207-882/882)
MAK
G
EI
-
--
T
T
L
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
T
Y
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
A
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|585057.4.peg.1983
Escherichia coli IAI39 (207-882/882)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
LV
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
AF
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
S
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
F
QTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNP
S
A
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
M
MVGLKY
K
F
fig|585057.6.peg.1982
Escherichia coli IAI39 (207-882/882)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
LV
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
AF
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
S
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GAE
L
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
F
QTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNP
S
A
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
M
MVGLKY
K
F
fig|749527.3.peg.4920
Escherichia coli MS 21-1 (207-882/882)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
F
VL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
A
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EII
C
VEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
R
YL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
F
QTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
L
L
G
DNGYND
TA
VMVGLKY
K
F
fig|550677.3.peg.2613
Escherichia coli B354 (207-882/882)
MAK
G
EI
-
--
T
T
L
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
T
Y
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
L
A
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
D
F
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
G
GD
N
SV
T
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIF
T
GAY
E
Y
S
L
TRGNT
D
AT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
M
MVGLKY
K
F
fig|439855.10.peg.2136
Escherichia coli SMS-3-5 (98-773/773)
MAK
G
EI
-
--
T
TH
GT
ESYAAYANGTVVKAG
SK
LDYTNASVTLTDVD
--
I
T
TH
G
DNAH
A
IAARQ
GT
V
S
FNQGE
I
-
-
-----------------------
H
TTGPDAA
I
AKIYNGG
K
VTLKN
--
TSAVA
--------------------------
HQ
G
A
G
IVL
E
SSING
------
QEATV
D
I
L
S
-
G
SSLR
S
------------
ANEI
L
Y
N
KNETSN
V
TITD
S
---------
EVSSAADVF
-
IN
N
I
K
G
H
LV
I
DA
S
NSKI
TG
SA
N
L
STD
D
-----------
S
THTYLSLSDNSTWDIK
T
D
S
TV
S
----
-
K
LT
VDNSTVYI
S
R
A
D
G
K
AF
E
-------
PTR
-
LT
ITEN
Y
V
G
NN
G
V
L
H
L
RT
EL
----
GD
D
N
S
AT
D
KV
V
I
NGNT
S
G
T
T
R
V
K
V
TNAG
G
S
G
AY
T
LN
GI
EIISVEG
-----
ES
N
G
E
F
IK
--
DSRIFA
GAY
E
Y
S
L
TRGNTEAT
N
KN
WYL
TN
------
F
QAT
SGGE
T
--------
NSGGS
S
A
P
TVA
P
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
F
QTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNP
S
A
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
M
MVGLKY
K
F
fig|656419.3.peg.1494
Escherichia coli M718 (227-887/887)
M
N
G
GS
I
-
--
T
T
N
G
IN
SY
GV
YANG
-----------
KK
A
YIN
L
DY
V
A
--
L
E
T
VA
D
GSY
AV
A
I
RQ
G
N
I
D
IKN
S
S
I
-
-
-----------------------
T
TTG
TK
A
P
I
AKIYNGG
E
LFFS
N
--
VT
AV
S
--------------------------
E
Q
D
K
G
I
S
I
D
A
S
NID
------
SQ
A
K
I
AL
L
S
-
VE
L
SS
A
------------
LD
S
I
DV
N
K
TT
T
DVSI
LN
R
S
---------
IITPGNN
V
L
-
IN
N
A
G
G
G
L
N
I
I
S
S
D
S
TL
N
G
ATK
L
VSG
T
-----------
T
T
---
L
K
LS
E
N
TI
W
NM
K
D
D
S
V
V
T
----
-
H
LTN
S
D
S
IIN
L
S
Y
D
D
G
K
TFT
-------
Q
G
K
TLTV
K
G
N
Y
V
G
NN
G
Q
L
N
I
RT
V
L
----
GD
D
K
S
AT
D
R
L
IV
E
GNT
S
G
S
TTV
Y
V
K
NAG
G
S
G
A
A
T
LN
GI
E
L
I
T
V
N
G
D
----
ES
P
A
DAFRQ
G
D
A
RI
A
A
GA
F
E
Y
Q
L
K
Q
-----
QG
KN
WYL
T
S
------
Y
Q
SV
NEED
N
--------
S
S
E
G
N
S
E
S
T
E
TP
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SSM
SDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
W
L
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|749537.3.peg.1647
Escherichia coli MS 115-1 (222-882/882)
M
N
G
GS
I
-
--
T
T
N
G
IN
SY
G
AYANG
-----------
KK
A
YIN
L
DY
V
A
--
L
E
T
VA
D
GSY
AV
A
I
RQ
G
N
I
D
IKN
S
S
I
-
-
-----------------------
T
TTG
TK
A
P
I
AKIYNGG
E
LFFS
N
--
VT
AV
S
--------------------------
K
Q
D
K
G
I
S
I
D
A
S
NID
------
SQ
A
K
I
AL
L
S
-
VE
L
SS
A
------------
LD
S
I
DV
N
K
TT
T
DVSF
LN
R
S
---------
IITPGNN
V
L
-
V
N
N
T
G
G
D
L
N
I
I
S
S
D
S
I
L
N
G
ATK
L
VSG
T
-----------
T
T
---
L
K
LS
E
N
TI
W
NM
K
D
D
S
V
V
T
----
-
H
LTN
S
D
S
IIN
L
S
Y
D
D
G
Q
TFT
-------
Q
G
K
TLTV
K
G
N
Y
V
G
NN
G
Q
L
N
I
RT
V
L
----
GD
D
K
S
AT
D
R
L
IV
E
GNT
S
G
S
TTV
Y
V
K
NAG
G
S
G
A
A
T
LN
GI
E
L
I
T
V
N
G
D
----
ES
P
A
DAFRQ
G
D
A
RI
A
A
GA
F
E
Y
Q
L
K
Q
-----
QG
KN
WYL
T
S
------
Y
Q
SV
NEED
N
--------
S
S
E
G
N
S
E
S
T
E
TP
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYIDPVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
G
GDL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
L
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
DSW
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
ATV
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
SN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|550676.3.peg.641
Escherichia coli B185 (227-887/887)
M
N
G
GS
I
-
--
T
T
NNIN
SY
G
AYANG
-----------
KK
A
YIN
L
D
DV
A
--
L
E
T
VAEGSY
AV
A
I
RQ
G
N
I
D
IKN
S
F
I
-
-
-----------------------
T
TTG
TKSP
I
AKIYNGG
E
LFFS
N
--
VT
AV
S
--------------------------
V
Q
D
K
G
I
S
I
D
A
S
NID
------
SH
A
K
I
AL
S
S
-
VE
L
SS
A
------------
LD
S
I
DV
N
K
TT
T
NLNI
LN
R
S
---------
IITPGNNIL
-
V
N
N
T
G
G
D
L
N
I
I
S
S
NS
TL
N
G
ATK
L
VSG
T
-----------
T
T
---
L
K
LS
E
N
TI
W
NM
K
D
D
S
V
V
T
----
-
H
LTN
S
D
S
IIN
L
S
Y
D
D
G
Q
TFT
-------
Q
G
K
TLTV
K
G
N
Y
V
G
NN
G
Q
L
N
I
RT
V
L
----
GD
D
K
S
AT
D
R
L
IV
E
GNT
S
G
S
TTV
Y
V
K
NAG
G
S
G
A
A
T
LN
GI
E
L
I
T
V
N
G
D
----
ES
P
A
DAFRQ
G
D
A
RI
A
A
GA
F
E
Y
Q
L
K
Q
-----
QG
KN
WYL
T
S
------
Y
Q
SV
NEED
N
--------
S
S
E
G
N
S
E
S
T
E
TP
T
--
P
-
-
-
-----
VL
R
P
EA
G
S
Y
V
A
NL
A
-----
AAN
TL
F
V
M
RL
N
D
R
AG
E
T
RYI
E
PVTEQE
R
SSR
L
W
LR
Q
IGG
H
NAWRDSN
GQL
RTTSHRYV
-
S
QL
GA
DL
LT
G
GFT
D
SDS
W
R
LG
V
MA
GY
ARDYN
S
TH
SS
VSDY
R
SKGSVR
GY
S
A
GL
YAT
WF
ADD
I
S
K
K
GA
YI
D
A
W
A
QY
S
WF
K
N
S
V
----
KG
D
E
LA
Y
ES
Y
SAK
G
A
S
V
SLEAGY
GFALNKSFGLEAAKYT
W
------
IFQ
PQAQ
A
I
WM
GV
DHNAH
T
E
AN
G
S
R
IENDANNN
IQTRLG
FRTFI
RT
QEKNSGPHGD
D
F
E
P
FVEM
N
WI
H
N
-
SK
D
FAVSMN
G
VKVEQ
D
GA
RN
L
GEIK
L
GV
N
GN
LNPAA
S
VW
G
N
V
GV
Q
L
G
DNGYND
TA
VMVGLKY
K
F
fig|550677.3.peg.743
Escherichia coli B354 (688-1345/1345)
AK
I
-
--
TG
S
G
DLAFSSQKGQ
TV
SLSNKDN
DYT
G
----
I
TD
L
-
--
-------------
R
S
GTL
L
L
N
ND
NV
L
GHTHELRLAAETELDMNGHSQIVG
T
LN
G
SADS
L
LSL
-
NGG
S
L
T
VT
N
GG
TS
T
GS
--------------------------
LT
G
S
G
---
-
-----
------
---E
LN
I
Q
G
-
GT
L
DI
A
------------
GD----
N
S
N
L
T
A
N
V
N
I
AN
S
--------------
A
N
V
L
-
V
S
H
A
Q
G
---
-
--
-
----L
G
SA
N
V
ENN
G
-----------
T
------
L
AL
N
N
S
AEKR
A
AA
S
VN
YTLG
GNLTN
NGTL
--
MT
G
M
S
G
Q
--
Q
-------
AG
N
V
L
V
V
K
G
N
Y
H
G
NN
G
Q
L
VMN
T
V
L
----
NGDDSV
T
D
K
LV
V
E
G
D
T
S
G
T
T
A
V
T
VN
NAG
G
T
G
A
K
T
LN
GI
E
L
I
H
V
D
G
-----
K
S
E
G
E
F
VQ
--
A
G
RI
V
A
GAYDYTL
A
RG
QG
-
A
NS
G
N
WYL
T
S
GSD
S
P
E
L
Q
P
E
PDPM
P
--------
N
P
E
P
N
P
N
P
N
PTP
T
--
P
G
P
D
LNVDND
L
R
P
EA
G
S
Y
I
A
NL
A
-----
AAN
T
M
F
TT
RL
H
E
R
L
G
N
T
Y
Y
T
D
M
VT
GEQ
K
QT
T
M
W
M
R
HE
GG
H
N
K
WRD
GS
GQL
K
T
Q
S
N
RYV
-
L
QL
G
GD
V
AQWSQNG
SDS
W
H
V
G
V
MA
GY
G
NS
DS
K
T
I
SS
R
TG
Y
RA
K
A
SV
N
GY
S
T
GL
YAT
W
YADD
E
S
R
N
GA
Y
LDSW
A
QY
S
WF
D
N
T
V
----
KG
D
DL
Q
S
ES
Y
K
S
K
G
F
T
ASLEAGY
KHK
L
AEFN
G
SQGTRN
E
W
------
Y
V
Q
PQAQV
T
WM
GV
K
A
D
K
H
R
E
S
N
G
TL
V
HS
N
GDG
N
V
QTRLG
V
K
T
WL
KS
HH
K
MDDDK
S
R
E
F
Q
P
FVE
V
N
W
L
H
N
-
SK
D
F
ST
SM
DG
V
S
V
T
Q
D
GA
RN
IA
EIK
T
GV
E
G
Q
LN
AN
LN
VW
G
N
V
GV
Q
VA
D
R
GYND
T
S
A
MVG
I
K
W
Q
F
fig|331111.12.peg.1994
Escherichia coli E24377A (342-1009/1009)
N
ATKV
E
FG
S
G
E
G
VFVF
---------------
NH
TN
N
S
DAGYQ
VD
--
M
LI
T
G
D
D------
KD
G
K
V
I
H
D
A
G
HT
V
-
-----------------------
F
NA
G
NTYS
G
KTLV
N
D
G
L
L
T
IAS
--
H
T
A
DG
--------------------------
VT
G
M
G
---
-
-----
------
-SSE
V
T
I
A
S
P
GT
L
D
--
------------
---
I
L
A
S
T
N
SAGDY
T
L
T
N
A
LKGDGLMRVQL
SS
YDKM
F
GFT
H
A
T
G
---
-
--
-
-TEFA
G
V
A
Q
L
KDS
T
FTLERDNTAALTHAMLQ
S
D
S
E
N
T
T
SVK
V
G
E
Q
SI
G
----
-
G
L
AMNG
G
T
LIFD
M
E
NG
G
T
V
QMNSEGGK
P
G
N
V
LTVNG
N
YTG
NN
G
L
M
T
F
NA
T
L
----
G
GD
N
S
P
T
D
K
M
N
V
K
G
D
T
Q
GNT
R
V
R
V
D
N
I
G
G
V
G
A
Q
T
V
N
GI
E
L
I
E
V
G
G
-----
N
S
A
G
N
F
A
L
T
-
T
G
T
V
E
A
GAY
V
YTL
A
K
G
KG
-
N
D
E
KN
WYL
T
S
K
WDGVT
P
AD
T
PDPI
N
--------
N
P
---
-
-------
--
P
V
V
D
P
EGPS
V
Y
R
P
EA
G
S
Y
I
S
N
I
A
-----
AAN
S
L
F
SH
RL
H
D
R
L
G
E
P
Q
Y
T
D
SLHS
Q
G
S
A
S
SM
W
M
R
H
V
GG
H
ERS
R
A
GD
GQL
N
T
QA
N
RYV
-
L
QL
G
GDL
AQWSSNAQ
D
R
W
H
LG
V
MA
GY
A
N
QHS
N
T
Q
S
N
RV
G
Y
K
S
D
G
R
IS
GY
S
A
GL
YAT
W
Y
Q
N
D
A
N
K
T
GA
Y
V
DSW
A
L
Y
N
WF
D
N
S
V
----
S
S
D
NRS
A
D
D
Y
D
S
R
G
V
T
AS
V
E
G
GY
T
F
EAGTFS
G
S
E
GTLN
T
W
------
Y
V
Q
PQAQ
I
T
WM
GV
K
DS
D
H
T
RK
D
G
T
R
IE
TEGDG
N
V
QTRLG
V
K
T
YLN
S
HHQRDDGKQR
E
F
Q
P
Y
I
E
A
N
WI
N
N
-
SK
V
Y
AV
K
MN
G
Q
T
V
GR
E
GA
RN
L
GE
VR
T
GV
EAK
V
N
NN
L
SL
W
G
N
V
GV
Q
L
G
D
K
GY
S
D
T
Q
G
M
L
G
V
KY
S
W
fig|444449.5.peg.1505
Escherichia coli O157:H7 str. EC4042 (2-602/602)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|550677.3.peg.1780
Escherichia coli B354 (650-1242/1242)
ADSSGQH
L
DEGST
L
TKTGAGT
-----------------------------
LE
--
M
TASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSID
V
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTV
M
VNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQE
A
TPP
S
PPD
PDPT
-
--------
-
PDPT
P
D
------
--
P
E
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
D
GQTLNLRVIGG
N
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
A
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVV
V
EPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
S
V
H
V
I
PTLDLNYYHD
-
PH
A
T
K
IEEDGSTISD
E
AV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSM
M
VKW
fig|562.375.peg.3223
Escherichia coli EC4100B (652-1240/1240)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
-
--------
-----
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444453.5.peg.1611
Escherichia coli O157:H7 str. EC4076 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|478005.5.peg.1892
Escherichia coli O157:H7 str. EC4486 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|331111.3.peg.267
Escherichia coli E24377A (650-1252/1252)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
PDPT
P
--------
DPDPT
P
DPDPTP
E
PA
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLF
R
GRWGDDG
G
WMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
T
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGV
M
GNISQRVSLRGSVAWQKGSDDFAQTA
V
FLSMTVKW
fig|749531.3.peg.2720
Escherichia coli MS 69-1 (650-1250/1250)
ADSSGQH
L
DEGST
L
TKTGAGT
-----------------------------
LE
--
M
TASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSID
V
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAA
E
PTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
D
GQTLNLR
I
IGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
A
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPG
H
GVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
P
S
LDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656437.3.peg.2508
Escherichia coli TA143 (650-1242/1242)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSID
V
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQE
A
TPP
S
PPD
PDPT
-
--------
-
PDPT
P
D
------
--
P
E
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
D
GQTLNLRVIGG
N
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
A
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVV
V
EPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
S
V
H
V
I
PTLDLNYYHD
-
PH
A
T
K
IEEDGSTISD
E
AV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|340184.3.peg.1603
Escherichia coli B7A (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|679204.3.peg.265
Escherichia coli MS 145-7 (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|585396.4.peg.3074
Escherichia coli O111:H- str. 11128 (652-1246/1246)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
N
VGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|573235.3.peg.3292
Escherichia coli O26:H11 str. 11368 (634-1228/1228)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|83334.1.peg.3111
Escherichia coli O157:H7 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|155864.1.peg.3115
Escherichia coli O157:H7 EDL933 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656419.3.peg.2996
Escherichia coli M718 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
N
GTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNT
T
GA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNL
Q
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQ
M
LNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
Q
D
DG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656443.3.peg.2942
Escherichia coli TA271 (650-1244/1244)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
R
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|550676.3.peg.1444
Escherichia coli B185 (634-1228/1228)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
N
GTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNT
T
GA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DP
A
PT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAF
V
MER
R
DHAG
-
-
---------
G
D
G
P
TLNLRVI
C
G
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLF
R
GRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLD
N
WLQYAWF
N
NDV
----
SE
Q
D
DG
A
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|409438.11.peg.2652
Escherichia coli SE11 (652-1246/1246)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYH
N
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|511693.5.peg.2267
Escherichia coli BL21 (403-1003/1003)
ADSSG
E
HQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSID
T
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEP
I
STGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLS
V
DLF
R
GRWGDDGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
T
DHYHSSGIIA
L
LEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749538.3.peg.2299
Escherichia coli MS 116-1 (383-983/983)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444454.5.peg.2042
Escherichia coli O157:H7 str. EC4024 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444448.5.peg.265
Escherichia coli O157:H7 str. EC4045 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444452.5.peg.2630
Escherichia coli O157:H7 str. EC4113 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444450.8.peg.3355
Escherichia coli O157:H7 str. EC4115 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444451.5.peg.771
Escherichia coli O157:H7 str. EC4196 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|444447.5.peg.399
Escherichia coli O157:H7 str. EC4206 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|478004.5.peg.1775
Escherichia coli O157:H7 str. EC4401 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|544404.4.peg.3158
Escherichia coli O157:H7 str. TW14359 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|562.373.peg.3981
Escherichia coli 1125A (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|478007.5.peg.1225
Escherichia coli O157:H7 str. EC508 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749545.3.peg.4140
Escherichia coli MS 182-1 (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|340184.6.peg.1684
Escherichia coli B7A (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|478008.5.peg.2388
Escherichia coli O157:H7 str. EC869 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|637388.3.peg.355
Escherichia coli O157:H7 str. FRIK2000 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|570506.3.peg.4978
Escherichia coli O157:H7 str. FRIK966 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|331111.12.peg.2812
Escherichia coli E24377A (634-1236/1236)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
PDPT
P
--------
DPDPT
P
DPDPTP
E
PA
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLF
R
GRWGDDG
G
WMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
T
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGV
M
GNISQRVSLRGSVAWQKGSDDFAQTA
V
FLSMTVKW
fig|344601.3.peg.1465
Escherichia coli B171 (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|340185.3.peg.661
Escherichia coli E22 (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|340185.4.peg.704
Escherichia coli E22 (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|585395.4.peg.2828
Escherichia coli O103:H2 str. 12009 (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|550672.3.peg.1947
Escherichia coli B088 (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYH
N
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|585034.4.peg.2278
Escherichia coli IAI1 (634-1228/1228)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|585034.5.peg.2275
Escherichia coli IAI1 (634-1228/1228)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|1040638.4.peg.3313
Escherichia coli O104:H4 str. LB226692 (193-787/787)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|562.371.peg.5091
Escherichia coli 1044A (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|562.372.peg.5536
Escherichia coli 1212A (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|562.374.peg.3381
Escherichia coli 536A (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|155864.8.peg.3009
Escherichia coli O157:H7 EDL933 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|478006.5.peg.1833
Escherichia coli O157:H7 str. EC4501 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|386585.9.peg.3250
Escherichia coli O157:H7 str. Sakai (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|502346.5.peg.4263
Escherichia coli O157:H7 str. TW14588 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|216592.1.peg.3044
Escherichia coli 042 (650-1246/1246)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTV
M
VNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
D
----
P
A
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
D
GQTLNLRVIGG
N
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
A
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVV
V
EPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|216592.3.peg.2585
Escherichia coli 042 (650-1246/1246)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTV
M
VNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
D
----
P
A
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
D
GQTLNLRVIGG
N
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
A
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVV
V
EPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|701177.3.peg.2795
Escherichia coli O55:H7 str. CB9615 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
Q
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
C
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
Q
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEED
A
STISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|679207.4.peg.3126
Escherichia coli MS 107-1 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQF
C
LAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|316401.4.peg.2707
Escherichia coli ETEC H10407 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|83333.1.peg.2206
Escherichia coli K12 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749533.3.peg.1670
Escherichia coli MS 84-1 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
QDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|316407.3.peg.2166
Escherichia coli W3110 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|316385.5.peg.2358
Escherichia coli str. K-12 substr. DH10B (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|331112.3.peg.2229
Escherichia coli HS (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GV
M
LTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749532.3.peg.3513
Escherichia coli MS 78-1 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|344601.5.peg.1540
Escherichia coli B171 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749537.3.peg.2270
Escherichia coli MS 115-1 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KR
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749547.3.peg.3510
Escherichia coli MS 187-1 (634-1234/1234)
ADSSG
E
HQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSID
T
T
SSGTIDIS
------------
DGTVLR
L
T
W
QDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEP
I
STGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLS
V
DLF
R
GRWGDDGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
T
DHYHSSGIIA
L
LEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656379.3.peg.2769
Escherichia coli FVEC1302 (637-1237/1237)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
N
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
A
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRG
N
VAWQKGSDDFAQTAGFLSMTVKW
fig|656380.3.peg.2327
Escherichia coli FVEC1412 (637-1237/1237)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
N
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
A
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRG
N
VAWQKGSDDFAQTAGFLSMTVKW
fig|749549.3.peg.1794
Escherichia coli MS 198-1 (637-1237/1237)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
N
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
A
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRG
N
VAWQKGSDDFAQTAGFLSMTVKW
fig|585056.7.peg.2750
Escherichia coli UMN026 (637-1237/1237)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
K
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
N
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
A
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
G
V
T
PTLDLNYYHD
-
PH
A
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRG
N
VAWQKGSDDFAQTAGFLSMTVKW
fig|413997.3.peg.2255
Escherichia coli B str. REL606 (634-1234/1234)
ADSSG
E
HQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSID
T
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEP
I
STGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLS
V
DLF
R
GRWGDDGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
T
DHYHSSGIIA
L
LEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|469008.4.peg.1468
Escherichia coli BL21(DE3) (634-1234/1234)
ADSSG
E
HQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSID
T
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEP
I
STGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLS
V
DLF
R
GRWGDDGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
T
DHYHSSGIIA
L
LEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|481805.3.peg.1519
Escherichia coli ATCC 8739 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PP
E
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
A
P
V
P
V
YQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGV
R
GNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|595496.3.peg.2204
Escherichia coli BW2952 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|536056.3.peg.1498
Escherichia coli DH1 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|679205.4.peg.1829
Escherichia coli MS 124-1 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
QDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749548.3.peg.2130
Escherichia coli MS 196-1 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|316385.7.peg.2412
Escherichia coli str. K-12 substr. DH10B (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|511145.12.peg.2322
Escherichia coli str. K-12 substr. MG1655 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|511145.6.peg.2306
Escherichia coli str. K-12 substr. MG1655 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656414.3.peg.2610
Escherichia coli H736 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|749540.3.peg.4013
Escherichia coli MS 146-1 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|656444.3.peg.3232
Escherichia coli TA280 (650-1252/1252)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
N
S
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEPTSTGIKVVDFAA
E
PTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
PA
P
D
P
T
PAYQPVLN
A
KVGGY
F
NNLR
-----
AANQAF
V
MER
H
DHAG
-
-
---------
G
N
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSG
C
WG
T
DGEWMLG
A
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
H
EDG
V
DHYHSSGIIASLEAGY
---------------
QWLPG
H
GVVIEPQAQVIYQGVQQDDFTAAN
H
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
T
PTLDLNYY
Y
D
-
PH
S
T
K
IEEDGSTISD
E
AV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|331112.6.peg.2326
Escherichia coli HS (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GV
M
LTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|358709.5.peg.686
Escherichia coli 101-1 (634-1234/1234)
ADSSG
E
HQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
A
DVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSID
T
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDS
A
SDQLV
L
NGNTAGNTTVV
I
N
P
ITGIGEP
I
STGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLS
V
DLF
R
GRWGDDGEWMLG
I
VGGYSDNQG
D
SRSSMTGTRADNQNHGY
A
VGLTSSWFQHG
K
QKQGAWLD
N
WLQYAWF
S
NDV
----
SE
H
EDG
T
DHYHSSGIIA
L
LEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|6666666.5357.peg.574
Escherichia coli TY-2482 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|585055.6.peg.2516
Escherichia coli 55989 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|585055.8.peg.2521
Escherichia coli 55989 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLSDVTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLV
L
NGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNA
K
FSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
L
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|595495.4.peg.205
Escherichia coli KO11 (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|566546.3.peg.1404
Escherichia coli W (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|566546.4.peg.2409
Escherichia coli W (652-1246/1246)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
T
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYL
A
DVTVN
----
G
D
LTNTSGA
--
VSL
K
NG
V
----------
AGDTLTVNGDYTG
-
GGTLL
F
DSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|481805.6.peg.1513
Escherichia coli ATCC 8739 (634-1234/1234)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PP
E
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
A
P
V
P
V
YQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGTRADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGV
R
GNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|344610.3.peg.3339
Escherichia coli 53638 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|344610.7.peg.4091
Escherichia coli 53638 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|670888.3.peg.4045
Escherichia coli 1827-70 (650-1250/1250)
ADSSGQHQDEGST
L
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
L
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
PDPT
P
--------
DPDPT
P
DPDPTPD
--
P
E
P
T
PAYQPVLN
A
KVGGY
L
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
D
GQTLNLRVIGG
D
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWG
T
DGEWMLG
I
VGGYSDNQG
D
SRS
N
MTGT
C
ADNQNHGY
A
VGLTSSWFQHG
N
QKQGAWLDSWLQYAWF
S
NDV
----
SE
Q
EDG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQTRLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
fig|679206.4.peg.2455
Escherichia coli MS 119-7 (650-1244/1244)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
R
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|656408.3.peg.2480
Escherichia coli H591 (634-1228/1228)
ADSSGQHQDEGST
F
TKTGAGT
-----------------------------
LE
--
LTASGTTQSAVRVEEGTL
K
GDVADI
F
P
-----------------------
-
-------
-
-------
-
-------
YASSL
--------------------------
WVG
D
GATF
V
TGADQ
------
DIQSIDA
I
SSGTIDIS
------------
DGTVLR
L
TGQDTSVALNAS
-------------------
LFN
G
DGTLV
N
AT
D
GVTLTGELN
T
NLE
T
----------------------
DSLTYLS
N
VTVN
----
GNLTNTSGA
--
VSL
Q
NG
V
----------
AGDTLTVNGDYTG
-
GGTLLLDSEL
----
NGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVE
-----
DNNDWYLRSQEVTPP
S
PPD
----
P
--------
DPDPT
P
DPDPTPD
--
-
-
P
I
PAYQPVLN
A
KVGGY
R
NNLR
-----
AANQAFMMER
R
DHAG
-
-
---------
G
N
GQTLNLRVIGG
R
YHYTAA
-
GQLAQHEDTST
-
VQLSGDLFSGRWGDDGEWMLG
A
VGGYSDNQG
E
SRS
N
MTGTRADNQNHGY
A
VGLTSSW
Y
QHG
N
QKQGAWLDSWLQYAWF
N
NDV
----
SE
Q
D
DG
T
DHYHSSGIIASLEAGY
---------------
QWLPGRGVVIEPQAQVIYQGVQQDDFTAAN
R
ARVSQSQGDDIQ
M
RLGLHSEWRT
--------
AV
H
V
I
PTLDLNYYHD
-
PH
S
TEIEEDGSTISDDAV
KQ
RGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQ
M
AGFLSMTVKW
fig|550676.3.peg.683
Escherichia coli B185 (258-955/955)
IK
T
N
G
DNAHGLWSF
G
Q
V
SANAL
T
V
D
V
T
G
A
AANGVE
V
RGGT
T
T
I
G
ADS
H
ISS
A
Q
G
G
G
L
V
ASG
S
D
A
T
I
-----------------------
N
FS
G
TA
A
Q
R
NS
I
FS
GG
S
YGASAQTAT
AV
I
NMQNTDITVDRNGSLALGLWALSGGRIT
G
D
SLAI
T
GA
A
GARGIYAMTNSQ
ID
L
T
S
DLV
ID
M
S
TPDQMAIATQHD
DG
YAAS
R
I
N
ASGR
M
L
I
N
G
S
-------------------
V
L
S
K
G
G
L
I
N
L
D
M
H
PG
SV
WTGS
S
L
S
DN
V
-----------
N
GGKLDVVMN
NS
V
W
NVT
S
NS
N
LD
----
-
T
L
AL
S
H
STV
DF
A
S
H
GS
T
----------
AG
TFT
T
L
N
V
E
NLSG
N
S
T
F
IM
R
ADV
VGEG
NG
V
N
N
KG
D
L
L
N
IS
G
SS
AGN
HV
L
A
I
R
N
---Q
G
SEA
T
TG
N
E
V
L
T
V
VK
-----
TT
D
G
AA
S
F
S
A
S
S
Q
V
E
L
G
G
Y
L
Y
D
V
R
K
-----
N
GT
N
W
E
L
Y
A
SGTI
P
E
P
T
P
N
PEPT
P
APAQPPIV
N
----
-
-
PDPTP
N
PT
P
T
P
K
P
----
TTT
A
D
A
GG
N
Y
L
N
V
GYLLNYVE
N
R
TL
M
Q
R
M
G
D
---
-
-
-----
LRN
Q
S
K
DGN
I
W
LR
SY
GG
S
LDSF
A
S
-
G
K
L
S
GF
D
MG
Y
S
G
I
Q
F
G
GD
---K
R
LS
D
VMPLY
V
G
L
YI
G
------
S
TH
A
S
PDY
S
GG
D
G
TA
R
SD
Y
M
G
M
YA
S
Y
M
A
H
-
-
--N
G
F
Y
S
D
LV
VK
A
S
RQ
K
N
S
FHVLD
S
Q
N
NGV
N
A
N
GT
A
N
G
L
SI
SLEAG
QR
F
N
L
TPT
--------
--
--
G
Y
G
FY
IEPQ
T
Q
L
T
Y
SHQN
E
M
A
MK
A
S
N
G
LN
I
HL---
N
HY
ES
L
LG
RA
S
M
I
LGYDITA
G
--
KS
Q
L
N
IY
V
KTGA
I
H
E
F
S
G
D
TE
YLL
N
N
S
REKYSFK
GN
GWNNG
VGV
S
AQY
N
KQ
H
T
FYLEAD
Y
T
Q
G
-
N
L
F
D
Q
KQ
V
NG
G
YRFS
F
fig|656419.3.peg.1537
Escherichia coli M718 (258-961/961)
IK
T
N
G
DNAHGLWSF
G
Q
V
SANAL
T
V
D
V
T
G
A
AANGVE
V
RGGT
T
T
I
G
ADS
H
ISS
A
Q
G
G
G
L
V
TSG
S
D
A
T
I
-----------------------
N
FS
G
TA
A
Q
R
NS
I
FS
GG
S
YGASAQTAT
AV
I
NMQNTDITVDRNGSLALGLWALSGGRIT
G
D
SLAI
T
GA
A
GARGIYAMTNSQ
ID
L
T
S
DLV
ID
M
S
TPDQMAIATQHD
DG
YAAS
R
I
N
ASGR
M
L
I
N
G
S
-------------------
V
L
S
K
G
G
L
I
N
L
D
M
H
PG
SV
WTGS
S
L
S
DN
V
-----------
N
DGKLDVAMN
NS
V
W
NVT
S
NS
N
LD
----
-
T
L
AL
S
H
STV
DF
A
S
H
AS
T
----------
AG
TFT
T
L
N
V
E
NLSG
N
S
T
F
IM
R
ADV
VGEG
NG
V
N
N
KG
D
L
L
N
IS
G
SS
AGN
HV
L
A
I
R
N
---Q
G
SEA
T
TG
N
E
V
L
T
V
VK
-----
TT
D
G
AA
S
F
S
A
S
S
Q
V
E
L
G
G
Y
L
Y
D
V
R
K
-----
N
GT
N
W
E
L
Y
A
SGTV
P
E
P
T
P
N
PEPT
P
APAQPPIV
N
PDPT
P
E
PDPTP
T
PT
P
T
P
K
P
----
TTT
A
D
A
GG
N
Y
L
N
V
GYLLNYVE
N
R
TL
M
Q
R
M
G
D
---
-
-
-----
LRN
Q
S
K
DGN
I
W
LR
SY
GG
S
LDSF
A
S
-
G
K
L
S
GF
D
MG
Y
S
G
I
Q
F
G
GD
---K
R
LS
D
E
MPLY
V
G
L
YI
G
------
S
TH
A
S
PDY
S
GG
D
G
TA
R
SD
Y
M
G
M
YA
S
Y
M
A
H
-
-
--N
G
F
Y
S
D
LV
VK
A
S
RQ
K
N
S
FHVLD
S
Q
N
NGV
N
A
N
GT
A
N
G
L
SI
SLEAG
QR
F
N
L
TPT
--------
--
--
G
Y
G
FY
IEPQ
T
Q
L
T
Y
SHQN
E
M
A
MK
A
S
N
G
LN
I
HL---
N
HY
ES
L
LG
RA
S
M
I
LGYDITA
G
--
KS
Q
L
N
IY
V
KTGA
I
H
E
F
S
G
D
TE
YLL
N
N
S
REKYSFK
GN
GWNNG
VGV
S
AQY
N
KQ
H
T
FYLEAD
Y
T
Q
G
-
N
L
F
D
Q
KQ
V
NG
G
YRFS
F
fig|550672.3.peg.1503
Escherichia coli B088 (258-961/961)
IK
T
S
G
DNAHGLWSF
G
Q
V
SANAL
T
V
D
VIG
A
AANGVE
V
RGGT
T
T
I
G
ADS
H
ISS
A
Q
G
G
G
L
V
TSG
S
D
A
T
I
-----------------------
N
FS
G
TA
A
Q
R
NS
I
FS
GG
S
YGASAQTAT
AV
I
NMQNTDITVDRNGSLALGLWALSGGRIT
G
D
SLAI
T
GA
A
GARGIYAMTNSQ
ID
L
T
S
DLV
ID
M
S
TPDQMAIATQHD
DG
YAAS
R
I
N
ASGR
M
L
I
N
G
S
-------------------
V
L
S
K
G
G
L
I
N
L
D
M
H
PG
SV
WTGS
S
L
S
DN
V
-----------
N
DGKLDVAMN
NS
V
W
NVT
S
NS
N
LD
----
-
T
L
AL
S
H
STV
DF
A
S
H
AS
T
----------
AG
TFT
T
L
N
V
E
NLSG
N
S
T
F
IM
R
ADV
VGEG
NG
V
N
N
KG
D
L
L
N
IS
G
SS
AGN
HV
L
A
I
R
N
---Q
G
SEA
T
TG
N
E
V
L
T
V
VK
-----
TT
D
G
AA
S
F
S
A
S
S
Q
V
E
L
G
G
Y
L
Y
D
V
R
K
-----
N
GT
N
W
E
L
Y
A
SGTV
P
E
P
T
P
N
PEPT
P
APAQPPIV
N
PDPT
P
E
PDPTP
T
PT
P
T
P
K
P
----
TTT
A
D
A
GG
N
Y
L
N
I
GYLLNYVE
N
R
TL
M
Q
R
M
G
D
---
-
-
-----
LRN
Q
S
K
DGN
I
W
LR
SY
GG
S
LDSF
A
S
-
G
K
L
S
GF
D
MG
Y
S
G
I
Q
F
G
GD
---K
R
LS
D
VMPLY
V
G
L
YI
G
------
S
TH
A
S
PDY
S
GG
D
G
TA
R
SD
Y
M
G
M
YA
S
Y
M
A
H
-
-
--N
G
F
Y
S
D
LV
VK
A
S
RQ
K
N
S
FHVRD
S
Q
N
NGV
N
A
N
GT
A
N
G
L
SI
SLEAG
QR
F
N
L
TPT
--------
--
--
G
Y
G
FY
IEPQ
T
Q
L
T
Y
SHQN
E
M
A
MK
A
S
N
G
LN
I
HL---
N
HY
ES
L
LG
RA
S
M
I
LGYDITA
G
--
KS
Q
L
N
MY
V
KTGA
I
H
E
F
S
G
D
TE
YLL
N
N
S
REKYSFK
GN
GWNNG
VGV
S
AQY
N
T
Q
H
T
FYLEAD
Y
T
Q
G
-
N
L
F
D
Q
KQ
V
NG
G
YRFS
F
Consen1
Primary consensus
ADSSGQHqdeGst
tkTgaGT
-----------------------------
le
--
lTasGttqsAvrveeGTl
gdvadI
p
-----------------------
-------
-------
-------
yassl
--------------------------
wvG
Gatf
tgadq
------
diqsiDa
SsGtidiS
------------
dgtvLr
tgqdtsValnaS
-------------------
lfN
dGtLv
at
gvtlTGelN
nle
----------------------
dsltylsdvTVn
----
gnLTntsga
--
vSl
nG
----------
agdtLTvngdYtG
-
gGtLlldsEL
----
ngDdSvsDqlVmNGNTaGnTtVvvnsitGiGepTstGIkvvdfaadptqfqnNaqFslagsgyvnmGAYdYtLve
-----
dNndWYLrsqevtpp
ppd
p
--------
dpdpt
dPdptPd
--
p
p
payqpVLn
kvGgY
nNLr
-----
AANqaFmMer
DhAG
-
---------
g
gqtLnLRvIGG
yhytaa
-
GQLaqhedtst
-
vQLsgdLfsGrwgddgeWmLG
vgGYsdnqg
srSsmtgtRadnqnhGY
vGLtssWfqhg
qKqGAwlDsWlQYaWF
NdV
----
se
edg
dhYhssGiiaSLEAGY
---------------
qWlpgrgvviePQAQvIyqGVqqddfTaAN
aRvsqsqgddIQTRLGlhsewRT
--------
av
v
PtldlNyyHd
-
ph
teieedgstisdDav
rGEIKvGVtGNisqrvSlrGsVawQkGsddfaqTAgflsmtvKw
Consen2
Secondary consensus
mak
ei
--
th
esyaayangtvvkagdtldytnasvtltdvd
i
th
dnah
iaarq
v
fnqge
-
ttgpdaa
akiyngg
vtlkn
tsava
hq
ivl
ssing
qeatv
i
-
sslr
anei
y
knetsn
titd
evssaadvf
in
k
h
t
da
nski
sa
std
nthtylslsdnstwdikans
s
-
vdnstvyi
r
d
dve
ptr
-
iten
v
nn
v
hfrt
gd
n
at
kv
s
t
r
kitnag
s
ay
ln
eiisveg
-----
es
g
ik
--
dsrifa
e
s
trgnteat
kn
tn
------
qat
t
nsggs
a
tva
t
-
-
-----
r
ea
s
a
a
tl
v
rl
r
e
ryidpvteqe
ssr
w
q
nawrdsn
rttshryv
s
gae
lt
gfttsds
r
ma
ardyn
th
nvsdy
skgsvr
a
yat
yadd
s
k
yi
a
s
s
kg
dla
es
sak
atv
gfalnksfgleaakyt
------
ifq
a
wm
dhnah
e
s
iendannn
frtfi
qeknsgphgd
f
fvem
wi
n
sk
favsmnavkveq
ga
l
l
n
lnpaa
vw
n
gv
l
dngynd
vmvglky
f
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character