fig|1040638.4.peg.4742
Escherichia coli O104:H4 str. LB226692
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGN
T
GAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|6666666.5357.peg.773
Escherichia coli TY-2482
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGN
T
GAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|585055.8.peg.956
Escherichia coli 55989
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGN
T
GAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|344601.3.peg.2709
Escherichia coli B171
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340185.3.peg.3053
Escherichia coli E22
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340185.4.peg.3218
Escherichia coli E22
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749545.3.peg.4319
Escherichia coli MS 182-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749531.3.peg.2922
Escherichia coli MS 69-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749532.3.peg.782
Escherichia coli MS 78-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|585395.4.peg.987
Escherichia coli O103:H2 str. 12009
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656444.3.peg.1511
Escherichia coli TA280
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|216592.1.peg.227
Escherichia coli 042
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
V
LTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340184.3.peg.2686
Escherichia coli B7A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340184.6.peg.2809
Escherichia coli B7A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.375.peg.4543
Escherichia coli EC4100B
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|595495.4.peg.3357
Escherichia coli KO11
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|679204.3.peg.2432
Escherichia coli MS 145-7
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|585396.4.peg.1002
Escherichia coli O111:H- str. 11128
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|566546.3.peg.1883
Escherichia coli W
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|566546.4.peg.992
Escherichia coli W
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656379.3.peg.1743
Escherichia coli FVEC1302
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MQ
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656380.3.peg.1571
Escherichia coli FVEC1412
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MQ
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749549.3.peg.5326
Escherichia coli MS 198-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MQ
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|585056.7.peg.1282
Escherichia coli UMN026
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MQ
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|331111.3.peg.3466
Escherichia coli E24377A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMT
E
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|358709.5.peg.3408
Escherichia coli 101-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|670888.3.peg.1509
Escherichia coli 1827-70
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|344610.3.peg.3941
Escherichia coli 53638
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|344610.7.peg.770
Escherichia coli 53638
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|481805.3.peg.2902
Escherichia coli ATCC 8739
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|550676.3.peg.362
Escherichia coli B185
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|536056.3.peg.2902
Escherichia coli DH1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|316401.4.peg.1107
Escherichia coli ETEC H10407
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656414.3.peg.1101
Escherichia coli H736
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|331112.3.peg.927
Escherichia coli HS
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|331112.6.peg.966
Escherichia coli HS
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|83333.1.peg.879
Escherichia coli K12
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749538.3.peg.2886
Escherichia coli MS 116-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749540.3.peg.2700
Escherichia coli MS 146-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749544.3.peg.632
Escherichia coli MS 175-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749547.3.peg.1841
Escherichia coli MS 187-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749548.3.peg.2260
Escherichia coli MS 196-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|316407.3.peg.860
Escherichia coli W3110
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|511145.12.peg.924
Escherichia coli str. K-12 substr. MG1655
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|679206.4.peg.3406
Escherichia coli MS 119-7
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656443.3.peg.1179
Escherichia coli TA271
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656437.3.peg.1030
Escherichia coli TA143
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPD
D
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|679205.4.peg.1027
Escherichia coli MS 124-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITGIPA
E
KII
Q
LAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGN
T
GAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749533.3.peg.1023
Escherichia coli MS 84-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITGIPA
E
KII
Q
LAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGN
T
GAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656417.3.peg.1134
Escherichia coli M605
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EK
I
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656393.3.peg.1656
Escherichia coli H299
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
L
P
AKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749527.3.peg.2432
Escherichia coli MS 21-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
L
P
AKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|439855.10.peg.2382
Escherichia coli SMS-3-5
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
L
P
AKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340186.3.peg.3976
Escherichia coli E110019
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
L
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|679207.4.peg.2010
Escherichia coli MS 107-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITGIPA
E
KII
Q
LAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|409438.11.peg.1082
Escherichia coli SE11
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITGIPA
E
KII
Q
LAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.371.peg.2141
Escherichia coli 1044A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.373.peg.1124
Escherichia coli 1125A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.372.peg.990
Escherichia coli 1212A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.374.peg.2523
Escherichia coli 536A
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749537.3.peg.4337
Escherichia coli MS 115-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
G
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|83334.1.peg.1047
Escherichia coli O157:H7
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444454.5.peg.5475
Escherichia coli O157:H7 str. EC4024
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444449.5.peg.4408
Escherichia coli O157:H7 str. EC4042
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444448.5.peg.3684
Escherichia coli O157:H7 str. EC4045
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444453.5.peg.5057
Escherichia coli O157:H7 str. EC4076
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444452.5.peg.4998
Escherichia coli O157:H7 str. EC4113
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444450.8.peg.1141
Escherichia coli O157:H7 str. EC4115
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444451.5.peg.3774
Escherichia coli O157:H7 str. EC4196
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|444447.5.peg.3891
Escherichia coli O157:H7 str. EC4206
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|478004.5.peg.3678
Escherichia coli O157:H7 str. EC4401
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|478006.5.peg.3407
Escherichia coli O157:H7 str. EC4501
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|478007.5.peg.4035
Escherichia coli O157:H7 str. EC508
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|478008.5.peg.2563
Escherichia coli O157:H7 str. EC869
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|637388.3.peg.4472
Escherichia coli O157:H7 str. FRIK2000
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|570506.3.peg.2034
Escherichia coli O157:H7 str. FRIK966
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|386585.9.peg.1096
Escherichia coli O157:H7 str. Sakai
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|544404.4.peg.1007
Escherichia coli O157:H7 str. TW14359
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|502346.5.peg.193
Escherichia coli O157:H7 str. TW14588
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|701177.3.peg.1112
Escherichia coli O55:H7 str. CB9615
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSG
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656419.3.peg.1188
Escherichia coli M718
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AK
S
PEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
H
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656408.3.peg.952
Escherichia coli H591
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
A
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTP
Q
WASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
E
K
P
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|869729.3.peg.2789
Escherichia coli UM146
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|655817.3.peg.985
Escherichia coli ABU 83972
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|199310.1.peg.1003
Escherichia coli CFT073
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749528.3.peg.1712
Escherichia coli MS 45-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|753642.3.peg.709
Escherichia coli NC101
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|685038.3.peg.828
Escherichia coli O83:H1 str. NRG 857C
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|656440.3.peg.808
Escherichia coli TA206
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|562.376.peg.4705
Escherichia coli WV_060327
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
K
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|362663.8.peg.918
Escherichia coli 536
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QE
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|362663.9.peg.918
Escherichia coli 536
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QE
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340197.3.peg.2200
Escherichia coli F11
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QE
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|340197.5.peg.2314
Escherichia coli F11
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QE
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749550.3.peg.2131
Escherichia coli MS 200-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QE
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|749546.3.peg.1788
Escherichia coli MS 185-1
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
S
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
H
I
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
D
I
IA
T
K
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYL
X
QAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTLPA
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
V
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
S
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
T
K
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|670897.3.peg.4690
Escherichia coli 2362-75
MKTKIPDAV
LA
AE
V
SRR
G
LVKTTAIG
G
LA
M
ASSA
L
TLPFS
RIA
H
A
V
D
R
A
-----
IP
TKSD
EKV
I
WSACTVNCGSRC
P
LR
M
HV
V
D
G
E
I
K
Y
VET
--
DNTGDD
N
Y
D
G
L
HQVRACLRGRS
M
RRR
V
YN
PDRLKYPMKRVG
A
RGEGKFERISW
E
EA
Y
G
I
IA
T
N
M
Q
R
L
I
K
E
YGNE
SI
Y
L
N
YGTG
TL
GG
T
MTRS
W
P
PG
N
T
L
V
A
RLMNCCGG
Y
LN
H
YG
D
YS
S
AQI
A
E
GL
N
YTYG
G
WA
DGNSP
S
DIENSKLVV
L
FGNNP
G
ETRMSGGGVTYYLEQAR
Q
KSNARMI
I
IDPRYTDT
G
AGREDEW
I
PIRPGTDAALV
N
G
L
A
Y
V
M
ITENLVDQ
E
FLDKYCVGYDEKTL
S
A
S
APKNGHYKAYILGEGPDG
V
AKTPEWASQITG
V
PADKIIKLAREIGS
T
KPA
F
I
S
QGWGPQRHANGE
I
A
T
RAI
S
ML
A
ILTGNVGINGGNSGAREGSY
S
L
PF
V
R
M
P
T
L
-
ENP
IQ
TSIS
M
F
M
WTDAI
ER
G
P
EMTA
L
RDGVRGKDKLDVPIK
M
IWNYAGN
C
LINQHS
E
IN
R
THEILQDD
K
KCE
L
IV
A
ID
C
H
MTSSAKYADILLPD
CT
AS
EQ
M
D
F
A
L
D
A
S
C
GNM
S
YVIF
N
DQ
V
IK
P
R
FE
C
K
T
IY
E
M
T
SE
L
AKRLG
--
V
E
Q
Q
FTEGRTQEEW
MR
HLYA
Q
SR
E
AI
PE
--
LP
T
F
EE
F
R
K
Q
--
GIFKK
R
DP
Q
GH
H
VAYKAFR
E
DPQANPL
T
TPSGKIEIYS
QA
LADIA
A
TWEL
P
EGD
VI
D
PLP
I
YTPGFE
SY
Q
DPL
N
KQ
YPLQLTGFHYKSR
V
HST
-
YGN
V
DVLKAACRQE
M
WINP
L
DAQKRGI
N
NGD
K
VR
I
FN
D
RGEV
H
I
E
AKVTPR
MM
PGV
V
A
L
G
E
GAW
Y
D
P
D
-
-
AK
RVD
K
GGCIN
V
LTT
Q
RPSPLAKGNPSHTNLVQ
V
EKV
fig|670897.3.peg.2337
Escherichia coli 2362-75 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
H
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|216593.1.peg.355
Escherichia coli E2348/69 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
H
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|574521.7.peg.1708
Escherichia coli O127:H6 str. E2348/69 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
H
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|550672.3.peg.2427
Escherichia coli B088 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|753642.3.peg.1576
Escherichia coli NC101 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
A
D
G
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|525281.3.peg.976
Escherichia coli 83972 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|655817.3.peg.1868
Escherichia coli ABU 83972 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|405955.13.peg.1670
Escherichia coli APEC O1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|405955.9.peg.1355
Escherichia coli APEC O1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|199310.1.peg.1915
Escherichia coli CFT073 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|199310.4.peg.1846
Escherichia coli CFT073 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749546.3.peg.3879
Escherichia coli MS 185-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749528.3.peg.3857
Escherichia coli MS 45-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585035.6.peg.1625
Escherichia coli S88 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|364106.7.peg.1817
Escherichia coli UTI89 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|364106.8.peg.1820
Escherichia coli UTI89 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|679205.4.peg.1617
Escherichia coli MS 124-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
I
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749533.3.peg.4939
Escherichia coli MS 84-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
I
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|562.375.peg.1686
Escherichia coli EC4100B (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749532.3.peg.1355
Escherichia coli MS 78-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|409438.11.peg.1855
Escherichia coli SE11 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|340184.3.peg.1015
Escherichia coli B7A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|340184.6.peg.1050
Escherichia coli B7A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|679204.3.peg.504
Escherichia coli MS 145-7 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585034.4.peg.1621
Escherichia coli IAI1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585034.5.peg.1616
Escherichia coli IAI1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|685038.3.peg.1602
Escherichia coli O83:H1 str. NRG 857C (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
E
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
KQYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEK
N
LPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SA
F
PLQL
F
GFHYKSRTHST
-
YGN
V
DVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656440.3.peg.4366
Escherichia coli TA206 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAN
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
V
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
SI
F
PLQL
F
GFHYKSR
A
HST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEKV
fig|340186.3.peg.17
Escherichia coli E110019 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
L
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|340186.5.peg.18
Escherichia coli E110019 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
A
T
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
L
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|439855.10.peg.1773
Escherichia coli SMS-3-5 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
SS
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GN
Q
QVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
E
DP
E
ANPLKTPSGKIEIYS
NR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585395.4.peg.1810
Escherichia coli O103:H2 str. 12009 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|344601.3.peg.331
Escherichia coli B171 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|344601.5.peg.329
Escherichia coli B171 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|550676.3.peg.1891
Escherichia coli B185 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
H
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKL
E
VPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|595495.4.peg.2853
Escherichia coli KO11 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|566546.3.peg.1066
Escherichia coli W (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|566546.4.peg.1739
Escherichia coli W (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|595496.3.peg.1547
Escherichia coli BW2952 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|536056.3.peg.2167
Escherichia coli DH1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656414.3.peg.1889
Escherichia coli H736 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|83333.1.peg.1573
Escherichia coli K12 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749538.3.peg.4136
Escherichia coli MS 116-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749540.3.peg.4220
Escherichia coli MS 146-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749548.3.peg.4734
Escherichia coli MS 196-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|316407.3.peg.1544
Escherichia coli W3110 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|316385.5.peg.1699
Escherichia coli str. K-12 substr. DH10B (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|316385.7.peg.1744
Escherichia coli str. K-12 substr. DH10B (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|511145.12.peg.1658
Escherichia coli str. K-12 substr. MG1655 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|511145.6.peg.1642
Escherichia coli str. K-12 substr. MG1655 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|481805.3.peg.2201
Escherichia coli ATCC 8739 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
A
S
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|481805.6.peg.2194
Escherichia coli ATCC 8739 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
A
S
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656408.3.peg.1726
Escherichia coli H591 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|679206.4.peg.1318
Escherichia coli MS 119-7 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656379.3.peg.2055
Escherichia coli FVEC1302 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGP
E
VYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656380.3.peg.3018
Escherichia coli FVEC1412 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGP
E
VYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749549.3.peg.5227
Escherichia coli MS 198-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGP
E
VYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585056.7.peg.2059
Escherichia coli UMN026 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGP
E
VYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656437.3.peg.1798
Escherichia coli TA143 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|331111.12.peg.2081
Escherichia coli E24377A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|331111.3.peg.4224
Escherichia coli E24377A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
I
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656417.3.peg.2077
Escherichia coli M605 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
K
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGP
H
VYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|431946.3.peg.1505
Escherichia coli SE15 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
K
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749531.3.peg.4805
Escherichia coli MS 69-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
E
NPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|550677.3.peg.2969
Escherichia coli B354 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
D
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749527.3.peg.4301
Escherichia coli MS 21-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NG
N
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656393.3.peg.2375
Escherichia coli H299 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|1040638.4.peg.3633
Escherichia coli O104:H4 str. LB226692 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDV
S
IKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585055.6.peg.1782
Escherichia coli 55989 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDV
S
IKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585055.8.peg.1786
Escherichia coli 55989 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDV
S
IKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|316401.4.peg.1867
Escherichia coli ETEC H10407 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
N
A
A
AA
-----
V
-
QQAR
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPANAP
Q
NGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
E
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KD
L
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|6666666.5357.peg.2358
Escherichia coli TY-2482 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKV
I
W
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVG
T
RGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
N
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDV
S
IKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
P
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|670888.3.peg.74
Escherichia coli 1827-70 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKF
K
RISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPANAPKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
K
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|331112.3.peg.1563
Escherichia coli HS (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
K
MTA
I
RDGVRGKDKLDVPIKFIWN
F
AGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
I
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|331112.6.peg.1630
Escherichia coli HS (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
K
MTA
I
RDGVRGKDKLDVPIKFIWN
F
AGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
I
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|562.371.peg.4430
Escherichia coli 1044A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|562.372.peg.4662
Escherichia coli 1212A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|562.374.peg.4447
Escherichia coli 536A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|83334.1.peg.2322
Escherichia coli O157:H7 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|155864.1.peg.2330
Escherichia coli O157:H7 EDL933 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|155864.8.peg.2169
Escherichia coli O157:H7 EDL933 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444454.5.peg.5571
Escherichia coli O157:H7 str. EC4024 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444449.5.peg.517
Escherichia coli O157:H7 str. EC4042 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444448.5.peg.4825
Escherichia coli O157:H7 str. EC4045 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444453.5.peg.4690
Escherichia coli O157:H7 str. EC4076 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444452.5.peg.3536
Escherichia coli O157:H7 str. EC4113 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444450.8.peg.2337
Escherichia coli O157:H7 str. EC4115 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444451.5.peg.3357
Escherichia coli O157:H7 str. EC4196 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|444447.5.peg.5164
Escherichia coli O157:H7 str. EC4206 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|478004.5.peg.4148
Escherichia coli O157:H7 str. EC4401 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|478006.5.peg.4087
Escherichia coli O157:H7 str. EC4501 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|478007.5.peg.3501
Escherichia coli O157:H7 str. EC508 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|478008.5.peg.4497
Escherichia coli O157:H7 str. EC869 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|637388.3.peg.5057
Escherichia coli O157:H7 str. FRIK2000 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|570506.3.peg.4176
Escherichia coli O157:H7 str. FRIK966 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|386585.9.peg.2401
Escherichia coli O157:H7 str. Sakai (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|544404.4.peg.2200
Escherichia coli O157:H7 str. TW14359 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|502346.5.peg.5112
Escherichia coli O157:H7 str. TW14588 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749531.3.peg.4806
Escherichia coli MS 69-1 (2-808/808)
MK
IH
I
TE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
E
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|413997.3.peg.1631
Escherichia coli B str. REL606 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
E
D
LVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGP
K
R
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|511693.5.peg.1656
Escherichia coli BL21 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
E
D
LVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGP
K
R
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|469008.4.peg.2118
Escherichia coli BL21(DE3) (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
E
D
LVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGP
K
R
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|216592.1.peg.2169
Escherichia coli 042 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHY
E
AYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
VER
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GG
S
SGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|216592.3.peg.1805
Escherichia coli 042 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHY
E
AYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
VER
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GG
S
SGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|701177.3.peg.1978
Escherichia coli O55:H7 str. CB9615 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQK
C
GI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656444.3.peg.2478
Escherichia coli TA280 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALD
I
I
S
D
NL
R
R
I
LK
E
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KI
S
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.373.peg.5332
Escherichia coli 1125A (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
E
Q
VVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
A
WVET
--
DNTG
S
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGN
I
TRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
S
REKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEIL
L
D
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656437.3.peg.1799
Escherichia coli TA143 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
E
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|6666666.5357.peg.2359
Escherichia coli TY-2482 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|344601.3.peg.332
Escherichia coli B171 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679205.4.peg.1618
Escherichia coli MS 124-1 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|481805.3.peg.2200
Escherichia coli ATCC 8739 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|550672.3.peg.2428
Escherichia coli B088 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|340186.3.peg.16
Escherichia coli E110019 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|331111.12.peg.2082
Escherichia coli E24377A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|331111.3.peg.4225
Escherichia coli E24377A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|316401.4.peg.1868
Escherichia coli ETEC H10407 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656408.3.peg.1727
Escherichia coli H591 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656408.3.peg.1728
Escherichia coli H591
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|83333.1.peg.1574
Escherichia coli K12 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679207.4.peg.4677
Escherichia coli MS 107-1 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749540.3.peg.4221
Escherichia coli MS 146-1 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|316407.3.peg.1545
Escherichia coli W3110
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|316385.5.peg.1700
Escherichia coli str. K-12 substr. DH10B
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|511145.12.peg.1659
Escherichia coli str. K-12 substr. MG1655 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|216592.1.peg.2170
Escherichia coli 042 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679204.3.peg.505
Escherichia coli MS 145-7 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EW
V
K
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585396.4.peg.2137
Escherichia coli O111:H- str. 11128 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EW
V
K
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|358709.5.peg.1102
Escherichia coli 101-1 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
L
C
LHVKD
N
EV
I
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
E
D
LVDQ
P
FLDKYCVGYDEKTLPA
D
APKNGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAY
T
CQGWGP
K
R
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
ST
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
W
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SR
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGW
N
S
P
E
R
RT
F
PLQL
F
GFHYKSRTHST
-
YGNID
L
LKAACRQEVWINPIDAQKRGI
A
NGD
M
VRVFN
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|749545.3.peg.5107
Escherichia coli MS 182-1 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749532.3.peg.1356
Escherichia coli MS 78-1 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.375.peg.1687
Escherichia coli EC4100B (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|331112.3.peg.1564
Escherichia coli HS (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|409438.11.peg.1856
Escherichia coli SE11 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585057.4.peg.1537
Escherichia coli IAI39 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
AP
I
NGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
L
T
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
R
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|585057.6.peg.1537
Escherichia coli IAI39 (10-808/808)
ISRR
T
LVK
S
TAIGSLALA
AGG
F
S
LPF
T
--L
R
S
A
A
AA
-----
V
-
QQAS
EKVVW
G
AC
S
VNCGSRC
A
LRLHVKD
N
EV
T
WVET
--
DNTG
N
D
E
Y
-
GNHQVRACLRGRSIRRR
I
NHPDRL
N
YPMKRVGKRGEGKFERISWDEALDTIA
S
S
LK
K
T
V
E
QYGNEAVY
I
Q
Y
SS
G
IV
GGNMTRS
S
P
SA
S
A
-
V
K
RLMNC
Y
GG
S
LN
Q
YGSYSTAQIS
C
AM
P
YTYG
S
-
NDGNS
T
T
DIENSKLVVMFGNNPAETRMSGGG
I
TY
L
LE
K
AREKSNA
K
MIVIDPRYTDTAAGREDEWLPIRPGTDAALV
A
GIAWVLI
N
ENLVDQ
P
FLDKYCVGYDEKTLPA
D
AP
I
NGHYKAYILGEG
D
D
K
T
AKTP
Q
WASQITGIP
V
D
R
IIKLAREIG
T
AKPAYICQGWGPQR
Q
ANGE
L
TARAIAML
P
ILTGNVGI
S
GGNSGARE
L
T
Y
T
I
TI
ER
L
P
V
L
-
D
NPVKTSIS
C
F
S
WTDAIDHG
P
Q
MTA
I
RDGVRGKDKLDVPIKFIWNYAGNTL
V
NQHSDIN
K
THEILQD
E
SKCEMIV
V
I
E
N
FMTSSAKYADILLPDLM
TV
EQED
I
I
P
N
D
Y
AGNMGY
L
IF
L
QPVTS
E
KFERKPIYW
I
LSEVAKRLGPDVYQ
K
FTEGRTQE
Q
R
LQ
HLYAK
ML
A
KD
P
A
--
LP
S
Y
D
E
L
KK
M
--
GI
Y
K
R
KDP
N
GH
F
VAYKAFR
D
DP
E
ANPLKTPSGKIEIYS
SK
LA
E
IA
R
TWEL
E
KDEVI
S
PLP
V
Y
AST
FEGWD
S
P
E
R
ST
F
PLQL
F
GFHYKSRTHST
-
YGNIDVLKAACRQEVWINPIDAQKRGI
A
NGD
M
VRV
Y
N
H
RGEV
R
L
P
AKVTPRILPGV
S
AMGQGAW
H
E
A
N
M
S
GD
KI
DHGGC
V
N
T
LTT
L
RPSPLAKGNP
Q
HTNLV
E
IEK
I
fig|656414.3.peg.1890
Escherichia coli H736 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
K
MTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|340184.3.peg.1014
Escherichia coli B7A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
R
VE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EW
V
K
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656379.3.peg.2056
Escherichia coli FVEC1302 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656380.3.peg.3019
Escherichia coli FVEC1412 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585056.7.peg.2060
Escherichia coli UMN026 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656393.3.peg.2376
Escherichia coli H299
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALD
I
I
S
D
NL
R
R
I
LK
E
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|340185.3.peg.3274
Escherichia coli E22 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGW
R
PQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|199310.1.peg.1916
Escherichia coli CFT073 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585057.4.peg.1536
Escherichia coli IAI39 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
A
IN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYW
I
LSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585057.6.peg.1536
Escherichia coli IAI39 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
A
IN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYW
I
LSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.373.peg.5333
Escherichia coli 1125A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.372.peg.4663
Escherichia coli 1212A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.374.peg.4446
Escherichia coli 536A (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|83334.1.peg.2323
Escherichia coli O157:H7 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444454.5.peg.5572
Escherichia coli O157:H7 str. EC4024
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444449.5.peg.518
Escherichia coli O157:H7 str. EC4042 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444448.5.peg.4826
Escherichia coli O157:H7 str. EC4045 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444453.5.peg.4691
Escherichia coli O157:H7 str. EC4076 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444452.5.peg.3535
Escherichia coli O157:H7 str. EC4113 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444450.8.peg.2338
Escherichia coli O157:H7 str. EC4115 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444451.5.peg.3358
Escherichia coli O157:H7 str. EC4196 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|444447.5.peg.5163
Escherichia coli O157:H7 str. EC4206 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478004.5.peg.4147
Escherichia coli O157:H7 str. EC4401 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478005.5.peg.5248
Escherichia coli O157:H7 str. EC4486
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478007.5.peg.3502
Escherichia coli O157:H7 str. EC508
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478008.5.peg.4498
Escherichia coli O157:H7 str. EC869
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|637388.3.peg.5056
Escherichia coli O157:H7 str. FRIK2000 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|570506.3.peg.4177
Escherichia coli O157:H7 str. FRIK966 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|386585.9.peg.2402
Escherichia coli O157:H7 str. Sakai (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|544404.4.peg.2201
Escherichia coli O157:H7 str. TW14359 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|502346.5.peg.5111
Escherichia coli O157:H7 str. TW14588 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585395.4.peg.1811
Escherichia coli O103:H2 str. 12009 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWL
L
IRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|655817.3.peg.1869
Escherichia coli ABU 83972 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
I
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|405955.9.peg.1356
Escherichia coli APEC O1 (2-808/808)
MK
IH
N
TE
A
L
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656437.3.peg.1800
Escherichia coli TA143
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
E
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|1040638.4.peg.3634
Escherichia coli O104:H4 str. LB226692
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585055.6.peg.1783
Escherichia coli 55989
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585055.8.peg.1787
Escherichia coli 55989
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|344601.5.peg.330
Escherichia coli B171
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679205.4.peg.1619
Escherichia coli MS 124-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749533.3.peg.4938
Escherichia coli MS 84-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|358709.5.peg.1103
Escherichia coli 101-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|481805.6.peg.2193
Escherichia coli ATCC 8739
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|413997.3.peg.1632
Escherichia coli B str. REL606
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|550676.3.peg.1892
Escherichia coli B185
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|511693.5.peg.1657
Escherichia coli BL21
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|469008.4.peg.2117
Escherichia coli BL21(DE3)
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|595496.3.peg.1548
Escherichia coli BW2952
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|536056.3.peg.2166
Escherichia coli DH1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|340186.5.peg.17
Escherichia coli E110019
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|316401.4.peg.1869
Escherichia coli ETEC H10407
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|595495.4.peg.2852
Escherichia coli KO11
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679207.4.peg.4678
Escherichia coli MS 107-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749538.3.peg.4137
Escherichia coli MS 116-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|679206.4.peg.1317
Escherichia coli MS 119-7
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656443.3.peg.2236
Escherichia coli TA271
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|566546.3.peg.1065
Escherichia coli W
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|566546.4.peg.1740
Escherichia coli W
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|316385.7.peg.1745
Escherichia coli str. K-12 substr. DH10B
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|511145.6.peg.1643
Escherichia coli str. K-12 substr. MG1655
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|216592.3.peg.1806
Escherichia coli 042
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|550677.3.peg.2970
Escherichia coli B354
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|573235.3.peg.2330
Escherichia coli O26:H11 str. 11368
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EW
V
K
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749532.3.peg.1357
Escherichia coli MS 78-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|670888.3.peg.75
Escherichia coli 1827-70
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.375.peg.1688
Escherichia coli EC4100B
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|331112.6.peg.1631
Escherichia coli HS
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|216593.1.peg.356
Escherichia coli E2348/69 (2-808/808)
MK
IH
N
TE
A
L
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEG
T
D
S
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656414.3.peg.1891
Escherichia coli H736
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
K
MTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749548.3.peg.4735
Escherichia coli MS 196-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEV
V
KRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749537.3.peg.2398
Escherichia coli MS 115-1
MKAEISRR
N
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
I
I
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
V
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585034.4.peg.1622
Escherichia coli IAI1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQ
N
DSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585034.5.peg.1617
Escherichia coli IAI1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQ
N
DSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749549.3.peg.5226
Escherichia coli MS 198-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
V
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
H
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749527.3.peg.4300
Escherichia coli MS 21-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYW
I
LSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFN
T
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|701177.3.peg.1979
Escherichia coli O55:H7 str. CB9615
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
T
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|340185.4.peg.3450
Escherichia coli E22
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGW
R
PQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
K
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|199310.4.peg.1847
Escherichia coli CFT073
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749546.3.peg.3878
Escherichia coli MS 185-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|749528.3.peg.3856
Escherichia coli MS 45-1
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|405955.13.peg.1671
Escherichia coli APEC O1
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|714962.3.peg.1739
Escherichia coli IHE3034
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585035.6.peg.1626
Escherichia coli S88
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|869729.3.peg.1959
Escherichia coli UM146
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|364106.7.peg.1818
Escherichia coli UTI89
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|364106.8.peg.1821
Escherichia coli UTI89
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656417.3.peg.2078
Escherichia coli M605 (2-808/808)
MK
I
YN
TE
A
L
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
S
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYW
I
LSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
GT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|656417.3.peg.2079
Escherichia coli M605
MK
I
YN
TE
A
L
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
S
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYW
I
LSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
T
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
GT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.371.peg.4431
Escherichia coli 1044A
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|562.372.peg.4664
Escherichia coli 1212A
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478006.5.peg.4086
Escherichia coli O157:H7 str. EC4501
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|525281.3.peg.975
Escherichia coli 83972
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
I
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|655817.3.peg.1870
Escherichia coli ABU 83972
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
I
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|670897.3.peg.2336
Escherichia coli 2362-75
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEG
T
D
S
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585397.7.peg.1776
Escherichia coli ED1a
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEG
T
D
S
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|585397.9.peg.1771
Escherichia coli ED1a
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEG
T
D
S
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|574521.7.peg.1709
Escherichia coli O127:H6 str. E2348/69
M
S
AEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEG
T
D
S
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|155864.1.peg.2331
Escherichia coli O157:H7 EDL933 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
X
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
X
YVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
X
PSPL
X
KGNPSH
S
NLVQIEKV
fig|155864.8.peg.2170
Escherichia coli O157:H7 EDL933 (2-808/808)
MK
IHTTE
A
L
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
E
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLI
S
EN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LAREIGSAKPAYICQGWGPQRH
S
NGE
Q
T
X
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
F
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
T
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
T
N
E
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
PE
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
X
YVA
FR
AFR
E
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
X
PSPL
X
KGNPSH
S
NLVQIEKV
fig|753642.3.peg.1575
Escherichia coli NC101
MKAEISRR
S
L
M
KT
S
A
L
GSLALASSAFTLPFS
QMV
RAA
Q
A
P
-----
V
-
E---
EK
A
VWS
S
CTVNCGSRC
L
LRLHVKD
D
T
V
Y
WVE
S
--
D
T
TGDD
V
Y
-
GNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTI
S
D
NL
R
R
I
LK
D
YGNEAV
H
V
L
YGTG
VD
GGN
I
T
N
S
-
-
--
N
V
-
P
Y
RLMN
S
CGG
F
L
S
R
YGSYSTAQIS
A
AM
S
Y
MF
G
A
-
NDGNSP
D
DI
A
N
T
KLVVMFGNNPAETRMSGGGVTYY
V
EQARE
R
SNARMIVIDPRY
N
DTAAGREDEWLPIRPGTD
G
AL
A
C
A
IAWVLITEN
M
VDQ
P
FLDKYCVGYDEKTLPANAP
R
N
A
HYKAYILGEGPDG
I
AKTPEWA
AK
IT
S
IPA
E
KII
Q
LA
L
EIGSAKPAYICQGWGPQRH
S
NGE
Q
T
S
RAIAML
S
V
LTGNVGINGGNSG
V
REGS
W
D
L
GV
E
W
L
P
M
L
-
ENPVKT
Q
IS
V
F
T
WTDAIDHG
T
EMTA
P
RDGVRGK
E
KLDVPIKF
L
W
C
YA
S
NTLINQH
G
DIN
H
THE
V
LQDDSKCEMIV
G
ID
H
FMT
A
SAKY
C
DILLPDLM
PT
EQED
L
I
S
H
E
SAGNMGYVI
L
A
QP
A
TS
A
KFERKPIYWMLSEVAKRLGPDVYQ
T
FTEGR
S
Q
H
EWIK
Y
L
H
AK
TK
E
RN
S
E
--
M
P
D
YEE
M
K
T
T
--
GIFKKK
C
P
E
E
HYVA
FR
AFR
V
DPQANPLKTPSGKIEIYS
ER
LA
K
IA
D
TWEL
K
KDE
I
I
H
PLP
A
YTPGF
D
GWDDPLR
KT
YPLQLTGFHYK
A
RTHS
S
-
YGNIDVL
QQ
AC
P
QEVWINPIDAQ
A
RGI
R
H
GD
T
VRVFNN
N
GE
M
L
I
A
AKVTPRILPGV
T
A
I
GQGAW
L
KADM
F
GDRVDHGG
S
IN
I
LT
S
H
RPSPLAKGNPSH
S
NLVQIEKV
fig|478008.5.peg.3072
Escherichia coli O157:H7 str. EC869 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
K
L
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|637388.3.peg.75
Escherichia coli O157:H7 str. FRIK2000 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
K
L
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|570506.3.peg.5258
Escherichia coli O157:H7 str. FRIK966 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
K
L
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|155864.1.peg.3383
Escherichia coli O157:H7 EDL933 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VX
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|562.373.peg.3712
Escherichia coli 1125A (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|83334.1.peg.3383
Escherichia coli O157:H7 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444454.5.peg.2406
Escherichia coli O157:H7 str. EC4024 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444449.5.peg.1872
Escherichia coli O157:H7 str. EC4042 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444448.5.peg.626
Escherichia coli O157:H7 str. EC4045 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444453.5.peg.107
Escherichia coli O157:H7 str. EC4076 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444452.5.peg.1484
Escherichia coli O157:H7 str. EC4113 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444450.8.peg.3715
Escherichia coli O157:H7 str. EC4115 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444451.5.peg.1254
Escherichia coli O157:H7 str. EC4196 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|444447.5.peg.767
Escherichia coli O157:H7 str. EC4206 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|478004.5.peg.3027
Escherichia coli O157:H7 str. EC4401 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|478005.5.peg.2597
Escherichia coli O157:H7 str. EC4486 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|478006.5.peg.1636
Escherichia coli O157:H7 str. EC4501 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|478007.5.peg.1680
Escherichia coli O157:H7 str. EC508 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|386585.9.peg.3535
Escherichia coli O157:H7 str. Sakai (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|544404.4.peg.3518
Escherichia coli O157:H7 str. TW14359 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|502346.5.peg.3897
Escherichia coli O157:H7 str. TW14588 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|550676.3.peg.1721
Escherichia coli B185 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
Y
F
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
K
L
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
V
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|562.372.peg.1935
Escherichia coli 1212A (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
I
PRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|562.374.peg.3138
Escherichia coli 536A (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
I
PRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|155864.8.peg.3293
Escherichia coli O157:H7 EDL933 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VX
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
X
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
DG
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
KP
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
RGI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
fig|701177.3.peg.3080
Escherichia coli O55:H7 str. CB9615 (14-792/793)
QV
SRR
S
F
LQ
A
T
--------
S
A
LI
TLPF
I
SST
AK
A
Q
SP
DASPE
V
T
APVA
D
KVV
P
T
CS
T
F
D
CG
G
K
C
D
I
R
A
H
MR
D
G
V
V
T
Q
I
T
T
LP
DN
EL
D
P
Q
M
-
--PI
M
RAC
V
RGR
GY
R
K
F
V
Y
HPDRLKYPMKRVGKRGEGKFERISWDEA
TTL
IA
D
NLKR
I
T
Q
QYG
PA
S
R
YV
H
V
GT
A
VS
GG
T
F
S
--
-
-
-G
D
A
MA
R
RL
L
N
LT
GG
Y
L
E
Y
Y
H
S
V
S
LGNT
A
A
A
T
P
YTYG
V
A
A
S
GNS
M
D
T
L
L
D
T
KLV
I
L
W
G
H
NP
T
ET
IF--
G
HTN
YY
F
Q
KM
K
Q
-
N
GT
R
F
IV
V
DPRY
S
DT
V
S
SLA
D
Q
W
I
P
L
L
P
T
TD
N
AL
M
D
A
M
M
Y
V
I
I
S
ENL
H
D
K
T
F
I
D
T
Y
T
L
G
F
DE
N
SM
P
EGV
P
A
N
ESLV
AY
L
F
G
-AK
DG
I
H
KTPEWA
E
K
IT
H
V
PA
QS
I
R
Q
LAR
D
YA
TT
KPA
A
L
I
QGWGPQRH
IC
GE
R
TAR
GST
L
L
A
S
I
TGNVGI
K
GG
W
A
AGYG
GS
S
N
R
KF
CV
G
P
D
M
P
ENPV
Q
AK
IS
I
M
N
W
MQ
A
A
D
DA
S
KV
T
P
-
Q
D
R
LK
G
V
DKLD
SN
I
R
L
L
FS
L
AGN
Y
L
A
NQ
N
P
D
VH
Q
AA
KL
L
E
D
E
SK
I
E
F
IV
L
S
D
L
FMT
P
SAKYAD
V
LLP
E
TS
FM
E
R
W
N
I
-
-
G
E
T
W
G
TA
S
Y
L
I
L
S
E
K
L
I
E
P
D
FER
R
T
D
Y
DW
L
R
D
VAK
K
LG
--
V
E
A
E
F
SQ
GR
D
EK
Q
WI
E
H
IW
E
Q
TR
L
AM
P
D
EN
LP
D
F
AT
L
Q
K
T
RRH
L
FK
----
S
AP
HI
A
F
E
A
NI
R
DPQ
N
NP
FP
TPSGKIEI
F
S
KR
L
F
D
M
-
-
----
-
Q
D
PE
I
P
A
L
S
H
Y
V
P
A
FEG
P
E
D
K
L
T
AK
YPLQL
IT
W
KG
K
N
R
A
N
ST
Q
Y
A
N
-PW
L
Q
EVQT
Q
KL
W
L
NP
Q
DA
KQ
S
GI
S
E
GD
S
V
K
IY
N
D
RG
VS
I
I
P
V
EI
TPRI
I
PGV
V
AM
QA
GAW
W
Q
P
D
-
-
A
QG
I
D
R
GGC
A
N
V
L
S
S
T
R
I
T
A
LAKGN
SHQ
T
M
LV
EV
EK
Consen1
Primary consensus
MKtkipdAvmkAeiSRR
LvKttAiGsLAlAssaftLPFs
raa
aa
-----
v
-
EKvvWsaCtVNCGSRC
LRlHVkD
ev
wVEt
--
DnTGdD
Y
-
GnHQVRACLRGRSiRRRmnhPDRLkYPMKRVGkRGEGKFERISWdEAlDtIa
nlkr
lkqYGNEavyv
YgtG
GGnmTrS
p
n
-
v
RLMNccGG
Ln
YGsYStAQIs
am
YtyG
-
nDGNSp
DIeNsKLVVmFGNNPaETRMSGGGvTYylEqaRekSNArMIvIDPRYtDTaAGREDEWlPIRPGTDaALv
giAwVlItENlVDQ
FLDKYCVGYDEKTLPAnAPkNgHYKAYILGEGpDg
AKTPeWAsqITgiPadkIIkLAREIGsaKPAyIcQGWGPQRhaNGE
taRAIaML
iLTGNVGInGGNSGaREgsy
l
er
P
L
-
eNPvkTsIS
F
WTDAIdhG
eMTA
RDGVRGKdKLDVPIKfiWnYAgNtLiNQHsdIN
ThEiLQDdsKCEmIV
Id
fMTsSAKYaDILLPDlm
EQeD
i
h
saGNMgYvIf
qpvts
kFErKpIYwmlSEvAKRLGpdVyQ
FTEGRtQeeWikhLyAk
e
Pe
--
lP
yeE
kk
--
GIfKkkdP
gHyVAykAFR
DPqANPLkTPSGKIEIYS
LAdIA
TWEL
kdevI
PLP
YtpgFegwddPlr
yPLQLtGFHYKsRtHSt
-
YGNiDvLkaACrQEvWINPiDAQkRGI
nGD
VRvfNnrGEv
i
AKVTPRilPGV
AmGqGAW
kadm
gdrvDhGGciN
LTt
RPSPLAKGNPsHtNLVqiEKv
Consen2
Secondary consensus
ihtte
lla
qv
m
ss
l
g
m
aggls
t
h
v
sp
ip
ai
gs
s
m
v
ti
y
s
t
s
d
l
m
yn
n
a
e
y
i
s
sm
k
e
sih
ss
ti
n
-
s
lp
sy
s
d
s
a
gl
mf
wa
t
a
t
l
g
i
lv
km
qr
k
i
n
g
i
g
a
al
y
m
m
r
a
d
k
q
ak
sv
ver
q
tt
f
s
qs
a
s
v
s
v
stw
i
vw
d
iq
q
er
q
e
ml
c
s
c
v
ge
n
v
ek
l
e
h
a
c
ct
m
a
yc
s
l
l
dqaik
r
c
t
eit
l
--
e
s
hq
y
h
q
a
a
m
fd
rt
y
rrc
e
fr
e
t
egdi
ast
dsy
s
en
f
f
a
v
s
v
l
qq
p
m
l
a
h
iy
n
m
l
mm
e
pn
-
akki
k
sv
s
q
s
ev
i
Legend (color key): Consensus 1 (when a gap); Conservative difference; Consensus 2 (when a gap); Nonconservative difference; Other character