fig|1040638.4.peg.5417
Escherichia coli O104:H4 str. LB226692
MK
N
APNLK
Y
QPK
---
DK
F
TEVIIFAG
T
DA
Y
AHA
Q
H
W
I
E
SE
G
-RKH
GD
NV
PPV
Y
LG
PT
QL
AD
L
ANI
R
I
I
D
D
E
R
RF
A
RV
Y
I
AG
E
I
EPIQIN
AI
AE
KLA
L
AGVQDA
KL
Y
KG
I
T
D
R
--
E
P
---
ENWR
DY
L
Q
R
I
R
E
QA
ERG
ET
IP
EGI
G
I
A
GGS
P
G
K--
T
D
PM
KP
HITEK
-
DG
AL
W
YIE
P
IPNTRAE
E
MNYK
ETWL
SDRMRTA
G
I
G
ND
G
R
E
A
Y
LIIEMIP
E
GTQKI
I
YE
A
M
P
RNE
IG
MPA
GW
AR
L
RGR
G
VAI
TT
SAHLLNK
LA
EYL
Q
RH
G
D
RTV
W
EVTS
T
A
GWH
C
GAY
V
MPDG
EV
IG
V
PDR
P
VA
F
C
G
G
SAAI
K
GY
I
V
R
GT
V
H
E
WR
N
N
VA
S
L
MR
GN
H
SMMLGV
L
V
G
LAAPL
NS
LVG
GSC
FG
I
HLF
A
QSSAGKTTT
VEA
A
T
SL
Y
G
D
P
E
M
LK
L
S
W
DA
T
RH
G
LTV
EA
A
A
R
NDG
FI
P
I
DEIGQ
G
G
RVNDIAQ
SAY
S
LFNG
V
G
RI
QG
R
KDGGNR
AVMR
W
KIA
A
L
STGE
E
D
F
ETFL
L
K
G
GI
AP
KAGQLVRLL
SI
P
FTD
T
T
V
F
NG
Y
D
D
G
D
Q
HA
R
A
I
K
RLSS
N
YC
GAAGREWV
R
WL
S
A
H
KEL
A
IN
T
TS
D
KENA
W
LGNL
PE
NASS
QV
R
RV
AS
RFA
M
L
D
AA
GE
L
A
TA
I
TGW
T
AE
ECR
E
A
T
Q
RA
F
DD
W
LQD
FG
LE
NRE
KY
Q
V
ISR
A
RD
F
I
QR
H
A
L
SR
F
Q
P
Y
TYGK
S
N
G
D
MD
N
HYASR
I
S
N
LAGY
L
V
S
GK
R
-
ED
GKPE
Y
H
II
P
S
VF
D
SEI
L
C
G
IS
RNFG
CQ
AL
E
E
AGML
V
--
CAEP
----
G
R
WTS
K
T
V
-
KI
N
G
T
Q
Q
R
F
I
VL
I
D
Q
AE
DE
fig|6666666.5357.peg.3035
Escherichia coli TY-2482
MK
N
APNLK
Y
QPK
---
DK
F
TEVIIFAG
T
DA
Y
AHA
Q
H
W
I
E
SE
G
-RKH
GD
NV
PPV
Y
LG
PT
QL
AD
L
ANI
R
I
I
D
D
E
R
RF
A
RV
Y
I
AG
E
I
EPIQIN
AI
AE
KLA
L
AGVQDA
KL
Y
KG
I
T
D
R
--
E
P
---
ENWR
DY
L
Q
R
I
R
E
QA
ERG
ET
IP
EGI
G
I
A
GGS
P
G
K--
T
D
PM
KP
HITEK
-
DG
AL
W
YIE
P
IPNTRAE
E
MNYK
ETWL
SDRMRTA
G
I
G
ND
G
R
E
A
Y
LIIEMIP
E
GTQKI
I
YE
A
M
P
RNE
IG
MPA
GW
AR
L
RGR
G
VAI
TT
SAHLLNK
LA
EYL
Q
RH
G
D
RTV
W
EVTS
T
A
GWH
C
GAY
V
MPDG
EV
IG
V
PDR
P
VA
F
C
G
G
SAAI
K
GY
I
V
R
GT
V
H
E
WR
N
N
VA
S
L
MR
GN
H
SMMLGV
L
V
G
LAAPL
NS
LVG
GSC
FG
I
HLF
A
QSSAGKTTT
VEA
A
T
SL
Y
G
D
P
E
M
LK
L
S
W
DA
T
RH
G
LTV
EA
A
A
R
NDG
FI
P
I
DEIGQ
G
G
RVNDIAQ
SAY
S
LFNG
V
G
RI
QG
R
KDGGNR
AVMR
W
KIA
A
L
STGE
E
D
F
ETFL
L
K
G
GI
AP
KAGQLVRLL
SI
P
FTD
T
T
V
F
NG
Y
D
D
G
D
Q
HA
R
A
I
K
RLSS
N
YC
GAAGREWV
R
WL
S
A
H
KEL
A
IN
T
TS
D
KENA
W
LGNL
PE
NASS
QV
R
RV
AS
RFA
M
L
D
AA
GE
L
A
TA
I
TGW
T
AE
ECR
E
A
T
Q
RA
F
DD
W
LQD
FG
LE
NRE
KY
Q
V
ISR
A
RD
F
I
QR
H
A
L
SR
F
Q
P
Y
TYGK
S
N
G
D
MD
N
HYASR
I
S
N
LAGY
L
V
S
GK
R
-
ED
GKPE
Y
H
II
P
S
VF
D
SEI
L
C
G
IS
RNFG
CQ
AL
E
E
AGML
V
--
CAEP
----
G
R
WTS
K
T
V
-
KI
N
G
T
Q
Q
R
F
I
VL
I
D
Q
AE
DE
fig|585055.6.peg.278
Escherichia coli 55989
MK
N
APNLK
Y
QPK
---
DK
F
TEVIIFAG
T
DA
Y
AHA
Q
H
W
I
E
SE
G
-RKH
GD
NV
PPV
Y
LG
PT
QL
AD
L
ANI
R
I
I
D
D
E
R
RF
A
RV
Y
I
AG
E
I
EPIQIN
AI
AE
KLA
L
AGVQDA
KL
Y
KG
I
T
D
R
--
E
P
---
ENWR
DY
L
Q
R
I
R
E
QA
ERG
ET
IP
EGI
G
I
A
GGS
P
G
K--
T
D
PM
KP
HITEK
-
DG
AL
W
YIE
P
IPNTRAE
E
MNYK
ETWL
SDRMRTA
G
I
G
ND
G
R
E
A
Y
LIIEMIP
E
GTQKI
I
YE
A
M
P
RNE
IG
MPA
GW
AR
L
RGR
G
VAI
TT
SAHLLNK
LA
EYL
Q
RH
G
D
RTV
W
EVTS
T
A
GWH
C
GAY
V
MPDG
EV
IG
V
PDR
P
VA
F
C
G
G
SAAI
K
GY
I
V
R
GT
V
H
E
WR
N
N
VA
S
L
MR
GN
H
SMMLGV
L
V
G
LAAPL
NS
LVG
GSC
FG
I
HLF
A
QSSAGKTTT
VEA
A
T
SL
Y
G
D
P
E
M
LK
L
S
W
DA
T
RH
G
LTV
EA
A
A
R
NDG
FI
P
I
DEIGQ
G
G
RVNDIAQ
SAY
S
LFNG
V
G
RI
QG
R
KDGGNR
AVMR
W
KIA
A
L
STGE
E
D
F
ETFL
L
K
G
GI
AP
KAGQLVRLL
SI
P
FTD
T
T
V
F
NG
Y
D
D
G
D
Q
HA
R
A
I
K
RLSS
N
YC
GAAGREWV
R
WL
S
A
H
KEL
A
IN
T
TS
D
KENA
W
LGNL
PE
NASS
QV
R
RV
AS
RFA
M
L
D
AA
GE
L
A
TA
I
TGW
T
AE
ECR
E
A
T
Q
RA
F
DD
W
LQD
FG
LE
NRE
KY
Q
V
ISR
A
RD
F
I
QR
H
A
L
SR
F
Q
P
Y
TYGK
S
N
G
D
MD
N
HYASR
I
S
N
LAGY
L
V
S
GK
R
-
ED
GKPE
Y
H
II
P
S
VF
D
SEI
L
C
G
IS
RNFG
CQ
AL
E
E
AGML
V
--
CAEP
----
G
R
WTS
K
T
V
-
KI
N
G
T
Q
Q
R
F
I
VL
I
D
Q
AE
DE
fig|585055.8.peg.279
Escherichia coli 55989
MK
N
APNLK
Y
QPK
---
DK
F
TEVIIFAG
T
DA
Y
AHA
Q
H
W
I
E
SE
G
-RKH
GD
NV
PPV
Y
LG
PT
QL
AD
L
ANI
R
I
I
D
D
E
R
RF
A
RV
Y
I
AG
E
I
EPIQIN
AI
AE
KLA
L
AGVQDA
KL
Y
KG
I
T
D
R
--
E
P
---
ENWR
DY
L
Q
R
I
R
E
QA
ERG
ET
IP
EGI
G
I
A
GGS
P
G
K--
T
D
PM
KP
HITEK
-
DG
AL
W
YIE
P
IPNTRAE
E
MNYK
ETWL
SDRMRTA
G
I
G
ND
G
R
E
A
Y
LIIEMIP
E
GTQKI
I
YE
A
M
P
RNE
IG
MPA
GW
AR
L
RGR
G
VAI
TT
SAHLLNK
LA
EYL
Q
RH
G
D
RTV
W
EVTS
T
A
GWH
C
GAY
V
MPDG
EV
IG
V
PDR
P
VA
F
C
G
G
SAAI
K
GY
I
V
R
GT
V
H
E
WR
N
N
VA
S
L
MR
GN
H
SMMLGV
L
V
G
LAAPL
NS
LVG
GSC
FG
I
HLF
A
QSSAGKTTT
VEA
A
T
SL
Y
G
D
P
E
M
LK
L
S
W
DA
T
RH
G
LTV
EA
A
A
R
NDG
FI
P
I
DEIGQ
G
G
RVNDIAQ
SAY
S
LFNG
V
G
RI
QG
R
KDGGNR
AVMR
W
KIA
A
L
STGE
E
D
F
ETFL
L
K
G
GI
AP
KAGQLVRLL
SI
P
FTD
T
T
V
F
NG
Y
D
D
G
D
Q
HA
R
A
I
K
RLSS
N
YC
GAAGREWV
R
WL
S
A
H
KEL
A
IN
T
TS
D
KENA
W
LGNL
PE
NASS
QV
R
RV
AS
RFA
M
L
D
AA
GE
L
A
TA
I
TGW
T
AE
ECR
E
A
T
Q
RA
F
DD
W
LQD
FG
LE
NRE
KY
Q
V
ISR
A
RD
F
I
QR
H
A
L
SR
F
Q
P
Y
TYGK
S
N
G
D
MD
N
HYASR
I
S
N
LAGY
L
V
S
GK
R
-
ED
GKPE
Y
H
II
P
S
VF
D
SEI
L
C
G
IS
RNFG
CQ
AL
E
E
AGML
V
--
CAEP
----
G
R
WTS
K
T
V
-
KI
N
G
T
Q
Q
R
F
I
VL
I
D
Q
AE
DE
fig|340186.3.peg.5065
Escherichia coli E110019 (1-706/707)
MK
K
APNLK
H
QP
R
---
DK
M
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-RLA
GD
SV
PPV
W
LG
EQ
QL
AE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
EPVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
D
GM
H
G
QEV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SMMLGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
Y
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|340186.5.peg.5333
Escherichia coli E110019 (1-706/707)
MK
K
APNLK
H
QP
R
---
DK
M
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-RLA
GD
SV
PPV
W
LG
EQ
QL
AE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
EPVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
D
GM
H
G
QEV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SMMLGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
Y
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|585397.7.peg.2650
Escherichia coli ED1a (1-706/713)
MK
K
APNLK
H
QP
R
---
DK
M
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-RLA
GD
NV
PPV
W
LG
EQ
QL
AE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
EPVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
D
GM
H
G
QEV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
L
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHF
D
AYIMPDGSIIG
D
SEKPILFTGKSAAINGYSVAGTA
E
GWRDSVARLAGGN
P
SMMLG
I
A
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
S
QRLTWYGTALGIANEAE
S
HNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
T
EGIKVKAGQLVRLLNVPMEKAT
H
FHEYS
T
GK
A
HADALKDAWT
E
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
A
A
QECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
Y
----
P
N
S
D
ER
D
---
LPIK
D
LAGYR
K
G
SI
RNED
DEFR
F
Y
TF
P
H
VFE
G
EIA
Q
GFN
PSHF
ARAL
S
AAGML
E
--
AGND
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PE
A
E
fig|585397.9.peg.2647
Escherichia coli ED1a (1-706/713)
MK
K
APNLK
H
QP
R
---
DK
M
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-RLA
GD
NV
PPV
W
LG
EQ
QL
AE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
EPVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
D
GM
H
G
QEV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
L
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHF
D
AYIMPDGSIIG
D
SEKPILFTGKSAAINGYSVAGTA
E
GWRDSVARLAGGN
P
SMMLG
I
A
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
S
QRLTWYGTALGIANEAE
S
HNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
T
EGIKVKAGQLVRLLNVPMEKAT
H
FHEYS
T
GK
A
HADALKDAWT
E
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
A
A
QECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
Y
----
P
N
S
D
ER
D
---
LPIK
D
LAGYR
K
G
SI
RNED
DEFR
F
Y
TF
P
H
VFE
G
EIA
Q
GFN
PSHF
ARAL
S
AAGML
E
--
AGND
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PE
A
E
fig|562.371.peg.478
Escherichia coli 1044A (1-709/710)
MK
L
APN
V
K
K
QP
RGIKH
K
D
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-PAS
GD
NV
PPV
W
LG
EQ
QL
SE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
APVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
E
GM
H
G
QKV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SM
I
LGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
H
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|562.374.peg.998
Escherichia coli 536A (1-709/710)
MK
L
APN
V
K
K
QP
RGIKH
K
D
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-PAS
GD
NV
PPV
W
LG
EQ
QL
SE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
APVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
E
GM
H
G
QKV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SM
I
LGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
H
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|478006.5.peg.2968
Escherichia coli O157:H7 str. EC4501 (1-709/710)
MK
L
APN
V
K
K
QP
RGIKH
K
D
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-PAS
GD
NV
PPV
W
LG
EQ
QL
SE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
APVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
E
GM
H
G
QKV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SM
I
LGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
H
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|502346.5.peg.2707
Escherichia coli O157:H7 str. TW14588 (1-709/710)
MK
L
APN
V
K
K
QP
RGIKH
K
D
TEVIIFAG
S
DAWAHAK
Q
W
Q
E
QD
G
-PAS
GD
NV
PPV
W
LG
EQ
QL
SE
L
DKL
QIV
P
EGR
KS
VR
I
F
RAG
H
L
APVMIK
AI
GQ
KLA
A
AGVQDA
--
-
-N
F
Y
P
E
GM
H
G
QKV
ENWR
EY
LAR
E
RQ
NL
SD
G
--
LV
IEF
P
V
K
KKD
TG
SHS
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
T
ANHEVITMAIPCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSGSHEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGKSAAINGYSVAGTA
D
GWRDSVARLAGGN
A
SM
I
LGVA
T
SLAAPLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEKAT
K
FHEYS
N
GK
E
HADALKDAWT
A
NHGAAGREWVKWLA
G
HQQEAKDTVR
E
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GH
VTGW
V
VQECRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
H
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEMR
YY
TF
P
H
VFESEIA
K
GFN
PAHF
ARAL
D
AAGML
E
--
KGSD
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PD
DE
fig|749548.3.peg.4095
Escherichia coli MS 196-1 (1-707/708)
MK
R
APNLK
H
QPK
---
DK
M
TEVIIFAG
S
DAWAHAK
E
W
S
E
WA
G
KHIA
A
D
DT
H
PV
I
LG
PE
QL
AS
L
ADT
QI
I
D
K
GR
YY
VRV
Y
RAG
E
I
SEQHLT
Q
I
AT
L
LA
V
AGV
KE
A
RC
Y
RS
F
V
D
Q
--
Q
P
---
E
D
W
T
PR
M
T
G
L
K
D
DA
ERG
NS
LV
INL
P
A
K
TNF
TG
NY-
DDELKPRVESR
V
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
P
ANHEVITMA
V
PCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSG
N
HEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGK
T
AA
V
NGYSVAGTA
E
GWRD
T
VARLAGGN
P
SMMLGVAVSL
S
APLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
T
EGIK
I
KAGQLVRLLNVPMEKA
S
Q
FHEYS
T
GK
A
HADALK
N
AWT
E
NHGAAGREWVKWLA
G
HQQEAKD
A
V
K
A
CRERWRNLIPESYGEQVHRVGERFA
V
LEAALVLS
GH
VTGW
D
VQ
A
CRDA
V
QHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
SS
FG
F
SRY
L
P
Y
----
P
N
S
D
ER
D
---
LPIK
D
LAGYR
K
G
SI
RNED
EEFR
F
Y
TF
P
H
VFE
G
EIA
Q
GFN
PSHF
ARAL
S
AAGML
E
--
AGGD
----
RR
YKK
K
A
L
G
R
IGG
K
Q
H
I
F
Y
VLM
F
Q
PE
A
E
fig|216592.1.peg.2991
Escherichia coli 042 (1-709/712)
MK
R
APNLK
H
QPK
---
DK
M
TEVIIFAG
S
DAWAHAK
E
W
S
E
WA
G
KHIA
A
D
DT
H
PV
I
LG
PE
QL
AS
L
ADT
QI
I
D
K
GR
YY
VRV
Y
RAG
E
I
SEQHLT
Q
I
AT
L
LA
V
AGV
KE
A
RC
Y
RS
F
V
D
Q
--
Q
P
---
E
D
W
T
PR
M
A
G
L
K
D
DA
ERG
NS
LV
INL
P
A
K
TNF
TG
NY-
DDELKPRVESR
V
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
P
ANHEVITMA
V
PCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSG
N
HEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGK
T
AA
V
NGYSVAGTA
E
GWRD
T
VARLAGGN
P
SMMLGVAVSL
S
APLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEK
S
T
Q
FHEYS
T
GK
A
HADALKDAWT
A
NHGAAGREW
I
KWLA
A
HQQEAKDTVR
A
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GN
VTGW
D
VQ
A
CRDAIQHNFNAWVKEFGTGN
K
E
HR
Q
I
I
EQAEAFL
TA
Y
G
M
SR
F
A
P
V
----
-
N
Y
D
PA
S
---
LPI
S
EL
Y
GYR
E
S
EG
R
-
YG
EPVL
F
Y
VL
P
E
P
F
K
S
HV
A
K
GFN
KDAV
ARAL
H
E
AGML
K
KP
ASGE
GWQI
R
T
PRL
K
H
M
K
--
G
A
R
L
R
V
Y
G
L
L
L
A
Q
DH
D
D
fig|216592.3.peg.2537
Escherichia coli 042 (1-709/712)
MK
R
APNLK
H
QPK
---
DK
M
TEVIIFAG
S
DAWAHAK
E
W
S
E
WA
G
KHIA
A
D
DT
H
PV
I
LG
PE
QL
AS
L
ADT
QI
I
D
K
GR
YY
VRV
Y
RAG
E
I
SEQHLT
Q
I
AT
L
LA
V
AGV
KE
A
RC
Y
RS
F
V
D
Q
--
Q
P
---
E
D
W
T
PR
M
A
G
L
K
D
DA
ERG
NS
LV
INL
P
A
K
TNF
TG
NY-
DDELKPRVESR
V
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
P
ANHEVITMA
V
PCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSG
N
HEEWQLSTTTGWHFGAYIMPDGSIIG
E
SEKPILFTGK
T
AA
V
NGYSVAGTA
E
GWRD
T
VARLAGGN
P
SMMLGVAVSL
S
APLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
S
EGIKVKAGQLVRLLNVPMEK
S
T
Q
FHEYS
T
GK
A
HADALKDAWT
A
NHGAAGREW
I
KWLA
A
HQQEAKDTVR
A
CRERWRNLIPESYGEQVHRVGERFAILEAALVLS
GN
VTGW
D
VQ
A
CRDAIQHNFNAWVKEFGTGN
K
E
HR
Q
I
I
EQAEAFL
TA
Y
G
M
SR
F
A
P
V
----
-
N
Y
D
PA
S
---
LPI
S
EL
Y
GYR
E
S
EG
R
-
YG
EPVL
F
Y
VL
P
E
P
F
K
S
HV
A
K
GFN
KDAV
ARAL
H
E
AGML
K
KP
ASGE
GWQI
R
T
PRL
K
H
M
K
--
G
A
R
L
R
V
Y
G
L
L
L
A
Q
DH
D
D
fig|216592.1.peg.5592
Escherichia coli 042 (1-705/706)
MK
L
APN
V
K
L
L
PK
---
DK
G
EDAV
IFAG
D
DA
YS
HA
E
H
Y
M
Q
GG
E
ARKR
GD
KI
PPV
Y
LG
RR
D
L
GN
L
ENL
R
IVD
D
GR
LR
AM
V
R
RAG
K
L
DDRQAL
Q
I
ET
L
LA
V
AGV
KE
A
--
-
-S
F
C
D
E
--
N
G
ELL
E
D
W
T
PQ
LAR
L
K
D
EY
ERG
ES
LV
--L
P
L
K
KKI
T
E
SQG
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
P
ANHEVITMA
V
PCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSG
N
HEEWQLSTTTGWHFGAYIMPDGS
V
IG
E
SEKPILFTGK
T
AA
V
NGYSVAGTA
E
GWRD
T
VARLAGGN
P
SMMLGVAVSL
S
APLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
T
EGIKVKAGQLVRLLNVPMEKAT
Q
FHEYS
T
GK
A
HADALKDAWT
E
NHGAAGREWVKWLA
D
HQQEAKDTVR
A
CRERWRNLIPESYGEQVHRVGERFA
M
LE
S
ALVLS
VH
I
TGW
D
VQ
A
CRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
AS
FG
F
SRY
L
P
W
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEFR
F
Y
TF
P
H
VFE
G
EIA
Q
GFN
PSHF
ARAL
S
AAGML
E
--
AGND
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PE
A
E
fig|216592.3.peg.2256
Escherichia coli 042 (1-705/706)
MK
L
APN
V
K
L
L
PK
---
DK
G
EDAV
IFAG
D
DA
YS
HA
E
H
Y
M
Q
GG
E
ARKR
GD
KI
PPV
Y
LG
RR
D
L
GN
L
ENL
R
IVD
D
GR
LR
AM
V
R
RAG
K
L
DDRQAL
Q
I
ET
L
LA
V
AGV
KE
A
--
-
-S
F
C
D
E
--
N
G
ELL
E
D
W
T
PQ
LAR
L
K
D
EY
ERG
ES
LV
--L
P
L
K
KKI
T
E
SQG
DDELKPRVESR
A
DGVFW
-
VTPKVDKQSGEIIRPETWLCSPLELLGTGTIGKEHYRVMRWKK
P
ANHEVITMA
V
PCGGIGDRDGWRLLKDHGLNVTTNGKYRAILADWMQLSG
N
HEEWQLSTTTGWHFGAYIMPDGS
V
IG
E
SEKPILFTGK
T
AA
V
NGYSVAGTA
E
GWRD
T
VARLAGGN
P
SMMLGVAVSL
S
APLIGLVGADGFGVHLFEQSSAGKTTTQNIASSLWGEPD
A
QRLTWYGTALGIANEAEAHNDGLLPLDEIGQAGNAREVSTSAYTLFNGSGKLQGAKDGGNREIKHWRTVAISTGEMDVETFLK
T
EGIKVKAGQLVRLLNVPMEKAT
Q
FHEYS
T
GK
A
HADALKDAWT
E
NHGAAGREWVKWLA
D
HQQEAKDTVR
A
CRERWRNLIPESYGEQVHRVGERFA
M
LE
S
ALVLS
VH
I
TGW
D
VQ
A
CRDAIQHNFNAWVKEFGTGNRE
FK
QMVEQAEAFL
AS
FG
F
SRY
L
P
W
----
P
N
T
D
ER
D
---
LPIKELAGYR
K
G
SI
RNED
DEFR
F
Y
TF
P
H
VFE
G
EIA
Q
GFN
PSHF
ARAL
S
AAGML
E
--
AGND
----
RR
YKK
K
A
L
G
KIGG
K
Q
H
VF
Y
VLM
F
Q
PE
A
E
Consen1
Primary consensus
MK
APNlK
qPk
---
dK
teviIFAG
DAwaHAk
w
e
g
gD
pPV
LG
qL
L
qIvdegR
vrv
rAG
l
aI
kLA
AGVqdA
-
f
d
--
g
EnWr
lar
rq
erG
lv
p
k
tg
dDelKPrvesr
DGvfW
-
vtPkvdkqsgEiirpETWLcsplellGtGtiGkEhYrvmrwkk
anhevItmAiPcggIGdrdGWrlLkdhGlnvTTngkyraiLAdwmQlsGsheeWqlstTtGWHfgAYiMPDGsiIG
sekPilFtGksAAinGYsVaGTa
gWRdsVArLagGN
SMmLGvavsLaAPLigLVGadgFGvHLFeQSSAGKTTTqniAsSLwGePd
qrLtWygTalGianEAeahNDGllPlDEIGQaGnarevstSAYtLFNGsGklQGaKDGGNReikhWrtvAiSTGEmDvETFLk
eGIkvKAGQLVRLLnvPmekaT
FheYs
Gk
HAdAlKdawt
nhGAAGREWvkWLa
HqqeAkdTvr
crerWrnliPEsygeQVhRVgeRFAiLeaAlvLs
vTGW
vqeCRdAiQhnFnaWvkeFGtgNrE
QmveqAeaFl
fg
SRy
P
----
N
D
d
---
lpIkeLaGYr
g
Rned
yy
P
vFeseia
Gfn
arAL
aAGML
--
----
rr
K
l
kigg
q
vf
vLm
Q
de
Consen2
Secondary consensus
v
l
rgikh
edav
ys
y
q
e
a
h
d
r
ip
e
ami
i
i
q
l
ke
y
i
p
gm
p
d
t
mqg
k
sd
ip
g
a
pe
t
pm
hitek
al
yie
ipntrae
mnyk
sdrmrta
i
nd
r
a
liiemip
gtqki
ye
rne
mpa
ar
rgr
vai
sahllnk
eyl
rh
rtv
evts
a
cd
v
ev
pdr
va
c
gt
vk
i
r
v
e
n
s
mr
i
iltg
s
ns
gsc
i
a
vea
t
y
d
e
lk
s
da
rh
ltv
asr
fi
i
g
rvndiaq
s
v
ri
r
avmr
kia
l
e
f
l
g
ap
si
ftd
ng
d
d
r
i
rlss
yc
ir
s
kel
in
ts
kena
lgnl
nass
r
as
m
ds
ge
a
i
aea
e
t
ra
dd
lqd
le
k
isr
rd
i
a
f
tygk
hyasr
s
y
l
s
-
yg
fh
p
ghvl
is
cq
e
kp
gwqigt
--
na
l
ry
l
ad
Color key: Consensus 1 (when a gap); Conservative difference; Consensus 2 (when a gap); Nonconservative diff.; Other character