fig|749531.3.peg.771
Escherichia coli MS 69-1 (13-883/883)
MHQVL
I
L
P
RFVRLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
LAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SATS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|1040638.4.peg.4256
Escherichia coli O104:H4 str. LB226692
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSEL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|6666666.5357.peg.1600
Escherichia coli TY-2482
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSEL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585055.6.peg.1665
Escherichia coli 55989
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSEL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
R
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585055.8.peg.1669
Escherichia coli 55989
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSEL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
R
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749531.3.peg.772
Escherichia coli MS 69-1
MHQVL
I
L
P
RFVRLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
LAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SATS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340185.3.peg.2629
Escherichia coli E22 (13-883/883)
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.375.peg.3835
Escherichia coli EC4100B (13-883/883)
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679204.3.peg.3869
Escherichia coli MS 145-7 (13-883/883)
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340186.3.peg.136
Escherichia coli E110019 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340185.4.peg.2771
Escherichia coli E22
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585395.4.peg.1712
Escherichia coli O103:H2 str. 12009
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340184.3.peg.52
Escherichia coli B7A (13-883/883)
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLTFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656408.3.peg.1665
Escherichia coli H591 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679206.4.peg.3260
Escherichia coli MS 119-7 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340184.6.peg.57
Escherichia coli B7A
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLTFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340186.5.peg.150
Escherichia coli E110019
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749532.3.peg.1015
Escherichia coli MS 78-1 (13-883/883)
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|550677.3.peg.2920
Escherichia coli B354
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKE
N
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
IY
QTT
VPPGPF
N
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SATS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585396.4.peg.1973
Escherichia coli O111:H- str. 11128
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
IFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
ITVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679205.4.peg.3340
Escherichia coli MS 124-1 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
TNG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749533.3.peg.1872
Escherichia coli MS 84-1 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
TNG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656408.3.peg.1664
Escherichia coli H591
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679206.4.peg.3259
Escherichia coli MS 119-7
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|409438.11.peg.1734
Escherichia coli SE11
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|573235.3.peg.2137
Escherichia coli O26:H11 str. 11368
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
ITVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
A
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|595495.4.peg.3471
Escherichia coli KO11 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELNE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGQLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|566546.3.peg.1193
Escherichia coli W (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELNE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGQLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749545.3.peg.5047
Escherichia coli MS 182-1
MHQVL
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.373.peg.5043
Escherichia coli 1125A (13-883/883)
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.372.peg.1181
Escherichia coli 1212A (13-883/883)
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.374.peg.2402
Escherichia coli 536A (13-883/883)
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|155864.1.peg.1958
Escherichia coli O157:H7 EDL933 (13-883/883)
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|701177.3.peg.1859
Escherichia coli O55:H7 str. CB9615
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNLPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656443.3.peg.1855
Escherichia coli TA271 (13-883/883)
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFA
Q
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.373.peg.5044
Escherichia coli 1125A
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.372.peg.1182
Escherichia coli 1212A
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.374.peg.2401
Escherichia coli 536A
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|155864.8.peg.1781
Escherichia coli O157:H7 EDL933
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444454.5.peg.1019
Escherichia coli O157:H7 str. EC4024
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444449.5.peg.344
Escherichia coli O157:H7 str. EC4042
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444448.5.peg.4703
Escherichia coli O157:H7 str. EC4045
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444453.5.peg.2899
Escherichia coli O157:H7 str. EC4076
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444452.5.peg.1917
Escherichia coli O157:H7 str. EC4113
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444450.8.peg.2162
Escherichia coli O157:H7 str. EC4115
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444451.5.peg.1910
Escherichia coli O157:H7 str. EC4196
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|444447.5.peg.5612
Escherichia coli O157:H7 str. EC4206
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|478004.5.peg.2888
Escherichia coli O157:H7 str. EC4401
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|478005.5.peg.2935
Escherichia coli O157:H7 str. EC4486
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|478006.5.peg.1903
Escherichia coli O157:H7 str. EC4501
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|478007.5.peg.2107
Escherichia coli O157:H7 str. EC508
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|478008.5.peg.3626
Escherichia coli O157:H7 str. EC869
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|637388.3.peg.1613
Escherichia coli O157:H7 str. FRIK2000
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|570506.3.peg.2984
Escherichia coli O157:H7 str. FRIK966
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|544404.4.peg.2025
Escherichia coli O157:H7 str. TW14359
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|502346.5.peg.5256
Escherichia coli O157:H7 str. TW14588
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679207.4.peg.4465
Escherichia coli MS 107-1
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPETLLNQQT
A
I
CR
fig|566546.3.peg.1194
Escherichia coli W
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELNE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGQLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|566546.4.peg.1625
Escherichia coli W
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELNE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGQLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|550672.3.peg.2371
Escherichia coli B088
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELNE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGQLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.371.peg.1697
Escherichia coli 1044A (13-883/883)
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
DQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|679205.4.peg.3339
Escherichia coli MS 124-1
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
TNG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749533.3.peg.1873
Escherichia coli MS 84-1
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
TNG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585034.4.peg.1498
Escherichia coli IAI1
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITAS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585034.5.peg.1495
Escherichia coli IAI1
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KIAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITAS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749549.3.peg.244
Escherichia coli MS 198-1 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656437.3.peg.1728
Escherichia coli TA143 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|562.371.peg.1698
Escherichia coli 1044A
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
DQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|386585.9.peg.2216
Escherichia coli O157:H7 str. Sakai
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
DQEQI
S
IS
Q
QLG-NYG
A
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
STLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656380.3.peg.2897
Escherichia coli FVEC1412
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749549.3.peg.245
Escherichia coli MS 198-1
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656437.3.peg.1727
Escherichia coli TA143
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|585056.7.peg.1947
Escherichia coli UMN026
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFI
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GKNKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656443.3.peg.1854
Escherichia coli TA271
MHQVL
I
MP
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-ELIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYYSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFA
Q
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656444.3.peg.2305
Escherichia coli TA280 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIAGDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
N
GG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNS
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656419.3.peg.2183
Escherichia coli M718 (13-883/883)
L
HQVL
L
L
P
RFVRLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|344601.3.peg.277
Escherichia coli B171
M
L
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTL
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|344601.5.peg.272
Escherichia coli B171
M
L
I
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
EL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSNSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RN
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTL
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656444.3.peg.2304
Escherichia coli TA280
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIAGDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
N
GG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNS
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|216592.1.peg.2051
Escherichia coli 042 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TEL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNST
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
N
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|656393.3.peg.2254
Escherichia coli H299 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVS
--------
DNSA
C
TPLRDRLADASSEF
N
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEKT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFSD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNI
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|216592.3.peg.1694
Escherichia coli 042
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TEL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNST
C
TPLQDRLADASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
N
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|655817.3.peg.1822
Escherichia coli ABU 83972 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|199310.1.peg.1873
Escherichia coli CFT073 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|199310.4.peg.1801
Escherichia coli CFT073 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|749528.3.peg.3903
Escherichia coli MS 45-1 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|525281.3.peg.1021
Escherichia coli 83972
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|655817.3.peg.1821
Escherichia coli ABU 83972
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NVQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVEYKL
P
EVSPGTLLNQQT
A
I
CR
fig|550676.3.peg.1843
Escherichia coli B185
MHQVL
L
L
P
RFARLTIALSLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFITDDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLVDASTEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
FK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPR
F
IQGSLM
HG
LEENW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
G
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGSM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
EVSPGTLLNQQT
A
I
CR
fig|340197.3.peg.2942
Escherichia coli F11 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
IV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|340197.5.peg.3074
Escherichia coli F11 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
IV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|749550.3.peg.183
Escherichia coli MS 200-1 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
IV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|753642.3.peg.1648
Escherichia coli NC101 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAKI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|216593.1.peg.312
Escherichia coli E2348/69 (13-883/883)
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHAP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QIT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
C
Y
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
N
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
L
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
NMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNI
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|431946.3.peg.1466
Escherichia coli SE15 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-HYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPLGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|753642.3.peg.1649
Escherichia coli NC101
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAKI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|405955.9.peg.1320
Escherichia coli APEC O1 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFDG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|749550.3.peg.182
Escherichia coli MS 200-1
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
IV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|574521.7.peg.1667
Escherichia coli O127:H6 str. E2348/69
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHAP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QIT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
C
Y
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
N
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
L
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
NMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNI
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|439855.10.peg.1825
Escherichia coli SMS-3-5
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EFY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVS
--------
DNSA
C
TPLRDRLADASSEF
N
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRGNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RS
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DG
-
T
---
RHS
G
Q
S
I
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDSNEQT
-
QFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFSD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNI
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|670897.3.peg.2390
Escherichia coli 2362-75
MHQVL
L
L
P
RFARLTIALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
TSR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHAP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
W
Q
LR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QIT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
C
Y
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
IQGSLM
HG
LEGNW
T
P
YGG
M
-
QIA
E
N
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
L
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNI
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
AVSPGTLLNQQT
A
I
CR
fig|405955.13.peg.1625
Escherichia coli APEC O1
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFDG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|714962.3.peg.1691
Escherichia coli IHE3034 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMAHG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|869729.3.peg.2018
Escherichia coli UM146 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SADKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMAHG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
G
N
L
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
FV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
KDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|562.376.peg.3005
Escherichia coli WV_060327 (13-883/883)
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LD
S
KEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASSEF
D
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNG-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSTPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFRNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTQGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
A
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
A
T
G
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDYKL
P
VVSPGTLLNQQT
A
I
CR
fig|685038.3.peg.1501
Escherichia coli O83:H1 str. NRG 857C
MHQVL
I
L
P
RFARLTFALGLATAVFPVD
A
EYY--
---
FN
PR
F
LS-N
--
DLAESV
D
-
LSA
F
TK
--
GREAP
PG
T
Y
R
V
D
I
Y
LN
DEFM
-
ASR-D
I
TFIADDNNA----
-
-DLIP
CL
S
--
TDL
L
VSL
G
I
K
KSAL
LDNKEH
SAEKHVP
--------
DNSA
C
TPLQDRLADASSEF
N
V
--
GQQH
L
SL
SVPQ
I
Y
V
GRMARG
Y
V
S
P
DL
W
E
E
GINA
GLLN
Y
SFNS-NSINNRSNHN
-
AGK
S
NYAYLNLQS
G
I
N
I
G
S
WRLR
D
N
STWSYNSGS
-
SN-
-
-
---------
SSDSNK
W
QHI
--------
NTS
A
E
R
DIIP
L
R
-
SR
L
T
V
GD
S
Y
TD
G
---
DIFDS
VN
F
R
G
LK
I
NS
T
EA
MLP
DSQH
GFAP
V
I
H
GIA
RG
T
A
Q
V
SV
K
Q
N
G
YD
V
Y
QTT
VPPGPF
T
I
D
D
I
NS
A
ANG
GDL
Q
V
T
I
K
E
A
DG
SIQTLY
VP
Y
SS
V
P
V
L
Q
R
A
G
YT
RY
AL
A
M
GE
YRSG-
--
NNLQSSPK
F
VQASLM
HG
LKGNW
T
P
YGG
M
-
QIA
E
D
Y
Q
A
FNL
G
I
G
K
D
LGLF
GA
F
SFD
I
T
Q
A
NTTLA
-----
DD
-
T
---
RHS
G
Q
S
V
K
SV
YSK
SFYQTG
T
NIQVA
GYRYS
TQG
F
Y
N
L
SD
SAYS
--
RMSGYTVKPPTGDTSEQT
-
LFIDYFNL
-
FYSK
-
R
GQEQI
S
IS
Q
QLG-NYG
T
TFF
S
ASRQS
YW
NTSRSDQ-QISF
G
LNVPFGD-ITTS
L
N
Y
S
YSNNIW
Q
N
-----------
-DR
D
HLLAFTL
N
V
P
FSHWMRT
--
DSQSAFHNSNAS
Y
SMSN
-
--DLKGGM
T
NLS
G
VYG
TL
LP
D
NN
-
L
N
Y
S
V
QV
G
NTHGGNTS-SGTS
GY
-
SSLN--
Y
RGAY
---
G
NTNV
G
Y
S
RS
--
-GDSS
Q
IYYGM
SGG
II
A
HAD
G
IT
F
G
-
QPLG--D
T
MV
LV
K
APG
AD
N
VK
I
E
-
NQTGIH
TD
WR
G
YA
I
LPFA
T
E
Y
RE
N
RVA
L
N
A
N
S
L
AD
-
N
V
ELDE
T
V
VTVIPTH
GAI
ARAT
F
NAQI
G
GKVLMTLKY-G
N
KSVP
FGA
I
V
T
---
H-
-
GENKNG
S
IV
A
E
N
G
QV
Y
------
L
T
G
L
PQ
--
SGKLQ
V
S
WG
NDKNSN
-----
C
IVDFKL
P
VVSPGTLLNQQT
A
I
CR
fig|550676.3.peg.4559
Escherichia coli B185 (5-856/856)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|679204.3.peg.2710
Escherichia coli MS 145-7 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
VQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|679204.3.peg.2711
Escherichia coli MS 145-7 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
VQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|573235.3.peg.5702
Escherichia coli O26:H11 str. 11368 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|701177.3.peg.5123
Escherichia coli O55:H7 str. CB9615 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|413997.3.peg.4421
Escherichia coli B str. REL606 (5-856/856)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|511693.5.peg.4444
Escherichia coli BL21 (5-856/856)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|469008.4.peg.3849
Escherichia coli BL21(DE3) (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656379.3.peg.119
Escherichia coli FVEC1302 (5-856/856)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656380.3.peg.59
Escherichia coli FVEC1412 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656419.3.peg.49
Escherichia coli M718 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656419.3.peg.50
Escherichia coli M718 (5-856/856)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749540.3.peg.3242
Escherichia coli MS 146-1 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749540.3.peg.3243
Escherichia coli MS 146-1 (5-856/856)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749549.3.peg.189
Escherichia coli MS 198-1 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749549.3.peg.188
Escherichia coli MS 198-1 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585056.7.peg.5087
Escherichia coli UMN026 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|679207.4.peg.3440
Escherichia coli MS 107-1 (27-878/878)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSIIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|679207.4.peg.3439
Escherichia coli MS 107-1 (12-863/863)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSIIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340185.3.peg.2053
Escherichia coli E22 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTTMIQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340185.4.peg.2177
Escherichia coli E22 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTTMIQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585395.4.peg.5359
Escherichia coli O103:H2 str. 12009 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTTMIQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749545.3.peg.4255
Escherichia coli MS 182-1 (27-878/878)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749545.3.peg.4254
Escherichia coli MS 182-1 (5-856/856)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.371.peg.3831
Escherichia coli 1044A (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.371.peg.3832
Escherichia coli 1044A (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.373.peg.2372
Escherichia coli 1125A (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.373.peg.2371
Escherichia coli 1125A (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.372.peg.3225
Escherichia coli 1212A (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.374.peg.2997
Escherichia coli 536A (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.374.peg.2996
Escherichia coli 536A (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|550677.3.peg.285
Escherichia coli B354 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
V
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|83334.1.peg.5256
Escherichia coli O157:H7 (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444454.5.peg.4364
Escherichia coli O157:H7 str. EC4024 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444449.5.peg.3817
Escherichia coli O157:H7 str. EC4042 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444448.5.peg.2574
Escherichia coli O157:H7 str. EC4045 (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444452.5.peg.2532
Escherichia coli O157:H7 str. EC4113 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444450.8.peg.5654
Escherichia coli O157:H7 str. EC4115 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444451.5.peg.1637
Escherichia coli O157:H7 str. EC4196 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|444447.5.peg.2740
Escherichia coli O157:H7 str. EC4206 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|478004.5.peg.1673
Escherichia coli O157:H7 str. EC4401 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|478005.5.peg.787
Escherichia coli O157:H7 str. EC4486 (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|478006.5.peg.1105
Escherichia coli O157:H7 str. EC4501 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|478007.5.peg.723
Escherichia coli O157:H7 str. EC508 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|386585.9.peg.5512
Escherichia coli O157:H7 str. Sakai (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|544404.4.peg.5464
Escherichia coli O157:H7 str. TW14359 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|502346.5.peg.1358
Escherichia coli O157:H7 str. TW14588 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|595495.4.peg.2118
Escherichia coli KO11 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
V
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|595495.4.peg.2117
Escherichia coli KO11 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
V
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|566546.3.peg.4960
Escherichia coli W (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
V
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|566546.3.peg.4959
Escherichia coli W (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
V
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|566546.4.peg.4613
Escherichia coli W (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
V
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585034.4.peg.4402
Escherichia coli IAI1 (12-863/863)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSIIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585034.5.peg.4397
Escherichia coli IAI1 (12-863/863)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSIIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749548.3.peg.575
Escherichia coli MS 196-1 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
V
PG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585057.4.peg.4936
Escherichia coli IAI39 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585057.6.peg.4945
Escherichia coli IAI39 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|478008.5.peg.334
Escherichia coli O157:H7 str. EC869 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLNN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|637388.3.peg.3744
Escherichia coli O157:H7 str. FRIK2000 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLNN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|570506.3.peg.2814
Escherichia coli O157:H7 str. FRIK966 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLNN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|316401.4.peg.5266
Escherichia coli ETEC H10407 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASI
------
SGMNLLA
--------
DD-A
C
VPLTAMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
IYG
TL
LE
D
ND
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|316401.4.peg.5267
Escherichia coli ETEC H10407 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASI
------
SGMNLLA
--------
DD-A
C
VPLTAMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
IYG
TL
LE
D
ND
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|431946.3.peg.4438
Escherichia coli SE15 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASI
------
SGMNLLA
--------
DD-A
C
VPLTAMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
IYG
TL
LE
D
ND
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|155864.1.peg.5241
Escherichia coli O157:H7 EDL933 (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
X
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|155864.8.peg.5227
Escherichia coli O157:H7 EDL933 (12-863/863)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLARNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
X
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|409438.11.peg.4774
Escherichia coli SE11 (12-863/863)
FVRLVVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
G
H
RYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.375.peg.2765
Escherichia coli EC4100B (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.375.peg.2764
Escherichia coli EC4100B (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
IANYQL
P
PESQQQLLTQLS
A
E
CR
fig|595496.3.peg.4403
Escherichia coli BW2952 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|536056.3.peg.3898
Escherichia coli DH1 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|83333.1.peg.4227
Escherichia coli K12 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|316407.3.peg.4145
Escherichia coli W3110 (27-878/878)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|511145.12.peg.4458
Escherichia coli str. K-12 substr. MG1655 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|511145.6.peg.4437
Escherichia coli str. K-12 substr. MG1655 (12-863/863)
FVRLVVACAFAAQAPLSS
A
DLY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
AGMNLLA
--------
DD-A
C
VPLTTMVQDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKTR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749527.3.peg.2986
Escherichia coli MS 21-1 (27-878/878)
FVRLFVACAFSAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QYI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749527.3.peg.2985
Escherichia coli MS 21-1 (12-863/863)
FVRLFVACAFSAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QYI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|550672.3.peg.4626
Escherichia coli B088 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|679205.4.peg.3587
Escherichia coli MS 124-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749533.3.peg.3126
Escherichia coli MS 84-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749533.3.peg.3127
Escherichia coli MS 84-1 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749547.3.peg.1081
Escherichia coli MS 187-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASI
------
SGMNLLA
--------
DD-A
C
VPLTAMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TIWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTX
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
IYG
TL
LE
D
ND
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656408.3.peg.4795
Escherichia coli H591 (27-878/878)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
N
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|656408.3.peg.4796
Escherichia coli H591 (12-863/863)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
N
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|679206.4.peg.3535
Escherichia coli MS 119-7 (27-878/878)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
N
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|679206.4.peg.3536
Escherichia coli MS 119-7 (12-863/863)
FVRLFVACAFAAQTPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
N
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|331112.3.peg.4253
Escherichia coli HS (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|331112.6.peg.4426
Escherichia coli HS (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNRSD
-
RS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PERQQQLLTQLS
A
E
CR
fig|749537.3.peg.336
Escherichia coli MS 115-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GIN
S
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QYI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
A
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
N
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749537.3.peg.337
Escherichia coli MS 115-1 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GIN
S
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
RS-
-
-
---------
SGSKNK
W
QYI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
A
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
N
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|685038.3.peg.4444
Escherichia coli O83:H1 str. NRG 857C (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGRKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|405955.13.peg.4869
Escherichia coli APEC O1 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|405955.9.peg.4103
Escherichia coli APEC O1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|714962.3.peg.4844
Escherichia coli IHE3034 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|869729.3.peg.4787
Escherichia coli UM146 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|869729.3.peg.4788
Escherichia coli UM146 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|364106.7.peg.4881
Escherichia coli UTI89 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|364106.8.peg.4882
Escherichia coli UTI89 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.376.peg.833
Escherichia coli WV_060327 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|562.376.peg.834
Escherichia coli WV_060327 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|199310.1.peg.5287
Escherichia coli CFT073 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|199310.4.peg.5054
Escherichia coli CFT073 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749546.3.peg.2188
Escherichia coli MS 185-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749546.3.peg.2189
Escherichia coli MS 185-1 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749528.3.peg.1496
Escherichia coli MS 45-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749528.3.peg.1497
Escherichia coli MS 45-1 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749531.3.peg.2134
Escherichia coli MS 69-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NAYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RVQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GSN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|216593.1.peg.3674
Escherichia coli E2348/69 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GSN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
A
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|574521.7.peg.4728
Escherichia coli O127:H6 str. E2348/69 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GSN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
A
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656417.3.peg.5516
Escherichia coli M605 (27-878/878)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LA
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656417.3.peg.5517
Escherichia coli M605 (5-856/856)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LA
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|753642.3.peg.3754
Escherichia coli NC101 (27-878/878)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGTL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|753642.3.peg.3753
Escherichia coli NC101 (5-856/856)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGTL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656440.3.peg.4941
Escherichia coli TA206 (5-856/856)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656437.3.peg.4826
Escherichia coli TA143 (27-878/878)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QMA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656437.3.peg.4827
Escherichia coli TA143 (5-856/856)
FVRLFVACAFTAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QMA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656393.3.peg.511
Escherichia coli H299 (27-878/878)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GSN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTNNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656393.3.peg.512
Escherichia coli H299 (5-856/856)
FVRLFVACAFAVQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GSN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTNNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|344601.5.peg.829
Escherichia coli B171 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTL
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|344601.3.peg.798
Escherichia coli B171 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTL
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340184.3.peg.2246
Escherichia coli B7A (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
E
LN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340184.6.peg.2357
Escherichia coli B7A (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
E
LN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656444.3.peg.201
Escherichia coli TA280 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
AGM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTTMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIVTQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTNNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656444.3.peg.202
Escherichia coli TA280 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
AGM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTTMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIVTQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RSS
T
LYL
S
GSHQT
YW
GTNNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|670888.3.peg.477
Escherichia coli 1827-70 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-SNWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
IL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340186.3.peg.3345
Escherichia coli E110019 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-SNWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
IL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340186.5.peg.3489
Escherichia coli E110019 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-SNWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
IL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|585035.6.peg.4836
Escherichia coli S88 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-SNWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
IL
A
HAN
G
VT
L
G
-
QPLN--E
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656443.3.peg.12
Escherichia coli TA271 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
E
LN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
NSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|656443.3.peg.13
Escherichia coli TA271 (12-863/863)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
E
LN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
NSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLV
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749550.3.peg.3881
Escherichia coli MS 200-1 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|362663.8.peg.4671
Escherichia coli 536 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|362663.9.peg.4687
Escherichia coli 536 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340197.3.peg.754
Escherichia coli F11 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|340197.5.peg.782
Escherichia coli F11 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|749550.3.peg.3880
Escherichia coli MS 200-1 (27-878/878)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
TFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAQL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRLR
D
N
TSWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
RG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
VYG
TL
LE
D
NN
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
Q
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
A
E
CR
fig|670897.3.peg.1728
Escherichia coli 2362-75 (5-856/856)
FVRLFVACAFAAQAPLSS
A
ELY--
---
FN
PR
F
LA-D
--
DPQAVA
D
-
LSR
F
EN
--
GQELP
PG
T
Y
R
V
D
I
Y
LN
NGYM
-
ATR-D
V
SFNTGDSEQ----
-
-GIVP
CLT
--
RAQ
L
ASM
GLN
TASV
------
SGMNLLA
--------
DD-A
C
VPLTSMIHDATAHL
D
V
--
GQQR
L
NL
T
I
PQA
F
M
SNRARG
Y
I
PP
EL
WD
P
GINA
GLLN
Y
NFSG-NSVQNRI---
-
GGN
S
HYAYLNLQS
GLN
I
G
A
WRL
C
D
N
TTWSYNSSD
-
SS-
-
-
---------
SGSKNK
W
QHI
--------
NTW
L
E
R
DIIP
L
R
-
SR
L
T
LGD
G
Y
TQ
G
---
DIFD
G
IN
F
R
G
AQ
L
AS
D
DN
MLP
DSQR
GFAP
V
I
H
GIA
HG
T
A
Q
VTI
K
Q
N
G
YD
IY
NST
VPPGPF
T
I
N
D
I
YA
A
GNS
GDL
Q
V
T
I
K
E
A
DG
STQIFT
VP
Y
SS
V
P
L
L
Q
R
E
G
HT
RY
SI
T
A
GE
YRSG-
--
NAQQEKPR
F
FQSTLL
HG
LPAGW
T
I
YGG
T
-
QLA
D
R
Y
R
A
FNF
G
I
G
K
N
MGAL
GA
L
S
V
D
M
T
Q
A
NSTLP
-----
DD
-
S
---
QHD
G
Q
S
V
R
FL
Y
N
K
SLNESG
T
NIQLV
GYRYS
TSG
Y
F
N
F
A
D
TTYS
--
RMNGYNIETQDGVIQVKP
-
KFTDYYNL
-
AYNK
-
R
GKLQL
T
VT
Q
QLG-RTS
T
LYL
S
GSHQT
YW
GTSNVDE-QFQA
G
LNTAFED-INWT
LS
Y
S
LTKNAW
Q
K
-----------
-GR
D
QMLALNV
N
IP
FSHWLRS
--
DSKSQWRHASAS
Y
SMSH
-
--DLNGRM
T
NLA
G
IYG
TL
LE
D
ND
-
L
SY
S
V
QT
G
YAGGGDGN-SGST
GY
-
ATLN--
Y
RGGY
---
G
NANI
G
Y
S
HS
--
-DDIK
R
LYYGV
SGG
VL
A
HAN
G
VT
L
G
-
QPLN--D
T
VV
LV
K
APG
AK
D
AK
V
E
-
NQTGVR
TD
WR
G
YA
V
LPYA
T
E
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DLDN
AV
ANVVPTR
GAI
VRAE
F
KARV
G
IKLLMTLTH-N
N
KPLP
FGA
M
V
T
---
S-
-
ESSQSS
GIV
A
D
N
G
QV
Y
------
L
SG
M
PL
--
AGKVQ
V
K
WG
EEENAH
-----
C
VANYQL
P
PESQQQLLTQLS
V
E
CR
fig|749546.3.peg.3220
Escherichia coli MS 185-1 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
I
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LE
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|362663.8.peg.292
Escherichia coli 536 (27-876/876)
R
FN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
I
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RYD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LE
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|362663.9.peg.291
Escherichia coli 536 (27-876/876)
R
FN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
I
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RYD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LE
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|749550.3.peg.425
Escherichia coli MS 200-1 (27-876/876)
R
FN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
I
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RYD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LE
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|656440.3.peg.3131
Escherichia coli TA206 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
I
N
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
A
S
CR
fig|199310.4.peg.1177
Escherichia coli CFT073 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVS
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
A
S
CR
fig|749528.3.peg.6
Escherichia coli MS 45-1 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVS
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
A
S
CR
fig|199310.1.peg.1199
Escherichia coli CFT073 (43-892/892)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVS
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
A
S
CR
fig|753642.3.peg.553
Escherichia coli NC101 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVS
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
V
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
TL
LE
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|869729.3.peg.2632
Escherichia coli UM146 (41-890/890)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
M
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
T
S
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|364106.7.peg.1193
Escherichia coli UTI89 (43-892/892)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
M
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
T
S
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|714962.3.peg.1120
Escherichia coli IHE3034 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
M
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
T
S
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|364106.8.peg.1191
Escherichia coli UTI89 (27-876/876)
KFN
I
L
P
L
A
FFIGIIVSPA--------R
A
ELY--
---
FN
PR
F
LS-D
--
DPDAVA
D
-
LSA
F
TQ
--
GQELP
PG
V
Y
R
V
D
I
Y
LN
DTYI
-
STR-D
V
QFQMSQDGK----
-
-QLAP
CL
S
--
PEH
M
SAM
G
V
N
RYAV
------
PGMERLP
--------
AD-T
C
TSLNSMIQGATFRF
D
V
--
GQQR
L
YL
T
VPQ
L
Y
M
SNQARG
Y
I
A
P
EY
WD
N
GI
T
A
ALLN
Y
DFSG-NRVRDSY---
-
GGT
S
DYAYLNLKT
GLN
I
G
S
WRLR
D
N
TSWSYSAG-
-
-K-
-
-
---------
GYSQNN
W
QHI
--------
NTW
L
E
R
DIVP
L
R
-
SR
L
T
M
GD
S
Y
TR
G
---
DIFD
G
VN
F
R
G
IQ
L
AS
D
DN
M
V
P
DSQR
G
Y
AP
T
I
H
GI
S
RG
T
S
R
IS
I
R
Q
N
G
YE
IY
QST
L
PPGPF
E
I
N
D
I
YP
A
GSG
GDL
Q
V
T
L
Q
E
A
DG
SVQRFN
VP
W
SS
V
P
V
L
Q
R
E
G
HL
K
Y
AL
S
A
GE
FRSG-
--
GHQQDNPR
F
AEGTLK
Y
G
LPAGW
T
V
YGG
A
-
WIA
E
R
Y
R
A
FNL
G
M
G
K
N
MGWL
GA
V
S
L
D
A
T
R
A
NARLP
-----
DE
-
S
---
RHD
G
Q
S
Y
R
FL
Y
N
K
SLTETG
T
NIQLI
GYRYS
TRG
Y
F
S
F
A
D
TAWK
--
KMSGYSVLTQDGVIQIQP
-
KYTDYYNL
-
AYNK
-
R
GRVQV
S
IS
Q
QTG-ESS
T
LYL
S
GSHQS
YW
GTDRTDR-QLNA
G
FNSSVND-ISWS
L
N
Y
S
LSRNAW
Q
H
-----------
-ET
D
RILSFDV
SIP
FSHWMRS
--
DSTSAWRNASAR
Y
SQTL
-
--EAHGQA
A
STA
G
LYG
T
S
LG
D
NN
-
L
G
Y
S
I
QS
G
YTRGGYEG-SSKT
GY
-
ASLN--
Y
RGGY
---
G
NASA
G
Y
S
HS
--
-GGYR
Q
LYYGL
SGG
IL
A
HAN
G
LT
L
S
-
QPLG--D
T
LI
LV
R
APG
AS
D
TR
I
E
-
NQTGVS
TD
WR
G
YA
V
LPYA
T
D
Y
RE
N
RVA
LD
T
N
T
L
AD
-
N
V
DIEN
T
V
VSVVPTH
GA
V
VRAD
Y
KTRV
G
VKVLMTLMR-N
G
KAVP
FG
S
V
V
T
---
A-
-
RNGGSS
-
I
A
G
E
N
G
QV
Y
------
L
SG
M
PL
--
SGQVS
V
K
WG
SQTTDQ
-----
C
TADYKL
P
KESAGQILSHVT
V
S
CR
fig|749531.3.peg.3144
Escherichia coli MS 69-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
ER
--
GQKIT
P
R
V
Y
R
V
D
I
V
LN
QTMV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
IDAF
------
LAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHRSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SINSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AML
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-DYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
SGSQQ
Q
LNYAL
SG
S
LV
A
HSR
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIV
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
GRVLMKTST-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINAAITRMD
A
I
CR
fig|749531.3.peg.3143
Escherichia coli MS 69-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
ER
--
GQKIT
P
R
V
Y
R
V
D
I
V
LN
QTMV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
IDAF
------
LAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHRSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SINSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AML
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-DYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
SGSQQ
Q
LNYAL
SG
S
LV
A
HSR
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIV
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
GRVLMKTST-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINAAITRMD
A
I
CR
fig|344601.5.peg.2765
Escherichia coli B171
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|340185.4.peg.3165
Escherichia coli E22
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|656408.3.peg.1009
Escherichia coli H591
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|679206.4.peg.3926
Escherichia coli MS 119-7
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|656443.3.peg.1237
Escherichia coli TA271
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|550672.3.peg.1218
Escherichia coli B088 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|344601.3.peg.2658
Escherichia coli B171 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|340185.3.peg.3000
Escherichia coli E22 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|656408.3.peg.1008
Escherichia coli H591 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|679206.4.peg.3927
Escherichia coli MS 119-7 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|656443.3.peg.1236
Escherichia coli TA271 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|331111.12.peg.1343
Escherichia coli E24377A
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
NTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|331111.3.peg.3545
Escherichia coli E24377A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
NTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|585055.8.peg.1008
Escherichia coli 55989
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|340184.6.peg.2150
Escherichia coli B7A
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|562.375.peg.4600
Escherichia coli EC4100B
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|585034.4.peg.973
Escherichia coli IAI1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|585034.5.peg.969
Escherichia coli IAI1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|595495.4.peg.2283
Escherichia coli KO11
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|679207.4.peg.4123
Escherichia coli MS 107-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749545.3.peg.3338
Escherichia coli MS 182-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749532.3.peg.2021
Escherichia coli MS 78-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|585396.4.peg.1052
Escherichia coli O111:H- str. 11128
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|573235.3.peg.1095
Escherichia coli O26:H11 str. 11368
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|409438.11.peg.1133
Escherichia coli SE11
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|566546.3.peg.1828
Escherichia coli W
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|566546.4.peg.1041
Escherichia coli W
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|585055.6.peg.1006
Escherichia coli 55989 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|340184.3.peg.2043
Escherichia coli B7A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|562.375.peg.4599
Escherichia coli EC4100B (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|595495.4.peg.2284
Escherichia coli KO11 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|679207.4.peg.4122
Escherichia coli MS 107-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749545.3.peg.3339
Escherichia coli MS 182-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749532.3.peg.2022
Escherichia coli MS 78-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|566546.3.peg.1829
Escherichia coli W (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749537.3.peg.4484
Escherichia coli MS 115-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749537.3.peg.4485
Escherichia coli MS 115-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|6666666.5357.peg.4743
Escherichia coli TY-2482
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GNAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|536056.3.peg.2852
Escherichia coli DH1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749538.3.peg.2985
Escherichia coli MS 116-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749548.3.peg.1449
Escherichia coli MS 196-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|316385.7.peg.1025
Escherichia coli str. K-12 substr. DH10B
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|595496.3.peg.873
Escherichia coli BW2952 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|83333.1.peg.925
Escherichia coli K12 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749538.3.peg.2986
Escherichia coli MS 116-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749548.3.peg.1448
Escherichia coli MS 196-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|316407.3.peg.906
Escherichia coli W3110 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|316385.5.peg.1010
Escherichia coli str. K-12 substr. DH10B (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|511145.6.peg.967
Escherichia coli str. K-12 substr. MG1655 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|155864.1.peg.1123
Escherichia coli O157:H7 EDL933 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLS-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGVF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
ISLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|679205.4.peg.970
Escherichia coli MS 124-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
C
D
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749533.3.peg.967
Escherichia coli MS 84-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
C
D
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|679205.4.peg.971
Escherichia coli MS 124-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
C
D
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|749533.3.peg.968
Escherichia coli MS 84-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
C
D
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GIQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYQL
T
EQQINSAITRMD
A
I
CR
fig|656417.3.peg.4275
Escherichia coli M605 (7-853/853)
MKRVVSLLLVIMPACSI
A
GMR--
---
FN
PA
F
LS-G
--
DTEAVA
D
-
LSR
F
EK
--
GMTYL
PG
S
Y
E
V
E
V
W
V
N
DSPL
-
LSR-T
V
TFKADDENQ----
-
--LIP
CL
S
--
LAD
L
LSL
G
I
N
KNAL
------
PEQALAS
--------
SENS
C
LDLRIWFPDVHYMP
E
L
--
DAQR
L
KL
T
F
PQA
I
I
KRDARG
Y
I
PP
EQ
WD
N
GI
T
A
FLLN
Y
DFSGNNDRGDYS---
-
---
S
NNYYLNLRA
G
I
N
I
G
A
WR
F
R
D
Y
STWSRGSNS
-
AG-
-
-
---------
-----K
L
EHI
--------
SST
L
Q
R
VIIP
F
R
-
SE
L
T
LGD
T
W
SS
S
---
D
V
FDS
VS
I
R
G
IK
L
ES
D
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
A
Q
VTI
K
Q
N
G
YV
IY
QTY
M
PPGPF
E
I
S
DL
NP
T
SSA
GDL
E
V
T
I
K
E
S
D
N
SETVYT
VP
Y
AA
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
Q
YRSN-
--
SYNQKSPY
V
FQGELI
W
G
LPWDI
T
A
YGG
A
-
QFS
E
D
Y
R
A
LAL
G
L
G
L
N
LGVF
GA
T
SFD
V
T
Q
A
NSSLV
-----
DG
-
S
---
KHQ
G
Q
S
Y
R
FL
YSK
SLVQTG
T
AFHII
GYRYS
TQG
F
Y
T
L
SD
TTYQ
--
QMSGTVVDPKTLDDKDYV
Y
NWNDFYNL
-
RYSK
-
R
GKFQA
S
VS
Q
PFG-NYG
S
MYL
S
ASQQT
YW
NTDKKDS-LYQV
G
YNTSIKG-IYLN
VA
W
N
YSKSPG
T
N
-----------
--A
D
KIVSLNV
S
L
P
ISNWLSS
TN
DGRSSSNAMTAT
Y
GYSQ
-
--DNHGQV
N
QYT
G
VSG
S
L
LE
Q
HN
-
L
SY
N
I
QH
G
FANQDNSS-SGSV
G
--
--VN--
Y
RGAY
---
G
SLNS
A
Y
S
YD
N
-
EGN-Q
Q
INYGI
SG
A
LV
V
HEN
G
LT
L
S
-
QPLG--E
T
NV
L
I
K
APG
AN
N
VD
V
Q
-
RGTGIS
TD
WR
G
YA
V
VPYA
T
E
Y
RR
N
NIS
LD
P
M
S
M
NM
-
H
T
ELDI
TS
TEVIPGK
GA
L
VRAE
F
AAHI
G
IRGLFTVRY-R
N
KSVP
FGA
T
A
S
AQI
K-
-
NSSQIT
GIV
G
D
N
G
QL
Y
------
L
SG
L
PL
--
EGVIN
I
Q
WG
DGVQQK
-----
C
QANYNL
P
ETELNNPVSYAT
L
E
CR
fig|656417.3.peg.4276
Escherichia coli M605 (18-864/864)
MKRVVSLLLVIMPACSI
A
GMR--
---
FN
PA
F
LS-G
--
DTEAVA
D
-
LSR
F
EK
--
GMTYL
PG
S
Y
E
V
E
V
W
V
N
DSPL
-
LSR-T
V
TFKADDENQ----
-
--LIP
CL
S
--
LAD
L
LSL
G
I
N
KNAL
------
PEQALAS
--------
SENS
C
LDLRIWFPDVHYMP
E
L
--
DAQR
L
KL
T
F
PQA
I
I
KRDARG
Y
I
PP
EQ
WD
N
GI
T
A
FLLN
Y
DFSGNNDRGDYS---
-
---
S
NNYYLNLRA
G
I
N
I
G
A
WR
F
R
D
Y
STWSRGSNS
-
AG-
-
-
---------
-----K
L
EHI
--------
SST
L
Q
R
VIIP
F
R
-
SE
L
T
LGD
T
W
SS
S
---
D
V
FDS
VS
I
R
G
IK
L
ES
D
EN
MLP
DSQS
GFAP
T
V
R
GIA
KS
R
A
Q
VTI
K
Q
N
G
YV
IY
QTY
M
PPGPF
E
I
S
DL
NP
T
SSA
GDL
E
V
T
I
K
E
S
D
N
SETVYT
VP
Y
AA
V
P
I
L
Q
R
E
G
HS
K
Y
ST
T
V
G
Q
YRSN-
--
SYNQKSPY
V
FQGELI
W
G
LPWDI
T
A
YGG
A
-
QFS
E
D
Y
R
A
LAL
G
L
G
L
N
LGVF
GA
T
SFD
V
T
Q
A
NSSLV
-----
DG
-
S
---
KHQ
G
Q
S
Y
R
FL
YSK
SLVQTG
T
AFHII
GYRYS
TQG
F
Y
T
L
SD
TTYQ
--
QMSGTVVDPKTLDDKDYV
Y
NWNDFYNL
-
RYSK
-
R
GKFQA
S
VS
Q
PFG-NYG
S
MYL
S
ASQQT
YW
NTDKKDS-LYQV
G
YNTSIKG-IYLN
VA
W
N
YSKSPG
T
N
-----------
--A
D
KIVSLNV
S
L
P
ISNWLSS
TN
DGRSSSNAMTAT
Y
GYSQ
-
--DNHGQV
N
QYT
G
VSG
S
L
LE
Q
HN
-
L
SY
N
I
QH
G
FANQDNSS-SGSV
G
--
--VN--
Y
RGAY
---
G
SLNS
A
Y
S
YD
N
-
EGN-Q
Q
INYGI
SG
A
LV
V
HEN
G
LT
L
S
-
QPLG--E
T
NV
L
I
K
APG
AN
N
VD
V
Q
-
RGTGIS
TD
WR
G
YA
V
VPYA
T
E
Y
RR
N
NIS
LD
P
M
S
M
NM
-
H
T
ELDI
TS
TEVIPGK
GA
L
VRAE
F
AAHI
G
IRGLFTVRY-R
N
KSVP
FGA
T
A
S
AQI
K-
-
NSSQIT
GIV
G
D
N
G
QL
Y
------
L
SG
L
PL
--
EGVIN
I
Q
WG
DGVQQK
-----
C
QANYNL
P
ETELNNPVSYAT
L
E
CR
fig|358709.5.peg.3458
Escherichia coli 101-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749547.3.peg.1896
Escherichia coli MS 187-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749547.3.peg.1895
Escherichia coli MS 187-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|316401.4.peg.1160
Escherichia coli ETEC H10407
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YKTN-
--
SNEQQVSK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|550677.3.peg.2216
Escherichia coli B354
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
IDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
D
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSSDSD
-
STD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SINSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AML
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-DYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGEF
---
A
DARV
G
Y
N
YS
D
-
SGSQQ
Q
LNYAL
SG
S
LV
A
HSR
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIV
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYDL
T
EQQINAAITRMD
A
I
CR
fig|413997.3.peg.991
Escherichia coli B str. REL606 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
G
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|511693.5.peg.1015
Escherichia coli BL21 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
G
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|469008.4.peg.2746
Escherichia coli BL21(DE3) (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
G
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
FGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|656414.3.peg.1161
Escherichia coli H736
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
I
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749540.3.peg.135
Escherichia coli MS 146-1
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
I
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|656414.3.peg.1160
Escherichia coli H736 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
I
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|749540.3.peg.134
Escherichia coli MS 146-1 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
VPLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
I
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|656444.3.peg.1564
Escherichia coli TA280 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
ER
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTMV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
IDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
D
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSSDSD
-
STD
S
Y--FXNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SINSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AML
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNHLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-DYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGETA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSR
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIV
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
QGAIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINAAITRMD
A
I
CR
fig|550676.3.peg.411
Escherichia coli B185
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
ALLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
INLLLPR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTTK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|656419.3.peg.1243
Escherichia coli M718
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVELTPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDARVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
P
V
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQVTLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTTK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|656419.3.peg.1242
Escherichia coli M718 (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVELTPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDARVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
P
V
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQVTLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTTK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|331112.6.peg.1018
Escherichia coli HS
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
E
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVELTPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
ALLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
ISY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
G
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPR
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHI
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
IPLR
FGA
I
A
T
--
L
D-
-
GVQANS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIS
V
R
WG
EAPDQI
-----
C
HINYEL
T
EQQINSAITRMD
A
I
CR
fig|562.371.peg.2090
Escherichia coli 1044A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
P
X
XR
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLS-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGVF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
ISLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|562.373.peg.1072
Escherichia coli 1125A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
P
X
XR
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLS-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGVF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
ISLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|562.372.peg.938
Escherichia coli 1212A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
P
X
XR
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLS-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGVF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
ISLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|562.374.peg.2472
Escherichia coli 536A (26-866/866)
FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
P
X
XR
WD
E
GINA
LLLG
Y
SFSGANSIHSSADSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
I
TI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
D
I
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNEQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
V
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLS-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
I
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLSR
--
SNDSYTSKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGVF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
ISLR
FGA
I
A
T
--
L
D-
-
GVQTNS
GI
I
D
D
D
G
SL
Y
------
M
A
G
L
PA
--
KGTIT
V
R
WG
EAPDQI
-----
C
HISYEL
T
EQQINSAITRMD
A
I
CR
fig|439855.10.peg.2334
Escherichia coli SMS-3-5 (1-850/851)
MP
SFIGGLVV--FVSAAFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSAGSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
VTI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPRGT
T
W
YGG
G
-
QYA
E
Y
Y
R
A
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPK
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHT
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
V
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
MPLR
FGA
M
A
T
--
L
D-
-
GAQTIS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
KGTIT
V
R
WG
DAPDQI
-----
C
HISYEL
T
EQQINAAITRMD
S
V
C
fig|585057.4.peg.2297
Escherichia coli IAI39 (26-865/866)
FVSAVFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSAGSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
VTI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
T
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPK
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHS
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
I
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
MPLR
FGA
M
A
T
--
L
D-
-
GAQTIS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
KGTIT
V
R
WG
DAPDQI
-----
C
HISYEL
T
EQQINAAITRMD
S
V
C
fig|585057.6.peg.2298
Escherichia coli IAI39 (26-865/866)
FVSAVFNAQ
A
ETW--
---
F
D
PA
F
FK-D
--
DPSMVA
D
-
LSR
F
EK
--
GQKIT
PG
V
Y
R
V
D
I
V
LN
QTIV
-
DTR-N
V
NFVEITPEK----
-
-GIAA
CLT
--
TES
L
DAM
G
V
N
TDAF
------
PAFKQL-
--------
DKQA
C
APLAEIIPDASVTF
N
V
--
NKLR
L
EI
SVPQ
I
A
I
KSNARG
Y
V
PP
ER
WD
E
GINA
LLLG
Y
SFSGANSIHSSAGSD
-
SGD
S
Y--FLNLNS
G
V
N
L
G
P
WRLR
N
N
STWSRSSGQ
-
TA-
-
-
---------
-----E
W
KNL
--------
SSY
L
Q
R
AVIP
L
K
-
GE
L
T
V
GD
D
Y
TA
G
---
D
F
FDS
VS
F
R
G
VQ
L
AS
D
DN
MLP
DSLK
GFAP
V
V
R
GIA
KS
NA
Q
VTI
K
Q
N
G
YT
IY
QTY
V
S
PG
A
F
E
I
S
DL
YS
T
SSS
GDL
L
V
E
I
K
E
A
DG
SVNSYS
VP
F
SS
V
P
L
L
Q
R
Q
G
RI
K
Y
AV
T
L
A
K
YRTN-
--
SNDQQESK
F
AQATLQ
W
G
GPWGT
T
W
YGG
G
-
QYA
E
Y
Y
R
T
AMF
G
L
G
F
N
LGDF
GA
I
SFD
A
T
Q
A
KSTLA
-----
DQ
-
S
---
EHK
G
Q
S
Y
R
FL
Y
A
K
TLNQLG
T
NFQLM
GYRYS
TSG
F
Y
T
L
SD
TMYK
--
HMDGYEFN--DGDDEDTP
-
MWSRYYNL
-
FYTK
-
R
GKLQV
N
IS
Q
QLG-EYG
S
FYL
S
GSQQT
YW
HTDQQDR-LLQF
G
YNTQIKD-LSLG
V
S
W
N
YSKSRG
Q
P
-----------
-DA
D
QVFALNF
S
L
P
LNLLLPK
--
SNDSYTRKKNYA
W
MTSN
T
SIDNEGHS
T
QNL
G
LTE
TL
LD
D
GN
-
L
SY
S
I
QQ
G
YNSEGKTA-NGS-
---
ASMD--
Y
KGAF
---
A
DARV
G
Y
N
YS
D
-
NGSQQ
Q
LNYAL
SG
S
LV
A
HSQ
G
IT
L
G
-
QSLG--E
T
NV
L
I
A
APG
AE
N
TR
V
A
-
NSTGLK
TD
WR
G
YT
V
VPYA
T
S
Y
RE
N
RIA
LD
A
A
S
L
KR
-
N
V
DLEN
AV
VNVVPTK
GA
L
VLAE
F
NAHA
G
ARVLMKTSK-Q
G
MPLR
FGA
M
A
T
--
L
D-
-
GAQTIS
GI
I
D
D
D
G
SL
Y
------
M
SG
L
PA
--
KGTIT
V
R
WG
DAPDQI
-----
C
HISYEL
T
EQQINAAITRMD
S
V
C
fig|656440.3.peg.3823
Escherichia coli TA206 (15-841/843)
FSFSLLALTIASALPAYG
G
K----
---
FN
PK
F
LE-N
VQ
GIDQHV
D
-
LSV
Y
DS
PV
GQQI-
PG
K
Y
R
V
F
V
F
V
N
EEKM
-
ASR-T
L
DFSTASEAQRKAS
G
ESLMP
CL
S
--
RVQ
L
EEM
G
V
R
IDSF
------
PALKILP
--------
PE-A
C
VAFDEIIPQATSRF
D
F
--
NTQT
L
HL
T
F
PQA
A
M
MMTARG
T
V
D
P
SR
WD
E
GI
P
A
LLLD
Y
SFSGSNGRNEGSGSS
P
DST
S
NSYYLNLRS
GLN
V
G
P
WRLR
N
N
SIWNRTDG-
-
---
-
-
---------
---KNQ
W
DNV
--------
GTS
L
N
R
AIIP
L
K
-
SQ
I
T
LGD
T
A
TP
G
---
E
IFDS
VQ
M
R
G
AL
L
AS
D
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
NA
E
V
S
I
E
Q
N
G
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
GDL
T
V
I
I
K
E
S
DG
SEQRFI
Q
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
GE
YRAG-
--
NYDSGKPR
F
GQFTAM
Y
G
LPWGM
T
A
YGG
A
-
LLS
A
D
Y
N
A
LAL
G
L
G
K
N
FGTI
GA
V
S
V
D
V
T
Q
A
KSQLR
-----
NN
-
E
---
KDE
G
Q
S
Y
R
FL
YSK
SF-EGG
T
DLRLL
GY
K
YS
TSG
Y
Y
T
F
Q
E
ATDV
--
RSD---------------
-
ADSDYRR-
-
-YHK
-
R
SQIQG
N
IT
Q
QLG-DYG
S
VYF
N
MTQQD
YW
NVDGKEN-SLSA
G
YHGHIGR-VNYS
IA
Y
T
WTRSPE
W
D
-----------
-ED
D
RLWSFSL
SIP
LGGAWGS
--
------------
Y
RMTT
-
--DQNGKT
S
QQA
S
VSG
TL
LE
D
RN
-
L
N
Y
N
V
QQ
G
YTSNGVGN-SGS-
---
VNMG--
Y
MGGS
---
G
NIDV
G
Y
N
YS
--
-KDNQ
Q
VNYGV
R
GG
VI
V
HSE
G
IT
L
S
-
QPLG--E
S
LA
I
V
S
APG
AR
G
GH
V
V
-
NSSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
T
IVE
L
R
S
D
T
L
GQ
-
N
V
ELQE
A
F
QKVVPTR
GAI
VRSR
F
DTRV
G
YRVLMSLKRAN
G
NAVP
FGA
T
A
A
-
LS
D-
-
ESKPAS
S
IV
G
E
E
G
QL
Y
------
I
SG
M
PE
--
EGELQ
V
S
WG
HEQAQR
-----
C
RVPFRL
P
EKKDNSGIVMVN
A
V
C
fig|585397.7.peg.4224
Escherichia coli ED1a (15-841/843)
FSFSLLALTIASALPAYG
G
K----
---
FN
PK
F
LE-N
VQ
GIDQHV
D
-
LSV
Y
DF
PV
GQQI-
PG
K
Y
R
V
F
V
F
V
N
EEKM
-
ASR-T
L
DFSTASEAQRKAS
G
ESLMP
CL
S
--
RVQ
L
EEM
G
V
R
VDSF
------
PALKILP
--------
PE-A
C
VAFDEIIPQATSRF
D
F
--
NTQT
L
HL
T
F
PQA
A
M
MMTARG
T
V
D
P
SR
WD
E
GI
P
A
LLLD
Y
SFSGSNGRNEGSGSS
P
DST
S
DSYYLNLRS
GLN
V
G
P
WRLR
N
N
SIWNRTDG-
-
---
-
-
---------
---KNQ
W
DNV
--------
GTS
L
N
R
AIIP
L
K
-
SQ
I
T
LGD
T
A
TP
G
---
E
IFDS
VQ
M
R
G
AL
L
AS
D
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
NA
E
V
S
I
E
Q
N
G
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
GDL
T
V
I
I
K
E
S
DG
SEQRFI
Q
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
GE
YRAG-
--
NYDSGKPR
F
GQFTAM
Y
G
LPWGM
T
A
YGG
A
-
LLS
A
D
Y
N
A
LAL
G
L
G
K
N
FGTI
GA
V
S
V
D
V
T
Q
A
KSQLR
-----
NN
-
E
---
KDE
G
Q
S
Y
R
FL
YSK
SF-EGG
T
DLRLL
GY
K
YS
TSG
Y
Y
T
F
Q
E
ATDV
--
RSD---------------
-
ADSDYRR-
-
-YHK
-
R
SQIQG
N
IT
Q
QLG-DYG
S
VYF
N
MTQQD
YW
NVDGKEN-SLSA
G
YHGHIGR-VNYS
IA
Y
T
WTRSPE
W
D
-----------
-ED
D
RLWSFSL
SIP
LGGAWGS
--
------------
Y
RMTT
-
--DQNGKT
S
QQA
S
VSG
TL
LE
D
RN
-
L
N
Y
N
V
QQ
G
YTSNGVGN-SGS-
---
VNMG--
Y
MGGS
---
G
NIDV
G
Y
N
YS
--
-KDNQ
Q
VNYGV
R
GG
VI
V
HSE
G
IT
L
S
-
QPLG--E
S
LA
I
V
S
APG
AR
G
GH
V
V
-
NSSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
T
IVE
L
R
S
D
T
L
GQ
-
N
V
ELQE
A
F
QKVVPTR
GAI
VRSR
F
DTRV
G
YRVLMSLKRAN
G
NAVP
FGA
T
A
A
-
LS
D-
-
ESKPAS
S
IV
G
E
E
G
QL
Y
------
I
SG
M
PE
--
EGELQ
V
S
WG
HEQAQR
-----
C
RVPFRL
P
EKKDNSGIVMVN
A
V
C
fig|585397.9.peg.4221
Escherichia coli ED1a (15-841/843)
FSFSLLALTIASALPAYG
G
K----
---
FN
PK
F
LE-N
VQ
GIDQHV
D
-
LSV
Y
DF
PV
GQQI-
PG
K
Y
R
V
F
V
F
V
N
EEKM
-
ASR-T
L
DFSTASEAQRKAS
G
ESLMP
CL
S
--
RVQ
L
EEM
G
V
R
VDSF
------
PALKILP
--------
PE-A
C
VAFDEIIPQATSRF
D
F
--
NTQT
L
HL
T
F
PQA
A
M
MMTARG
T
V
D
P
SR
WD
E
GI
P
A
LLLD
Y
SFSGSNGRNEGSGSS
P
DST
S
DSYYLNLRS
GLN
V
G
P
WRLR
N
N
SIWNRTDG-
-
---
-
-
---------
---KNQ
W
DNV
--------
GTS
L
N
R
AIIP
L
K
-
SQ
I
T
LGD
T
A
TP
G
---
E
IFDS
VQ
M
R
G
AL
L
AS
D
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
NA
E
V
S
I
E
Q
N
G
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
GDL
T
V
I
I
K
E
S
DG
SEQRFI
Q
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
GE
YRAG-
--
NYDSGKPR
F
GQFTAM
Y
G
LPWGM
T
A
YGG
A
-
LLS
A
D
Y
N
A
LAL
G
L
G
K
N
FGTI
GA
V
S
V
D
V
T
Q
A
KSQLR
-----
NN
-
E
---
KDE
G
Q
S
Y
R
FL
YSK
SF-EGG
T
DLRLL
GY
K
YS
TSG
Y
Y
T
F
Q
E
ATDV
--
RSD---------------
-
ADSDYRR-
-
-YHK
-
R
SQIQG
N
IT
Q
QLG-DYG
S
VYF
N
MTQQD
YW
NVDGKEN-SLSA
G
YHGHIGR-VNYS
IA
Y
T
WTRSPE
W
D
-----------
-ED
D
RLWSFSL
SIP
LGGAWGS
--
------------
Y
RMTT
-
--DQNGKT
S
QQA
S
VSG
TL
LE
D
RN
-
L
N
Y
N
V
QQ
G
YTSNGVGN-SGS-
---
VNMG--
Y
MGGS
---
G
NIDV
G
Y
N
YS
--
-KDNQ
Q
VNYGV
R
GG
VI
V
HSE
G
IT
L
S
-
QPLG--E
S
LA
I
V
S
APG
AR
G
GH
V
V
-
NSSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
T
IVE
L
R
S
D
T
L
GQ
-
N
V
ELQE
A
F
QKVVPTR
GAI
VRSR
F
DTRV
G
YRVLMSLKRAN
G
NAVP
FGA
T
A
A
-
LS
D-
-
ESKPAS
S
IV
G
E
E
G
QL
Y
------
I
SG
M
PE
--
EGELQ
V
S
WG
HEQAQR
-----
C
RVPFRL
P
EKKDNSGIVMVN
A
V
C
fig|685038.3.peg.3604
Escherichia coli O83:H1 str. NRG 857C (15-841/843)
FSFSLLALTIASALPAYG
G
K----
---
FN
PK
F
LE-N
VQ
GIDQHV
D
-
LSV
Y
DS
PV
GQQI-
PG
K
Y
R
V
F
V
F
V
N
EEKM
-
ASR-T
L
DFSTASEAQRKAS
G
ESLMP
CL
S
--
RVQ
L
EEM
G
V
R
IDSF
------
PALKILP
--------
PE-A
C
VAFDEIIPQATSRF
D
F
--
NTQT
L
HL
T
F
PQA
A
M
MMTARG
T
V
D
P
SR
WD
E
GI
P
A
LLLD
Y
SFSGSNGRNEGSGSS
P
DST
S
DSYYLNLRS
GLN
V
G
P
WRLR
N
N
SIWNRTDG-
-
---
-
-
---------
---KNQ
W
DNV
--------
GTS
L
N
R
AIIP
L
K
-
SQ
I
T
LGD
T
A
TP
G
---
E
IFDS
VQ
M
R
G
TL
L
AS
D
DE
MLP
DSQR
GFAP
V
V
R
GIA
KS
NA
E
V
S
I
E
Q
N
G
YV
IY
RTF
V
Q
PG
A
F
E
I
N
DL
YA
T
SGS
GDL
T
V
I
I
K
E
S
DG
SEQRFI
Q
P
F
S
A
V
A
I
F
Q
R
E
G
YL
K
Y
SL
A
A
GE
YRAG-
--
NYDSGKPR
F
GQFTAM
Y
G
LPWGM
T
A
YGG
A
-
LLS
A
D
Y
N
A
LAL
G
L
G
K
N
FGTI
GA
V
S
V
D
V
T
Q
A
KSQLR
-----
NN
-
E
---
KDE
G
Q
S
Y
R
FL
YSK
SF-EGG
T
DLRLL
GY
K
YS
TSG
Y
Y
T
F
Q
E
ATDV
--
RSD---------------
-
ADSDYRR-
-
-YHK
-
R
SQIQG
N
IT
Q
QLG-DYG
S
VYF
N
MTQQD
YW
NVDGKEN-SLSA
G
YHGHIGR-VNYS
IA
Y
T
WTRSPE
W
D
-----------
-ED
D
RLWSFSL
SIP
LGGAWGS
--
------------
Y
RMTT
-
--DQNGKT
S
QQA
S
VSG
TL
LE
D
RN
-
L
N
Y
N
V
QQ
G
YTSNGVGN-SGS-
---
VNMG--
Y
MGGS
---
G
NIDV
G
Y
N
YS
--
-KDNQ
Q
VNYGV
R
GG
VI
V
HSE
G
IT
L
S
-
QPLG--E
S
LA
I
V
S
APG
AR
G
GH
V
V
-
NSSGVE
V
D
WM
G
NA
V
VPYL
T
P
Y
RE
T
IVE
L
R
S
D
T
L
GQ
-
N
V
ELQE
A
F
QKVVPTR
GAI
VRSR
F
DTRV
G
YRVLMSLKRAN
G
NAVP
FGA
T
A
A
-
LS
D-
-
ESKPAS
S
IV
G
E
E
G
QL
Y
------
I
SG
M
PE
--
EGELQ
V
S
WG
HEQAQR
-----
C
RVPFRL
P
EKKDNSGIVMVN
A
V
C
fig|550676.3.peg.3941
Escherichia coli B185 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFVQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YG
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
TEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
SSNHHDDNND
N
FGK
E
QIFMFSM
SIP
LSGWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FNYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|562.371.peg.2995
Escherichia coli 1044A (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|562.373.peg.922
Escherichia coli 1125A (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|562.372.peg.3880
Escherichia coli 1212A (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|562.374.peg.1969
Escherichia coli 536A (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|83334.1.peg.4638
Escherichia coli O157:H7 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|155864.1.peg.4671
Escherichia coli O157:H7 EDL933 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|155864.8.peg.4630
Escherichia coli O157:H7 EDL933 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444454.5.peg.3755
Escherichia coli O157:H7 str. EC4024 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444449.5.peg.3209
Escherichia coli O157:H7 str. EC4042 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444448.5.peg.1965
Escherichia coli O157:H7 str. EC4045 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444453.5.peg.1495
Escherichia coli O157:H7 str. EC4076 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444452.5.peg.1603
Escherichia coli O157:H7 str. EC4113 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444450.8.peg.5047
Escherichia coli O157:H7 str. EC4115 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|444451.5.peg.18
Escherichia coli O157:H7 str. EC4196 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|478004.5.peg.522
Escherichia coli O157:H7 str. EC4401 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|478005.5.peg.456
Escherichia coli O157:H7 str. EC4486 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|478006.5.peg.507
Escherichia coli O157:H7 str. EC4501 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|478007.5.peg.1566
Escherichia coli O157:H7 str. EC508 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|478008.5.peg.1848
Escherichia coli O157:H7 str. EC869 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|386585.9.peg.4872
Escherichia coli O157:H7 str. Sakai (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|544404.4.peg.4857
Escherichia coli O157:H7 str. TW14359 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|502346.5.peg.1950
Escherichia coli O157:H7 str. TW14588 (7-844/844)
FITLASGICLLCSISAFA
R
DSL--
---
FN
PR
L
LELD
--
HPADNI
D
-
IHQ
F
NR
--
SNTLP
A
G
T
Y
K
V
D
V
M
I
N
GMLF
-
ERQ-E
V
KFAQDNPDA----
-
-ELHP
C
YVAI
KNV
L
ATY
G
I
K
VDAI
------
KSLANV-
--------
DDKT
C
VNPVPLIDGATWLL
D
A
--
SKLA
L
NI
T
I
PQ
I
Y
L
NNAVNG
Y
I
S
P
SR
WD
Q
GINA
MMMN
Y
DFSASHTIRSNY---
-
DDD
D
DSYYLNLRN
G
I
N
L
G
A
WR
F
R
N
Y
STLNSYDGN
-
V--
-
-
---------
-----D
Y
HSV
--------
SNY
I
Q
R
DIMA
L
R
-
SQ
I
M
I
GD
T
W
TA
S
---
D
V
FDS
TQ
V
R
G
VR
L
YT
D
DD
MLP
SSQN
GFAP
V
V
H
GIA
KT
NA
T
V
I
I
K
Q
N
G
YV
IY
QSA
VP
Q
G
A
F
A
L
T
DL
NT
T
SSG
GDL
D
V
T
I
K
E
E
DG
SEQHFI
Q
P
F
T
S
L
A
I
L
K
R
E
G
QT
DV
DL
S
I
GE
VRDE-
--
SGF--TPE
V
LQLQAM
HG
FPLGI
T
L
YGG
T
-
QLA
N
D
Y
A
S
AAL
G
I
G
K
D
MGAL
GA
I
SFD
V
T
H
A
RSQFD
-----
YD
-
D
---
NES
G
Q
S
Y
R
FL
YSK
RFEDTN
T
TFRLV
GYRYS
MEG
F
Y
T
L
NE
WV-S
--
RQD---------------
-
NDSDFW-V
-
TGNR
-
R
SRFEG
T
WT
Q
SFTPGWG
N
IYL
T
FSRQE
YW
QTDEVER-LLQF
G
YNNNWRN-ISWN
V
S
W
N
YTDSIK
R
S
LGNHHDDNNDD
FGK
E
QIFMFSM
SIP
LSCWMED
--
--------SYVN
Y
SLTQ
-
--NNHHES
T
MQV
G
LNG
T
M
LE
G
RN
-
L
SY
N
V
QE
S
WMHSPDDSYSGN-
---
AGMT--
Y
DGTY
---
G
SVNG
S
Y
S
WS
--
-RDSQ
H
FDYGA
R
GG
VL
V
HSD
G
VT
F
S
-
QELG--E
T
VA
LV
K
APG
AE
G
LS
I
E
-
NATGIS
TD
WR
G
YT
V
KTQL
S
P
Y
DE
N
RVA
L
N
S
D
Y
F
SK
A
N
I
ELEN
T
V
INLVPTR
GA
V
VKAE
F
VTHV
G
YRVLFNVRQVN
G
KPIM
FGA
M
A
T
-
AS
L-
-
ETGTVT
GIV
G
D
N
G
EL
Y
------
L
SG
M
PE
--
KGEFL
L
S
WG
QAADEK
-----
C
KAAYHI
T
HKPDDTSLVQMD
A
I
CR
fig|409438.11.peg.16
Escherichia coli SE11 (2-831/835)
K
Y
SK
LFLSVGLALVTL---SGWG
R
TYT--
---
F
D
PS
L
VE-S
--
SGGDSV
D
-
VSL
F
NQ
--
GLQL-
PG
E
Y
F
V
S
I
F
V
N
GEKV
-
GSD-N
I
NFRIE-NHNGED-
-
-TLSP
CL
N
--
ADQ
L
TKY
G
ID
IHKY
------
SDLFNA-
--------
GPEQ
C
ANLWA-IPQADIQF
D
F
--
NQQK
L
SL
L
L
P
TQ
A
L
LPKLNG
I
A
P
E
QL
WD
D
GI
P
A
LFMN
Y
QTNMQQREYQGAY--
-
KSH
D
ESYYAQLQP
GLN
I
G
P
WR
F
R
S
A
ASWQKEQG-
-
---
-
-
---------
------
W
QRS
--------
YIY
A
E
R
GLNT
I
K
-
GR
L
T
LG
E
S
Y
SD
G
---
S
IFDS
IP
F
T
G
GK
L
AS
D
ET
MLP
YDQW
S
F
S
P
V
I
R
G
V
A
RT
Q
A
R
V
E
V
Q
Q
N
G
YT
V
S
NDL
I
P
S
GPF
E
L
T
N
L
PL
G
GGS
GDL
K
V
I
V
H
E
S
DG
TQQVFT
VP
Y
D
T
P
A
V
A
L
R
Q
G
YF
E
Y
SV
M
G
GE
YRPA-
--
NDAVQTTP
V
GALEMK
Y
G
LPWNL
T
L
YGG
L
-
QGA
G
N
Y
Q
A
AAL
G
I
G
S
L
LGDF
GA
L
S
A
D
V
V
Q
S
NSKKD
-----
NQ
-
Q
---
KES
G
Q
R
W
R
VR
Y
N
K
SLD-SG
T
SVNIA
SEE
Y
A
TEG
F
N
T
L
SD
TLNT
YC
KPDAGNI-----------
-
CYSD----
-
YKKP
-
K
NKVNL
S
IS
Q
TTD-GWG
T
FNF
N
GYRQN
YW
NDKSTTT-SFTA
G
YSRMFDSGISLN
VN
L
S
KTQNID
K
N
---
G
------
K
KTN
D
RLTSLWL
S
F
P
LSRWL--
--
----SNSSVNAN
Y
QMTS
-
--DTRGDS
M
HEF
G
VYG
DA
F-
N
RQ
-
L
H
W
D
L
RE
R
YRDNASDN-KAS-
---
SALSLN
Y
RGTY
---
G
ELRG
N
Y
S
YD
--
-KKQR
Q
LGIGI
N
G
N
IV
A
TQY
G
IT
A
G
-
QSSG--D
T
MA
LV
Q
APG
VD
G
AS
V
G
-
YWPGMK
TD
FR
G
YT
S
YGYL
T
P
Y
RE
N
NID
I
N
P
V
T
L
PK
-
N
A
EISQ
TS
TRVVPTK
GA
V
VLAK
F
DTRI
G
GRLLLQLKRSD
N
KPVP
FG
S
V
A
T
VEG
Q-
-
--ASSS
GIV
G
D
N
S
QV
Y
------
L
T
GV
PK
--
EATVK
I
Q
WG
KDKTQS
-----
C
HARVLL
P
EDVNTTGIYNLT
A
V
C
fig|656417.3.peg.5216
Escherichia coli M605 (16-830/845)
LLCPAFSWGAASPVPGHN
N
ELS--
---
FN
PD
F
LELS
DG
NNAKNI
D
-
LSY
F
MN
--
ASGAA
PG
E
Y
T
V
D
V
I
M
N
GKIV
-
DSQIK
I
DFTEQDN------
-
-ELTA
R
LT
--
PQQ
L
ERW
G
ID
TNKL
------
---SVSS
--------
PSAY
Q
QNIAYIIQGATEKF
D
A
--
NNQK
L
IL
NI
PQ
S
Y
L
KPQDWL
S
T
PP
HL
WD
E
G
M
P
A
LMVN
Y
LFNGIKQNNN-----
-
GYG
S
RSQFLSLDS
S
LN
L
G
G
WRLR
H
N
GNWSTNSYN
-
KE-
-
-
---------
S----H
W
QPV
--------
SVY
L
Q
H
DYSF
L
Q
G
GQ
F
T
V
G
Q
T
S
TD
G
---
A
IFDS
FP
F
E
G
AQ
F
SS
D
DG
M
I
A
PELS
Q
Y
S
P
V
V
R
GIA
YS
Q
A
Q
V
SV
K
Q
N
G
VV
IY
QKN
VPPGPF
E
L
R
D
F
NQ
I
FT-
GDL
E
V
E
I
R
E
A
DG
TIRHFT
QA
T
A
V
L
P
I
L
Q
R
Q
G
RL
RY
NL
A
F
G
K
YRSSS
TL
NNSVSEPQ
F
VQSSAA
I
G
LPDEY
T
F
YGG
G
-
IKA
D
N
Y
A
A
VLL
G
L
G
K
Y
SDFF
GA
F
S
L
D
V
T
H
A
RSQFS
-----
QN
Y
K
SFG
KQQ
G
Q
S
Y
R
FM
YS
R
GFGETN
T
TLNIT
GYRY
A
TRG
Y
Y
D
F
D
E
LQQI
--
QS----------------
-
GFVDEKNI
N
SYHQ
-
R
SRIST
T
IS
Q
DLQ-DWG
Q
FYL
S
ASKDQ
YW
DTADGYN--LSA
S
YSLPFRY-ISAM
LS
L
G
YNKSPY
Y
H
-----------
-EA
D
KSLFLSV
S
V
P
LNSLLNG
--
----------NN
L
FLTT
N
TITNNGQV
Q
QQV
G
LNG
S
S
-Q
D
GE
-
F
N
Y
A
V
AQ
G
WQNQNRGE-SGN-
---
INLN--
Y
RGPY
---
A
QMNG
G
Y
A
WQ
--
-KESS
Q
WTYGI
SGG
IT
L
HPH
G
LT
L
S
-
QPLSLDS
A
SA
LV
Q
A
RD
AA
S
VK
V
L
-
NGSGIY
TD
WR
G
YA
V
VPYL
N
P
Y
NR
N
QIS
LD
V
N
S
V
KD
-
N
V
ELLN
TD
VTVIPSR
GA
L
VSAP
F
KVNV
G
NKAIITLIQKN
G
EPVP
FG
S
V
V
T
-
LD
S-
-
ENSINS
S
IV
A
D
Q
G
QV
Y
------
M
SG
L
PE
--
EDTLI
A
Q
WG
EEASQK
-----
C
KTNYKL
fig|431946.3.peg.4170
Escherichia coli SE15 (16-830/845)
LLCPAFSWGAASPVPGHN
N
ELS--
---
FN
PD
F
LELS
DG
NNAKNI
D
-
LSY
F
MN
--
ASGAA
PG
E
Y
T
V
D
V
I
M
N
GKIV
-
DSQIK
I
DFTEQDN------
-
-ELTA
R
LT
--
PQQ
L
ERW
G
ID
TNKL
------
---SVSS
--------
PSAY
Q
QNIAYIIQGATEKF
D
A
--
NNQK
L
IL
NI
PQ
S
Y
L
KPQDWL
S
T
PP
HL
WD
E
G
M
P
A
LMVN
Y
LFNGIKQNNN-----
-
GYG
S
RSQFLSLDS
S
LN
L
G
G
WRLR
H
N
GNWSTNSYN
-
KE-
-
-
---------
S----H
W
QPV
--------
SVY
L
Q
H
DYSF
L
Q
G
GQ
F
T
V
G
Q
T
S
TD
G
---
A
IFDS
FP
F
E
G
AQ
F
SS
D
DG
M
I
A
PELS
Q
Y
S
P
V
V
R
GIA
YS
Q
A
Q
V
SV
K
Q
N
G
VV
IY
QKN
VPPGPF
E
L
R
D
F
NQ
I
FT-
GDL
E
V
E
I
R
E
A
DG
TIRHFT
QA
T
A
V
L
P
I
L
Q
R
Q
G
RL
RY
NL
A
F
G
K
YRSSS
TL
NNSVSEPQ
F
VQSSAA
I
G
LPDEY
T
F
YGG
G
-
IKA
D
N
Y
A
A
VLL
G
L
G
K
Y
SDFF
GA
F
S
L
D
V
T
H
A
RSQFS
-----
QN
Y
K
SFG
KQQ
G
Q
S
Y
R
FM
YS
R
GFGETN
T
TLNIT
GYRY
A
TRG
Y
Y
D
F
D
E
LQQI
--
QS----------------
-
GFVDEKNI
N
SYHQ
-
R
SRIST
T
IS
Q
DLQ-DWG
Q
FYL
S
ASKDQ
YW
DTADGYN--LSA
S
YSLPFRY-ISAM
LS
L
G
YNKSPY
Y
H
-----------
-EA
D
KSLFLSV
S
V
P
LNSLLNG
--
----------NN
L
FLTT
N
TITNNGQV
Q
QQV
G
LNG
S
S
-Q
D
GE
-
F
N
Y
A
V
AQ
G
WQNQNRGE-SGN-
---
INLN--
Y
RGPY
---
A
QMNG
G
Y
A
WQ
--
-KESS
Q
WTYGI
SGG
IT
L
HPH
G
LT
L
S
-
QPLSLDS
A
SA
LV
Q
A
RD
AA
S
VK
V
L
-
NGSGIY
TD
WR
G
YA
V
VPYL
N
P
Y
NR
N
QIS
LD
V
N
S
V
KD
-
N
V
ELLN
TD
VTVIPSR
GA
L
VSAP
F
KVNV
G
NKAIITLIQKN
G
EPVP
FG
S
V
V
T
-
LD
S-
-
ENSINS
S
IV
A
D
Q
G
QV
Y
------
M
SG
L
PE
--
EDTLI
A
Q
WG
EEASQK
-----
C
KTNYKL
fig|216593.1.peg.3415
Escherichia coli E2348/69 (3-811/825)
L
P
M
HR
TFVLTGIIFALSAVYSLSY
A
RDE--
---
FN
LR
I
LELD
--
SPLENT
QV
LAD
F
IN
--
NNNLT
PG
V
Y
L
T
S
V
M
WG
QDSL
-
DKR-N
I
TFVLSSDKK----
-
-SLIP
RF
T
--
KAD
L
REF
GL
K
VDVI
------
PALKVMN
--------
DDTE
V
GDIAQIIDGARYDF
Q
L
--
DSQT
L
WL
R
I
PQ
I
Y
Q
NAIAAG
S
I
A
P
KY
W
N
D
G
E
S
A
AWLS
Y
YASGSRQNSD-----
-
GDN
L
SSNWLNLNS
G
I
N
L
G
A
WRLR
N
N
TVYNES---
-
---
-
-
---------
-----N
W
ESI
--------
STS
L
Q
R
DIKA
L
R
-
SQ
M
E
I
G
Q
T
F
TN
G
---
D
L
FDS
VQ
M
T
G
IK
L
ET
D
TS
MLP
DSEQ
GFAP
V
V
R
V
IA
NS
D
A
Q
V
V
I
K
Q
N
G
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQ
V
TAG
S
DL
E
V
T
I
K
E
T
N
G
QEHSFI
QA
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
A
G
K
YRDS-
--
DNNAETPV
F
GVATAI
Y
G
LPYGI
T
I
YGG
I
-
LGA
S
M
Y
H
S
GVT
G
I
G
A
D
LGRL
G
S
V
S
V
D
I
T
A
A
KTKFD
-----
DG
R
D
---
DAT
G
L
S
W
R
AQ
Y
A
K
DFPDTD
T
TVTLA
S
YRYS
TSQ
F
Y
T
F
Q
E
ALDQ
--
R-----------DTPDDK
G
IYS--YRQ
-
TNNR
-
R
NRLQI
N
LS
Q
NIG-RWG
S
VYL
N
GYQQD
YW
GMHGAER-SIGM
G
YSTTWNN-INWS
VN
Y
T
LTKTPG
M
T
-----------
--G
E
QQFSLTL
N
IP
LSRWLPD
--
-----------S
W
AMYN
V
NRSDKSNT
S
HQL
G
IGG
T
A
LQ
D
NN
-
L
SY
N
L
QQ
S
YT-------DNNV
GY
G
ASMNGR
Y
RSSV
---
G
EFGL
G
Y
S
YD
--
-KNSR
Q
WNYSA
Q
G
A
VV
A
HAH
G
VT
L
G
-
QSVQ--D
S
FA
I
V
H
INE
GA
N
VK
V
Q
-
NAQGVY
TD
FW
G
NA
I
VPNM
T
N
Y
RH
N
AIT
V
N
T
Q
G
H
D-
-
S
L
DISD
A
T
QDVIPSK
GA
V
VGVD
F
DARS
G
MRALLTLVH-N
K
ERVP
FGA
L
L
T
---
--
-
-LGNST
A
IV
G
E
D
G
EV
Y
------
I
T
GV
QE
--
SMTFT
V
Q
WG
KEINQQ
-----
C
TGVITE
P
E
fig|574521.7.peg.162
Escherichia coli O127:H6 str. E2348/69 (3-811/825)
L
P
M
HR
TFVLTGIIFALSAVYSLSY
A
RDE--
---
FN
LR
I
LELD
--
SPLENT
QV
LAD
F
IN
--
NNNLT
PG
V
Y
L
T
S
V
M
WG
QDSL
-
DKR-N
I
TFVLSSDKK----
-
-SLIP
RF
T
--
KAD
L
REF
GL
K
VDVI
------
PALKVMN
--------
DDTE
V
GDIAQIIDGARYDF
Q
L
--
DSQT
L
WL
R
I
PQ
I
Y
Q
NAIAAG
S
I
A
P
KY
W
N
D
G
E
S
A
AWLS
Y
YASGSRQNSD-----
-
GDN
L
SSNWLNLNS
G
I
N
L
G
A
WRLR
N
N
TVYNES---
-
---
-
-
---------
-----N
W
ESI
--------
STS
L
Q
R
DIKA
L
R
-
SQ
M
E
I
G
Q
T
F
TN
G
---
D
L
FDS
VQ
M
T
G
IK
L
ET
D
TS
MLP
DSEQ
GFAP
V
V
R
V
IA
NS
D
A
Q
V
V
I
K
Q
N
G
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQ
V
TAG
S
DL
E
V
T
I
K
E
T
N
G
QEHSFI
QA
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
A
G
K
YRDS-
--
DNNAETPV
F
GVATAI
Y
G
LPYGI
T
I
YGG
I
-
LGA
S
M
Y
H
S
GVT
G
I
G
A
D
LGRL
G
S
V
S
V
D
I
T
A
A
KTKFD
-----
DG
R
D
---
DAT
G
L
S
W
R
AQ
Y
A
K
DFPDTD
T
TVTLA
S
YRYS
TSQ
F
Y
T
F
Q
E
ALDQ
--
R-----------DTPDDK
G
IYS--YRQ
-
TNNR
-
R
NRLQI
N
LS
Q
NIG-RWG
S
VYL
N
GYQQD
YW
GMHGAER-SIGM
G
YSTTWNN-INWS
VN
Y
T
LTKTPG
M
T
-----------
--G
E
QQFSLTL
N
IP
LSRWLPD
--
-----------S
W
AMYN
V
NRSDKSNT
S
HQL
G
IGG
T
A
LQ
D
NN
-
L
SY
N
L
QQ
S
YT-------DNNV
GY
G
ASMNGR
Y
RSSV
---
G
EFGL
G
Y
S
YD
--
-KNSR
Q
WNYSA
Q
G
A
VV
A
HAH
G
VT
L
G
-
QSVQ--D
S
FA
I
V
H
INE
GA
N
VK
V
Q
-
NAQGVY
TD
FW
G
NA
I
VPNM
T
N
Y
RH
N
AIT
V
N
T
Q
G
H
D-
-
S
L
DISD
A
T
QDVIPSK
GA
V
VGVD
F
DARS
G
MRALLTLVH-N
K
ERVP
FGA
L
L
T
---
--
-
-LGNST
A
IV
G
E
D
G
EV
Y
------
I
T
GV
QE
--
SMTFT
V
Q
WG
KEINQQ
-----
C
TGVITE
P
E
fig|431946.3.peg.168
Escherichia coli SE15 (5-811/825)
L
HR
TFVLTGITFALSAVYSLSY
A
RDE--
---
FN
LR
I
LELD
--
SPLENT
QV
LED
F
VN
--
NNNLT
PG
V
Y
L
T
S
V
M
WG
QEYL
-
DKR-N
I
TFILSSDKK----
-
-RLIP
RF
T
--
KAD
L
REF
GL
K
VDDI
------
PALQVMD
--------
DDTE
F
GDIAQIIDGARYDF
Q
L
--
DSQT
L
CL
R
I
PQ
I
Y
Q
NARAAG
S
I
S
P
KY
W
S
D
G
E
S
A
VWLS
Y
YASGSRQNSD-----
-
GDN
L
NSNWLNLNS
G
I
N
L
G
V
WRLR
N
N
TVYSDS---
-
---
-
-
---------
-----S
W
ESI
--------
STS
L
Q
R
DIKA
L
R
-
SQ
M
E
V
G
Q
T
F
TN
G
---
D
L
FDS
VQ
M
T
G
IK
L
ET
D
TS
MLP
DSEQ
GFAP
V
V
R
GIA
NS
D
A
Q
V
V
I
K
Q
N
G
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQ
V
TAG
A
DL
E
V
T
I
K
E
T
N
G
QEHSFI
QA
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
T
G
K
YRDN-
--
DNHAETPV
F
GVATAI
Y
G
LPYGI
T
I
YGG
I
-
LGA
S
I
Y
H
S
GVT
G
I
G
A
D
LGRL
G
S
V
S
V
D
I
T
A
A
ETKFD
-----
DG
R
D
---
DAT
G
L
S
W
R
AQ
Y
A
K
DFPDTD
T
TVTLA
S
YRYS
TSQ
F
Y
T
F
Q
E
ALDQ
--
R-----------DTPDDK
G
IYS--YRQ
-
TNNR
-
R
NRLQI
N
LS
Q
NIG-RWG
S
VYL
N
GYQQD
YW
GMHGAER-SIGM
G
YSTTWSN-INWS
VN
Y
T
LTKTPG
M
A
-----------
--G
E
QQFSLTL
N
IP
LSRWLPD
--
-----------S
W
AMYN
V
NRSDKSNT
S
HQL
G
IGG
T
A
LQ
D
NN
-
L
SY
N
L
QQ
S
YT-------DNNV
GY
D
ASMNGR
Y
RSSV
---
G
EFGL
G
Y
S
YD
--
-KNSR
Q
WNYSA
Q
G
A
VV
A
HAH
G
VT
L
G
-
QSVQ--D
S
FA
I
V
H
INE
GA
N
VK
V
Q
-
NAQGVY
TD
YW
G
NA
I
VPNM
T
N
Y
RH
N
AIT
V
N
T
Q
G
H
D-
-
S
L
DISD
A
T
QDVIPSK
GA
V
VGVD
F
DARS
G
IRALLTLVH-N
K
ERVP
FGA
L
L
T
---
--
-
-LGNST
A
IV
G
E
D
G
EV
Y
------
I
T
GV
QE
--
SMTFT
V
Q
WG
KEINQQ
-----
C
TGVVTV
P
E
fig|656417.3.peg.249
Escherichia coli M605 (5-811/825)
L
HR
TFVLTGITFALSAVYSLSY
A
RDE--
---
FN
LR
I
LELD
--
SPLENT
QV
LED
F
VN
--
NNNLT
PG
V
Y
L
T
S
V
M
WG
QEYL
-
DKR-N
I
TFILSSDKK----
-
-RLIP
RF
T
--
KAD
L
REF
GL
K
VDDI
------
PALQVMD
--------
DDTE
F
GDIAQIIDGARYDF
Q
L
--
DSQT
L
CL
R
I
PQ
I
Y
Q
NARAAG
S
I
S
P
KY
W
S
D
G
E
S
A
VWLS
Y
YASGSRQNSD-----
-
GDN
L
NSNWLNLNS
G
I
N
L
G
V
WRLR
N
N
TVYSDS---
-
---
-
-
---------
-----S
W
ESI
--------
STS
L
Q
R
DIKA
L
R
-
SQ
M
E
V
G
Q
T
F
TN
G
---
D
L
FDS
VQ
M
T
G
IK
L
ET
D
TS
MLP
DSEQ
GFAP
V
V
R
GIA
NS
D
A
Q
V
V
I
K
Q
N
G
YV
IY
QTW
V
SA
GPF
E
I
K
DL
SQ
V
TAG
A
DL
E
V
T
I
K
E
T
N
G
QEHSFI
QA
S
S
T
V
P
I
L
Q
R
E
G
AL
K
Y
SL
A
T
G
K
YRDN-
--
DNHAETPV
F
GVATAI
Y
G
LPYGI
T
I
YGG
I
-
LGA
S
I
Y
H
S
GVT
G
I
G
A
D
LGRL
G
S
V
S
V
D
I
T
A
A
ETKFD
-----
DG
R
D
---
DAT
G
L
S
W
R
AQ
Y
A
K
DFPDTD
T
TVTLA
S
YRYS
TSQ
F
Y
T
F
Q
E
ALDQ
--
R-----------DTPDDK
G
IYS--YRQ
-
TNNR
-
R
NRLQI
N
LS
Q
NIG-RWG
S
VYL
N
GYQQD
YW
GMHGAER-SIGM
G
YSTTWSN-INWS
VN
Y
T
LTKTPG
M
A
-----------
--G
E
QQFSLTL
N
IP
LSRWLPD
--
-----------S
W
AMYN
V
NRSDKSNT
S
HQL
G
IGG
T
A
LQ
D
NN
-
L
SY
N
L
QQ
S
YT-------DNNV
GY
G
ASINGR
Y
RSSV
---
G
EFGL
G
Y
S
YD
--
-KNSR
Q
WNYSA
Q
G
A
VV
A
HAH
G
VT
L
G
-
QSVQ--D
S
FA
I
V
H
INE
GA
N
VK
V
Q
-
NAQGVY
TD
YW
G
NA
I
VPNM
T
N
Y
RH
N
AIT
V
N
T
Q
G
H
D-
-
S
L
DISD
A
T
QDVIPSK
GA
V
VGVD
F
DARS
G
IRALLTLVH-N
K
ERVP
FGA
L
L
T
---
--
-
-LGNST
A
IV
G
E
D
G
EV
Y
------
I
T
GV
QE
--
SMTFT
V
Q
WG
KEINQQ
-----
C
TGVVTV
P
E
fig|409438.11.peg.4936
Escherichia coli SE11 (20-851/870)
LSSAAY
A
EDY--
---
F
D
PD
L
LSLG
--
NRDMSL
TD
LSA
F
SE
--
QGYSA
PG
V
Y
I
V
D
I
Y
V
N
GNYL
-
KTD-S
I
RFEHDKTN-----
-
-TLKP
LF
S
--
LND
L
NEI
G
V
N
LHSL
------
KGTEHLP
--------
HDRA
S
IDNLSLIPFSSFVF
D
N
--
SKQR
L
NI
N
V
A
Q
V
H
M
QKETDN
R
L
AR
KF
WD
Q
GI
P
A
LFVN
Y
SYSGSQGQTRGNKN-
-
KSS
T
TSDFLSLNA
G
A
N
L
G
A
WRLR
S
N
MNWTQSGFE
-
SEQ
Y
D
TFEDAYRKQ
KSSQSK
W
DTG
--------
DTY
L
Q
R
DVQF
L
N
-
SE
L
T
I
GD
Y
R
TT
S
ITEQ
L
I
D
G
FQ
F
R
G
VS
L
SS
S
EY
M
I
P
AALR
GFAP
V
I
T
G
Y
A
RT
NA
E
V
I
V
T
Q
N
G
YS
IY
QTH
V
A
PGPF
R
I
D
DL
PG
G
SSA
GD
I
Y
V
S
V
K
E
S
DG
TVHGFR
QA
Y
S
T
L
P
E
M
Q
R
Q
G
DF
K
F
EY
S
V
G
R
YKQSG
Y
-
STYENTPL
F
SNTSFL
Y
G
LPHNV
T
A
M
G
N
L
-
LYS
G
D
Y
Q
S
VSL
G
A
A
F
S
LGML
G
T
L
S
TS
V
T
S
S
VTEGR
-----
DN
-
D
---
KLR
G
Y
S
V
N
AR
YSK
SLTETG
T
LFQLA
S
YRYS
TPD
F
R
T
F
S
E
ANVE
EY
RGS---------------
-
SYINYML-
-
SGRR
-
K
DTWSL
I
LN
Q
SIT-SGL
S
VGV
S
GRRDN
YW
DRHSTTS--LSA
G
LNGTFRQ-TSWS
L
N
Y
N
IDRVRG
N
G
----------
S
WPE
N
RELSLSV
S
V
P
FSAFMSS
--
---GSMSSANFN
Y
RTAH
-
--NNQGRT
T
NMV
S
LNG
S
A
LE
E
NR
-
L
S
W
N
I
SE
N
WSNSSRNY-QRDE
NFS
AGVS--
Y
DSQY
---
A
RLYG
G
Y
G
RT
--
-SQSN
T
YNYGA
SG
S
LL
A
HPG
G
VI
V
S
R
QNIG--N
A
AA
LV
H
V
P
D
VP
G
AR
V
M
-
NGRDIH
TD
NK
G
FA
L
VPYV
A
I
Y
EK
N
NIT
I
D
P
V
S
L
SD
-
G
I
ELSE
TS
KATYPTK
GAI
VSVE
Y
KVHS
G
QQALINLTH-D
G
KPVP
L
GA
F
V
T
---
--
-
-IGDQV
F
IV
G
H
S
G
QV
Y
------
V
SGV
PE
--
SGRLK
V
K
WG
DKESY-
-----
-
VANYKL
N
AKSP
fig|749527.3.peg.4078
Escherichia coli MS 21-1 (16-845/860)
LALAVMVACVMFRAESGIA
R
TYS--
---
F
D
AA
M
LK--
--
GGGKGV
D
-
LTL
F
EE
--
GGQL-
PG
I
Y
P
V
D
I
I
LN
GSRV
-
DSQ-E
M
AFHAERDAEGRP-
-
-YLKT
CLT
--
REM
L
ARY
G
V
R
IEEY
------
PALFRAS
GEGRGASV
AEEA
C
ADLTA-IPQATESY
Q
F
--
AAQQ
L
VL
G
I
PQ
V
A
L
RPQLRG
I
A
P
E
AL
WD
D
GI
P
A
FLLN
W
QADAGRSEYRGYG--
-
KRV
T
DSYWVSLQP
G
I
N
I
G
P
WR
V
R
N
L
TTWNRSSGQ
-
SG-
-
-
---------
-----K
W
ESS
--------
YIR
A
E
R
GLNG
I
K
-
SR
L
T
LG
E
D
Y
TP
S
---
DIFDS
VP
F
R
G
AM
M
SS
D
ES
M
V
P
YNLR
E
FAP
V
V
R
GIA
RT
Q
A
R
I
E
V
R
Q
N
G
YL
I
Q
SQT
V
A
PG
A
F
A
L
T
DL
PV
T
GSG
S
DL
Q
V
T
V
L
E
S
DG
TAQVFT
VP
F
TT
P
A
I
A
L
R
E
G
YL
K
Y
NV
T
A
G
Q
YRSS-
--
DDAVEHTS
L
GQVTAM
Y
G
LPWGL
T
V
YGG
L
-
QGA
E
H
Y
Q
S
AAL
G
L
G
W
S
LGRL
GA
V
S
L
D
T
T
H
S
RGQQK
-----
GH
-
D
---
YET
G
D
T
W
R
IR
Y
N
K
SFELTG
T
SFTAA
S
Y
Q
YS
SDG
Y
H
T
L
P
D
VLDT
W
-
RDD--RY-----------
-
AYRH----
-
TENR
-
S
RRTTL
S
LS
Q
SLG-QWG
Y
VGL
N
GSRDE
Y
-
RDRPHRD-YFGA
S
YSTSWNN-ISLS
VN
W
S
RNRN--
-
S
---
GGYYGGWS
RTE
D
SV-SMWM
S
V
P
LGRWFGG
--
----TDNDISAT
A
QMQR
-
--STGQDT
R
YEA
G
LNG
R
A
F-
D
RR
-
L
Y
W
D
V
RE
Q
MVPGSESH-ADT-
---
SRLNLT
W
YGTY
---
G
ELTG
M
Y
S
YS
--
-STMR
Q
LNAGM
SG
S
MV
A
HSE
G
VT
F
G
-
QRTG--D
T
VA
L
I
A
APG
VS
G
AS
V
G
-
GWPGVR
TD
FR
G
YT
L
AGYA
S
P
Y
QE
N
VLT
LD
P
T
T
F
PE
-
D
A
EVPQ
TD
SRVVPTK
GA
V
VRAG
F
RTRV
G
GRALVSLARQD
G
TPLP
FGA
V
V
T
VEG
EA
G
QAAGSA
G
V
V
G
D
R
G
EV
Y
------
L
SG
L
KE
--
SGKLK
A
Q
WG
ENSL--
-----
C
HADYRL
P
E
fig|362663.8.peg.3815
Escherichia coli 536 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
SQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GESVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
L
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AA
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|362663.9.peg.3829
Escherichia coli 536 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
SQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GESVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
L
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AA
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|525281.3.peg.1568
Escherichia coli 83972 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINVI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYNSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFSA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|655817.3.peg.5064
Escherichia coli ABU 83972 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINVI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYNSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFSA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|749546.3.peg.3032
Escherichia coli MS 185-1 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINVI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYNSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFSA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|749528.3.peg.1477
Escherichia coli MS 45-1 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINVI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSASINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYNSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
I
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFSA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
I
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|340197.3.peg.849
Escherichia coli F11 (37-838/863)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|340197.5.peg.891
Escherichia coli F11 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|749550.3.peg.1517
Escherichia coli MS 200-1 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|869729.3.peg.4667
Escherichia coli UM146 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|364106.7.peg.4774
Escherichia coli UTI89 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|364106.8.peg.4773
Escherichia coli UTI89 (18-819/844)
LFAALGLTVTN---HSFA
A
EEAE-
---
F
D
SE
F
LHLD
--
KGINAI
D
-
IRR
F
SH
--
GNPVP
E
G
R
Y
Y
S
D
I
Y
V
N
NVWK
-
GKA-D
L
QYLRTANTG----
-
-APTL
CLT
--
PEL
L
SLI
D
L
V
KDTM
------
SGNTS--
--------
----
C
FPASTGLSSARINF
D
L
--
STLR
L
NI
E
I
PQA
L
L
NTRPRG
Y
I
S
P
AQ
W
Q
S
G
V
P
A
AFIN
Y
DANYY--QYSSS---
-
GTS
N
EQTYLGLKA
G
F
N
L
W
G
W
A
LR
H
R
GSESWNNSY
-
PA-
-
-
---------
-----G
Y
QNI
--------
ETS
I
M
H
DLAP
L
R
-
AQ
F
T
LGD
F
Y
TN
G
---
EL
M
DS
LS
L
R
G
VR
L
AS
D
ER
MLP
GSLR
G
Y
AP
A
V
R
GIA
NS
NA
K
VTI
Y
Q
N
A
HI
L
Y
ETT
VP
A
GPF
V
I
N
DL
YP
S
GYA
GDL
L
V
K
I
T
E
S
N
G
QTRMFT
VP
F
AA
V
A
Q
L
I
R
P
G
FS
R
W
QM
S
V
G
K
YRY--
--
ANKTYNDL
I
AQGTYQ
Y
G
LTNDI
T
L
NS
G
L
-
TTA
S
G
Y
T
A
GLA
G
L
A
F
N
TP-L
GA
I
A
S
D
I
T
L
S
RTAFR
-----
YS
G
V
---
TRK
G
Y
S
L
H
SS
YS
I
NIPASN
T
NITLA
A
YRYS
SKD
F
Y
H
L
K
D
ALSA
--
NH--------NAFIDDVS
V
KSTAF---
-
-YRP
-
R
NQFQI
S
IN
Q
ELGEKWG
G
MYL
T
GTTYN
YW
GHKGSRN-EYQM
G
YSNFWKQ-LGYQ
I
G
L
S
Q--SRD
N
E
----------
Q
QRR
D
DRFYINF
TL
P
L------
--
-GGSVQSP----
V
FSTV
L
NYSKEEKN
S
IQT
S
ISG
T
G
GE
D
NQ
-
F
SY
G
I
SG
N
SQENGPSGY----
---
A-MNGG
Y
RSPY
VNIT
TTVG
H
D
T
QN
--
-N--N
Q
RSFGA
SG
A
VV
A
HPY
G
VT
L
S
-
NDLS--D
T
FA
II
H
A
E
G
AQ
G
AV
I
N
-
NASGSR
L
D
FW
G
NG
V
VPYV
T
P
Y
EK
N
QIS
I
D
P
S
N
L
DL
-
N
V
ELSA
TE
QEIIPRA
N
S
A
TLVK
F
DTKT
G
RSLLFDIRMST
G
NPPP
MA
S
E
V
L
---
--
D
EHGQLA
G
Y
V
A
Q
A
G
KV
F
------
T
R
G
L
PE
--
KGHLS
V
V
WG
PDNKDR
-----
C
SFVYHV
fig|362663.8.peg.3840
Escherichia coli 536 (15-834/835)
STKVF
A
EDY--
---
F
D
PS
L
LATD
I
-
IGEGNI
D
-
LSA
F
SR
--
PGGGM
E
G
E
Q
E
V
A
I
Y
V
N
DEFY
-
SRN-T
L
FFKNTLDKGLLP-
-
-EFTP
GFF
--
DEL
L
SGD
F
L
V
SEED
------
-------
--------
KTIS
S
SDFLKKVPYSDINF
N
Q
--
GMSR
V
NV
S
I
PQA
Y
L
GDGAKL
I
S
S
P
DT
W
E
Y
G
GP
A
FLLD
Y
NISGNRNDSG-----
-
NYD
S
RSLYISSQM
G
V
N
L
M
K
WRLR
T
S
SSYSNYKTN
-
---
-
-
---------
----SV
W
GGA
RSEQNSFY
NTY
A
E
R
DISS
L
R
-
AI
L
R
LG
E
V
S
TA
G
---
L
I
L
DS
VP
F
R
G
MK
L
SS
S
DD
ML
G
MRLR
NYT
P
T
V
R
G
M
A
SS
Q
A
V
VTI
T
Q
N
G
RQ
V
Y
QTN
VP
A
GPF
E
L
N
D
F
YL
S
GYS
GD
M
L
V
T
V
R
E
A
DG
SEHSFL
Q
P
Y
S
T
L
P
E
M
K
R
E
G
VS
G
F
EV
S
V
G
R
YDNNG
A
-
EHYYDAES
F
VYGNWS
R
G
FARGV
T
F
F
AE
T
-
LQA
E
K
Y
Q
S
LGG
G
S
T
L
S
LGRL
GA
A
S
A
D
I
S
L
S
RADKY
-----
GD
-
-
---
IRI
G
Q
S
Y
G
FK
YSK
SQIETG
T
TVTLA
T
YRYS
TEN
F
Y
T
F
R
D
FVSK
--
------------------
-
--TDTARY
I
WENK
L
K
SRMTF
S
LS
Q
SLG-EYG
Y
LSA
N
ASQQD
YW
NSREVSR-NYSL
T
HSFSWND-IYFS
T
T
L
S
MDDQRG
R
E
---------
TG
HLS
N
KQAGIYA
S
V
P
LSKLLPR
--
---TDPTSSSLT
W
STSH
-
---ADHKV
R
NSV
T
LDG
K
V
-P
E
SD
-
V
R
Y
R
V
GG
S
WGNGTTEG-SRM-
---
ASVS--
W
TGDH
---
A
STSL
G
Y
T
RV
--
-GKYR
T
LDYSM
SG
A
AV
M
YPW
G
IA
V
G
-
NSSVTGD
G
AI
V
V
E
T
PG
AK
G
--
V
R
-
TSTGYK
T
S
WL
G
TA
L
ISSP
Q
K
Y
TE
N
RIN
L
Y
P
D
G
L
PS
-
D
T
VLGE
TS
KTAVPAK
GA
V
VVLD
Y
TVFR
G
SQVVFTLRQTD
G
NPLP
FG
T
V
I
T
-
LD
GV
S
RGKENS
GIV
G
E
E
G
RV
Y
------
M
A
G
I
PE
--
KGTLT
A
S
WG
LNKT--
-----
C
SIPFRI
N
QHKAEAVIREVQ
G
V
CR
fig|362663.9.peg.3854
Escherichia coli 536 (15-834/835)
STKVF
A
EDY--
---
F
D
PS
L
LATD
I
-
IGEGNI
D
-
LSA
F
SR
--
PGGGM
E
G
E
Q
E
V
A
I
Y
V
N
DEFY
-
SRN-T
L
FFKNTLDKGLLP-
-
-EFTP
GFF
--
DEL
L
SGD
F
L
V
SEED
------
-------
--------
KTIS
S
SDFLKKVPYSDINF
N
Q
--
GMSR
V
NV
S
I
PQA
Y
L
GDGAKL
I
S
S
P
DT
W
E
Y
G
GP
A
FLLD
Y
NISGNRNDSG-----
-
NYD
S
RSLYISSQM
G
V
N
L
M
K
WRLR
T
S
SSYSNYKTN
-
---
-
-
---------
----SV
W
GGA
RSEQNSFY
NTY
A
E
R
DISS
L
R
-
AI
L
R
LG
E
V
S
TA
G
---
L
I
L
DS
VP
F
R
G
MK
L
SS
S
DD
ML
G
MRLR
NYT
P
T
V
R
G
M
A
SS
Q
A
V
VTI
T
Q
N
G
RQ
V
Y
QTN
VP
A
GPF
E
L
N
D
F
YL
S
GYS
GD
M
L
V
T
V
R
E
A
DG
SEHSFL
Q
P
Y
S
T
L
P
E
M
K
R
E
G
VS
G
F
EV
S
V
G
R
YDNNG
A
-
EHYYDAES
F
VYGNWS
R
G
FARGV
T
F
F
AE
T
-
LQA
E
K
Y
Q
S
LGG
G
S
T
L
S
LGRL
GA
A
S
A
D
I
S
L
S
RADKY
-----
GD
-
-
---
IRI
G
Q
S
Y
G
FK
YSK
SQIETG
T
TVTLA
T
YRYS
TEN
F
Y
T
F
R
D
FVSK
--
------------------
-
--TDTARY
I
WENK
L
K
SRMTF
S
LS
Q
SLG-EYG
Y
LSA
N
ASQQD
YW
NSREVSR-NYSL
T
HSFSWND-IYFS
T
T
L
S
MDDQRG
R
E
---------
TG
HLS
N
KQAGIYA
S
V
P
LSKLLPR
--
---TDPTSSSLT
W
STSH
-
---ADHKV
R
NSV
T
LDG
K
V
-P
E
SD
-
V
R
Y
R
V
GG
S
WGNGTTEG-SRM-
---
ASVS--
W
TGDH
---
A
STSL
G
Y
T
RV
--
-GKYR
T
LDYSM
SG
A
AV
M
YPW
G
IA
V
G
-
NSSVTGD
G
AI
V
V
E
T
PG
AK
G
--
V
R
-
TSTGYK
T
S
WL
G
TA
L
ISSP
Q
K
Y
TE
N
RIN
L
Y
P
D
G
L
PS
-
D
T
VLGE
TS
KTAVPAK
GA
V
VVLD
Y
TVFR
G
SQVVFTLRQTD
G
NPLP
FG
T
V
I
T
-
LD
GV
S
RGKENS
GIV
G
E
E
G
RV
Y
------
M
A
G
I
PE
--
KGTLT
A
S
WG
LNKT--
-----
C
SIPFRI
N
QHKAEAVIREVQ
G
V
CR
fig|331111.12.peg.72
Escherichia coli E24377A (22-797/817)
AKLFVLLFLCDSVNA
E
KYI--
---
F
E
RD
F
LA-D
--
--SEKI
D
-
LTL
L
E-
--
SSAYP
S
G
R
Y
Y
V
S
L
Y
LN
GEYI
-
TKE-Y
M
YFDAGESED----
-
----F
C
I
Q
--
YSV
L
QDI
G
V
T
VSG-
------
-------
--------
NQDE
C
ANLDDEL-NLRTRF
D
F
--
YSKR
M
DI
F
V
SPK
F
V
PRKKNG
L
A
P
I
KL
WD
E
G
E
NA
LFTS
Y
NFSEDYYHFKGD---
-
ARD
S
YSQYANIQP
R
LN
I
G
P
WR
I
R
T
Q
AIWNKNNNT
-
KG-
-
-
---------
-----E
W
SNN
--------
YLY
A
E
R
GLGN
I
K
-
SR
L
Y
I
GD
G
Y
FP
L
---
KN
F
N
S
FK
F
K
G
GV
L
KT
D
EN
M
Y
P
YSEK
T
Y
S
P
I
V
K
G
S
A
KT
Q
A
K
V
EF
F
Q
D
G
VK
IY
SSI
VPPG
D
F
S
I
S
D
Y
IL
S
GSN
S
DL
Y
V
K
V
I
E
E
N
G
SIQEFI
VP
F
T
Y
P
A
V
A
V
R
E
G
FT
Y
Y
EI
A
M
GE
TQQS-
--
-----NDY
F
TQLSFT
R
G
LPYDF
T
V
LTS
L
-
EYS
G
F
Y
R
S
LEI
G
L
G
K
M
LGNL
GA
L
S
LI
Y
G
Q
S
NFSKS
-----
DN
-
S
---
--K
N
K
K
W
D
IR
Y
N
K
NIPDLN
T
YLSFS
A
VSQ
T
RGG
Y
S
S
L
R
D
ALDY
--
------------------
-
EIGEY---
-
TFNS
-
K
NSYTA
S
IN
H
SLG-ELG
S
LNF
S
GTWRN
YW
ENKNQTR-SYNL
S
YSTQIFN-GKAY
LS
G
S
LIRSEL
M
N
--------
FN
N
KIS
D
TILNIGV
N
IP
FGLSRGI
--
--------QSVS
Y
NTSS
-
--VKGGRS
T
HQL
G
ISG
S
E
F-
D
NK
-
L
Y
W
H
V
NQ
G
YSDNYSN------
---
TSMYGY
Y
KAKY
---
A
QVNA
G
Y
S
VS
--
-ERYN
H
AYGGI
E
GG
IL
V
YDG
G
II
L
G
-
RNLG--D
T
MS
II
E
APG
AE
N
TK
I
R
-
GWGSIE
TD
WR
G
RA
F
IGYL
S
P
Y
QN
N
DIS
LD
P
S
S
L
PL
-
D
S
SLDI
TT
NSVIPTT
GAI
VKTT
Y
NVKK
G
KKVMLTLKKSN
G
DAVP
FGA
I
V
T
VMD
G-
-
--DQNT
S
IV
G
D
N
G
QL
Y
------
L
GSS
MD
--
TGRLK
V
I
WG
NGEDKK
-----
C
VVDY
fig|331112.3.peg.2102
Escherichia coli HS (1-813/826)
M
L
R
M
T
PLASVIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQK
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|331112.6.peg.2197
Escherichia coli HS (1-813/826)
M
L
R
M
T
PLASVIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQK
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656440.3.peg.3147
Escherichia coli TA206 (9-830/830)
G
KVSRALAIVLCAFPPFSGG
A
ENTET
VYQ
FN
DG
F
IVGS
--
R--ERV
D
-
LSR
F
S-
--
TSAID
E
G
V
Y
S
L
D
V
Y
T
N
GEWK
-
GRY-D
L
N-VRRRQDG----
-
-QSGV
C
Y
T
--
REM
L
VQY
G
I
A
AEKL
------
NRRLSG-
--------
QAGY
C
GPLKAWRSEDNVHD
T
L
IP
SSLR
L
EI
SVPQ
I
Y
E
DQRLKD
Y
V
S
P
AF
WD
K
GI
T
A
LSLG
W
TANAWNNHESGQ---
-
GED
N
NSVYLGLNG
GL
S
W
N
G
W
L
L
K
H
I
GNLNWQEQQ
G
GA-
-
-
---------
-----H
W
SSN
--------
QTY
L
Q
R
PLPA
M
N
-
AI
L
S
G
G
Q
F
F
TS
G
---
E
F
FD
T
TG
M
R
G
VN
L
AT
D
DN
M
F
P
DGMR
S
Y
AP
E
I
R
G
V
A
RS
NA
L
VT
V
R
Q
G
N
NI
IY
QTT
VPPGPF
I
L
Q
D
V
YP
S
GYG
S
DL
D
V
S
V
K
E
A
DG
TVNVFS
VP
Y
A
S
V
T
Q
L
L
R
P
G
MT
RY
AF
S
A
G
K
ADD--
--
DFLRHKPV
L
WQATWQ
HG
LSNMF
T
G
Y
T
G
V
-
TGF
N
D
Y
Q
A
FLL
G
T
G
M
N
TG-I
GA
L
SFD
V
T
H
S
R--LK
-----
SD
M
L
---
DES
G
Q
S
W
R
AT
FNR
MFTETQ
T
SIVLA
A
YRYS
TRG
Y
Y
N
L
N
D
ALYA
--
-------------VDQTR
N
KRNNYV--
-
LWRE
-
K
NGMTF
T
VN
Q
NLPEGWG
G
FWL
S
GRVSS
YW
DRSGTEK-QYQM
S
YNNSAGR-LSWS
V
S
A
Q
RVYTHD
S
S
----------
G
HRR
D
DRVSLNF
S
L
P
LW-----
--
FGENRTAN----
L
TSNT
V
-FSNSHFS
S
SQT
G
ING
S
L
DS
E
NN
-
L
SY
G
I
ST
T
TTTGGRHDV----
---
A-LNGS
W
RMPW
---
T
TLNG
S
Y
S
QG
--
-SGWR
Q
SGIGG
SG
T
LI
V
HSG
G
VT
L
S
-
PETG--S
T
MA
L
I
E
A
RD
AK
G
AM
L
P
-
GSPGTR
V
D
GN
G
YA
V
LPYL
R
P
Y
RI
N
SVE
I
D
P
K
G
S
ED
-
D
V
KFDR
T
V
ARVVPWE
D
SV
VKIT
F
ATEV
Q
NTLTLPVHRAD
G
RPLP
F
A
A
T
I
Y
---
--
D
PAGREI
G
V
V
G
Q
G
S
MM
F
I
NRAGA
T
RA
V
VR
WA
GGQCS
T
A
L
D
PVSALK
TKELV
C
R
fig|656419.3.peg.2863
Escherichia coli M718 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PS
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|481805.3.peg.1648
Escherichia coli ATCC 8739 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|481805.6.peg.1643
Escherichia coli ATCC 8739 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|358709.5.peg.552
Escherichia coli 101-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|595496.3.peg.2077
Escherichia coli BW2952 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|536056.3.peg.1634
Escherichia coli DH1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|83333.1.peg.2084
Escherichia coli K12 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|316407.3.peg.2043
Escherichia coli W3110 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|316385.5.peg.2225
Escherichia coli str. K-12 substr. DH10B (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|316385.7.peg.2275
Escherichia coli str. K-12 substr. DH10B (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|511145.12.peg.2186
Escherichia coli str. K-12 substr. MG1655 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|511145.6.peg.2171
Escherichia coli str. K-12 substr. MG1655 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340186.3.peg.2488
Escherichia coli E110019 (1-813/826)
M
L
R
MP
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340186.5.peg.2568
Escherichia coli E110019 (1-813/826)
M
L
R
MP
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749547.3.peg.3088
Escherichia coli MS 187-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDT----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|316401.4.peg.2573
Escherichia coli ETEC H10407 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
V
S
PGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749531.3.peg.2040
Escherichia coli MS 69-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
C
I
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSE
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
I
N
KQQGLS
-----
C
TITF
fig|749527.3.peg.1477
Escherichia coli MS 21-1 (1-813/826)
M
L
R
MP
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|670888.3.peg.4182
Escherichia coli 1827-70 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DL
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
A
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
I
N
KQHGLS
-----
C
TITF
fig|749538.3.peg.1651
Escherichia coli MS 116-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YLSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSE
F
VQVGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QSLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|562.375.peg.2368
Escherichia coli EC4100B (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656393.3.peg.3134
Escherichia coli H299 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|550676.3.peg.2403
Escherichia coli B185 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----A
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FLSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAASYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749537.3.peg.860
Escherichia coli MS 115-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
S
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
SVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|550677.3.peg.1652
Escherichia coli B354 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----T
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
AFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTEQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656379.3.peg.2639
Escherichia coli FVEC1302 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656380.3.peg.2196
Escherichia coli FVEC1412 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749549.3.peg.4988
Escherichia coli MS 198-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585056.7.peg.2622
Escherichia coli UMN026 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656417.3.peg.2772
Escherichia coli M605 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|439855.10.peg.1092
Escherichia coli SMS-3-5 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|405955.13.peg.2323
Escherichia coli APEC O1 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|405955.9.peg.1894
Escherichia coli APEC O1 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|714962.3.peg.2382
Escherichia coli IHE3034 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585035.6.peg.2231
Escherichia coli S88 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|869729.3.peg.1329
Escherichia coli UM146 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|364106.7.peg.2409
Escherichia coli UTI89 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|364106.8.peg.2412
Escherichia coli UTI89 (1-813/826)
M
L
R
MP
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|679205.4.peg.3623
Escherichia coli MS 124-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
G
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749533.3.peg.404
Escherichia coli MS 84-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
G
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVSTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
I
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|431946.3.peg.2077
Escherichia coli SE15 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELEIG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749546.3.peg.2884
Escherichia coli MS 185-1 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GAFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585397.7.peg.2478
Escherichia coli ED1a (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
V
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
E
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRHDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585397.9.peg.2476
Escherichia coli ED1a (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
V
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
E
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRHDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585057.4.peg.961
Escherichia coli IAI39 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IV
Y
N
K
FVSKTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
M
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585057.6.peg.958
Escherichia coli IAI39 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNSN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IV
Y
N
K
FVSKTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
M
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|525281.3.peg.3908
Escherichia coli 83972 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSNK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|655817.3.peg.2549
Escherichia coli ABU 83972 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSNK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749528.3.peg.1573
Escherichia coli MS 45-1 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|216592.1.peg.2897
Escherichia coli 042 (6-818/831)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYSW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|216592.3.peg.2443
Escherichia coli 042 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYSW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
S
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GN
R
DQ
-
F
N
Y
G
V
NL
S
HQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRTD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|362663.8.peg.2160
Escherichia coli 536 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSKS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|362663.9.peg.2165
Escherichia coli 536 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSKS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340197.3.peg.167
Escherichia coli F11 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340197.5.peg.172
Escherichia coli F11 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749550.3.peg.1555
Escherichia coli MS 200-1 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|199310.1.peg.2567
Escherichia coli CFT073 (2-818/831)
Q
ELP
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRXGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|562.376.peg.4023
Escherichia coli WV_060327 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGVNLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|199310.4.peg.2479
Escherichia coli CFT073 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
P
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EDLESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
Q
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SIQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRXGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|753642.3.peg.2987
Escherichia coli NC101 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQVGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|216593.1.peg.2647
Escherichia coli E2348/69 (2-818/831)
Q
ELP
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GM
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
C
YS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-GYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
LLVN
F
DTDQ
R
KPCFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
LHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656440.3.peg.2089
Escherichia coli TA206 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
N
V
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|670897.3.peg.4574
Escherichia coli 2362-75 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GM
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
C
YS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-GYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
LLVN
F
DTDQ
R
KPCFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
LHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|574521.7.peg.2303
Escherichia coli O127:H6 str. E2348/69 (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YMSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GM
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
C
YS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-GYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
LLVN
F
DTDQ
R
KPCFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
LHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656408.3.peg.2351
Escherichia coli H591 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|679207.4.peg.3837
Escherichia coli MS 107-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|679206.4.peg.1562
Escherichia coli MS 119-7 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|679204.3.peg.137
Escherichia coli MS 145-7 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656443.3.peg.2807
Escherichia coli TA271 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|344601.3.peg.3208
Escherichia coli B171 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGRSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|344601.5.peg.3353
Escherichia coli B171 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGRSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340185.3.peg.1264
Escherichia coli E22 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGRSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|340185.4.peg.1333
Escherichia coli E22 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGRSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585395.4.peg.2704
Escherichia coli O103:H2 str. 12009 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGRSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|595495.4.peg.334
Escherichia coli KO11 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|566546.3.peg.4642
Escherichia coli W (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|566546.4.peg.2290
Escherichia coli W (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
R
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKHSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749545.3.peg.2549
Escherichia coli MS 182-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTR
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749532.3.peg.3754
Escherichia coli MS 78-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTR
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-F
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585396.4.peg.2919
Escherichia coli O111:H- str. 11128 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----G
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|550672.3.peg.1825
Escherichia coli B088 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585034.4.peg.2157
Escherichia coli IAI1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585034.5.peg.2152
Escherichia coli IAI1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|409438.11.peg.2530
Escherichia coli SE11 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|685038.3.peg.2163
Escherichia coli O83:H1 str. NRG 857C (1-813/826)
M
L
R
M
T
PLASAIVALLIGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
S
-
NIR
L
ED
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REM
I
KRL
G
I
N
TDSF
------
ASGKQ--
--------
----
C
LTFKQLIQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
E
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGHQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRHDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
HAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQYQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-SAYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
YLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|331111.12.peg.2679
Escherichia coli E24377A (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
M
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|331111.3.peg.141
Escherichia coli E24377A (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
M
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|1040638.4.peg.3190
Escherichia coli O104:H4 str. LB226692 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKELRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585055.6.peg.2398
Escherichia coli 55989 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKELRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|585055.8.peg.2405
Escherichia coli 55989 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKELRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|6666666.5357.peg.1516
Escherichia coli TY-2482 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
N
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKELRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|749540.3.peg.3
Escherichia coli MS 146-1 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYSW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
E
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
N
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
S
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
QE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656414.3.peg.2466
Escherichia coli H736 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYSW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
N
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RY
L
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDKNDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGVA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQHQENETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
I
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|656437.3.peg.2367
Escherichia coli TA143 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
VKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
REI
I
KRL
G
I
N
TDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----T
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
V
S
PGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSE
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDIYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNLRR-ISYT
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTTRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
HQYQGNETT----
---
VGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
A
S
G
IK
D
AF
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAAPYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLT
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGFS
-----
C
TITF
fig|573235.3.peg.3083
Escherichia coli O26:H11 str. 11368 (1-813/826)
M
L
R
M
T
PLASAIVALLLGI--EAYA
A
EET--
---
F
D
TH
F
MIGG
--
MKDQQV
A
-
NIR
L
DD
--
NQPLP
-
G
Q
Y
D
I
D
I
Y
V
N
KQWR
-
GKY-E
I
I----VKDN----
-
-PQET
CL
S
--
IEV
I
KRL
G
I
N
SDNF
------
ASGKQ--
--------
----
C
LTFEQLVQGGSYTW
D
I
--
GVFR
L
DF
SVPQA
W
V
EELESG
Y
V
PP
EN
W
E
R
GINA
FYTS
Y
YVSQYYSDYKAS---
-
G-N
S
KSTYVRFNS
GLN
L
L
G
W
Q
L
H
S
D
ASFSKTNNN
-
PG-
-
-
---------
-----V
W
KSN
--------
TLY
L
E
R
GFAQ
L
L
-
GT
L
R
V
GD
M
Y
TS
S
---
DIFDS
VR
F
S
G
VR
L
FR
D
MQ
MLP
NSKQ
N
F
T
P
R
V
Q
GIA
QS
NA
L
VTI
E
Q
N
G
FV
V
Y
QKE
VPPGPF
A
I
T
DL
QL
A
GGG
A
DL
D
V
S
V
K
E
A
DG
SVTTYL
VP
Y
AA
V
P
N
M
L
Q
P
G
VS
K
Y
DF
A
A
G
R
SHI--
--
EGASKQSD
F
VQAGYQ
Y
G
FNNLL
T
L
YGG
S
-
MVA
N
N
Y
Y
A
FTL
G
T
G
W
N
TR-I
GA
I
S
V
D
A
T
K
S
H-SKQ
-----
DN
G
D
---
VFD
G
Q
S
Y
Q
IA
Y
N
K
FVSQTS
T
RFGLA
AW
RYS
SRD
Y
R
T
F
N
D
HVWA
--
NNKDNYRRDENDVYDIAD
Y
YQNDF---
-
--GR
-
K
NSFSA
N
MS
Q
SLPEGWG
S
VSL
S
TLWRD
YW
GRSGSSK-DYQL
S
YSNNWRR-ISYI
L
A
A
S
QAY---
D
E
----------
N
HHE
E
KRFNIFI
SIP
FD-----
--
WGDDVTTPRRQI
Y
MSNS
T
TFDDQGFA
S
NNT
G
LSG
T
V
GS
R
DQ
-
F
N
Y
G
V
NL
S
YQNQGNETT----
---
AGANLT
W
NAPV
---
A
TVNG
S
Y
S
QS
--
-STYR
Q
AGASV
SGG
IV
A
WSG
G
VN
L
A
-
NRLS--E
T
FA
VM
N
APG
IK
D
AY
V
N
-
GQKYRT
T
N
RN
G
VV
V
YDGM
T
P
Y
RE
N
HLM
LD
V
S
Q
S
DS
-
E
A
ELRG
N
R
KIAASYR
GA
V
VLVN
F
DTDQ
R
KPWFIKALRAD
G
QPLM
FG
Y
E
V
N
---
--
D
IHGHNI
G
V
V
G
Q
G
S
QL
F
IR
----
T
N
E
V
PP
--
--SVN
V
A
ID
KQQGLS
-----
C
TITF
fig|331112.3.peg.3014
Escherichia coli HS (12-802/821)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
CLT
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPE
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IQ
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGIIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKIT
L
H
WG
D-KS--
-----
C
FIQ
fig|331112.6.peg.3148
Escherichia coli HS (27-817/836)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
CLT
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPE
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IQ
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGIIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKIT
L
H
WG
D-KS--
-----
C
FIQ
fig|585057.4.peg.3661
Escherichia coli IAI39 (12-815/822)
IVIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GRMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
XANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLPD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|481805.3.peg.695
Escherichia coli ATCC 8739 (12-815/820)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|481805.6.peg.693
Escherichia coli ATCC 8739 (27-830/835)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|344610.3.peg.4737
Escherichia coli 53638 (12-815/820)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
KYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IS
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|344610.7.peg.3203
Escherichia coli 53638 (27-830/835)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEII
------
EEAQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
S
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
KYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IS
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIMGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|749537.3.peg.668
Escherichia coli MS 115-1 (24-827/832)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQT-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAT
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
N
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
S
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNNI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMEAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLPD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DSKLT
L
R
WG
D-KS--
-----
C
FIQPPK
S
SNLTTGTVI
fig|749547.3.peg.3987
Escherichia coli MS 187-1 (24-814/834)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
L
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQT-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
WD
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QANNLD
F
P--
--------
RIY
L
F
R
PIPA
I
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
VS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
QVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
D
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNKI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
AE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|316401.4.peg.3765
Escherichia coli ETEC H10407 (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYI
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|595496.3.peg.3023
Escherichia coli BW2952 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|536056.3.peg.686
Escherichia coli DH1 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|316407.3.peg.2932
Escherichia coli W3110 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|316385.5.peg.3173
Escherichia coli str. K-12 substr. DH10B (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|316385.7.peg.3242
Escherichia coli str. K-12 substr. DH10B (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|511145.12.peg.3139
Escherichia coli str. K-12 substr. MG1655 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|511145.6.peg.3124
Escherichia coli str. K-12 substr. MG1655 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|83333.1.peg.2994
Escherichia coli K12 (7-801/821)
ANPVIIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYF
D
K
--
GKMQ
L
SI
FA
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|358709.5.peg.1197
Escherichia coli 101-1 (24-814/834)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|656414.3.peg.3507
Escherichia coli H736 (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|749538.3.peg.3863
Escherichia coli MS 116-1 (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTSVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|413997.3.peg.3053
Escherichia coli B str. REL606 (24-814/834)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTNVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|511693.5.peg.3062
Escherichia coli BL21 (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTNVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|469008.4.peg.712
Escherichia coli BL21(DE3) (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTNVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|637912.3.peg.374
Escherichia coli OP50 (27-817/837)
IIIGCASAYA-
-
---VE
---
FN
KD
L
IEA-
--
EDRENV
N
-
LSQ
F
ET
--
DGQLP
V
G
K
Y
S
L
S
T
L
I
N
NKRT
-
PIHLD
L
QWV------LIDN
Q
TA--V
C
V
T
--
PEQ
L
TLL
G
FT
DEFI
------
EKTQQN-
--------
LIDG
C
YPIEK-EKQITTYL
D
K
--
GKMQ
L
SI
S
A
PQA
W
L
KYKDAN
W
T
PP
EL
W
N
H
GI
A
G
AFLD
Y
NLYA-SHYAPHQ---
-
GDN
S
QNISSYGQA
G
V
N
L
G
A
WRLR
T
D
YQ--YDQSF
-
NNG
K
S
---------
QATNLD
F
P--
--------
RIY
L
F
R
PIPA
M
N
-
AK
L
T
I
G
Q
Y
D
TE
S
---
S
IFDS
FH
F
S
G
IS
L
KS
D
EN
MLP
PDLR
G
Y
AP
Q
I
T
G
V
A
QT
NA
K
VT
V
S
Q
N
N
RI
IY
QEN
VPPGPF
A
I
T
N
L
FN
T
LQ-
G
Q
L
D
V
K
V
E
E
E
DG
RVTQWQ
V
A
S
N
S
I
P
Y
L
T
R
K
G
QI
RY
TT
A
M
G
K
PTNVG
--
GDSLQQPF
F
WTGEFS
W
G
WLNNV
S
L
YGG
S
V
LTN
R
D
Y
Q
S
LAA
G
V
G
F
N
LNSL
G
S
L
SFD
V
T
R
S
DAQLH
-----
NQ
-
D
---
KET
G
Y
S
Y
R
AN
YSK
RFESTG
S
QLTFA
GYR
F
S
DKN
F
V
T
M
NE
YIND
--
------------------
-
TNHYT---
-
NYQN
EK
ESYIV
T
FN
Q
YLESLRL
N
TYV
S
LARNT
YW
DASSNVNYSLSL
S
RDFDIGPLKNVS
T
S
L
T
FSRINW
E
E
-----------
DNQ
D
-QLYLNI
SIP
WG-----
--
------TSRTLS
Y
GMQR
-
--NQDNEI
S
HTA
S
WYD
S
S
--
D
RN
-
N
S
W
S
V
SA
S
GDNDEFKDMKAS-
---
LRASYQ
H
NTEN
---
G
RLYL
S
G
T
SQ
--
RDSYY
S
LNASW
N
G
S
FT
A
TRH
G
AA
F
H
-
DYSGSAD
S
RF
MI
D
A
D
G
TE
D
IP
L
-
-
NNKRAV
T
N
RY
G
IG
V
IPSV
S
S
Y
IT
T
SLS
V
D
T
R
N
L
PE
-
N
V
DIEN
S
V
ITTTLTE
GAI
GYAK
L
DTRK
G
YQIIGVIRLAD
G
SHPP
L
G
I
S
V
K
---
-D
E
TSHKEL
G
L
V
A
D
G
G
FV
Y
------
L
N
G
I
QD
--
DNKLA
L
R
WG
D-KS--
-----
C
FIQ
fig|656419.3.peg.3114
Escherichia coli M718 (8-843/879)
RL
RVLPCCIALAMSGSYVNAW
A
EDEIQ
---
F
D
SR
F
LEL-
--
KDDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---ASENDAS
K
TY--A
CLT
--
PEL
V
SQF
GL
K
EDVA
------
KNLQWI-
--------
HDGK
C
LKPGQ-LEGIDIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTDIN
W
D
PP
SR
WD
D
GI
S
G
LIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
A
WRLR
A
D
WQTDYLHSK
-
SN-
D
D
DVINGDD
--
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
I
N
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KL
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
H
Y
Q
S
AAL
G
V
G
R
D
LSVF
GA
V
A
FD
I
T
H
S
HTRLD
KETAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDELN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DRDEQTNYNVML
S
HYFNLGSIRNMS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GVYISL
S
M
P
WG-----
--
------DSSTIS
Y
NGN-
-
--YGSGSD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHSS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
P
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
G
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
QAY
I
D
L
N
N
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
SVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NA-QNV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
S
WG
GVAH--
-----
C
DIHLPD
P
LPADLFNGLLLP
C
Q
fig|749537.3.peg.2162
Escherichia coli MS 115-1 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
V
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|344610.3.peg.300
Escherichia coli 53638 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
TVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|344610.7.peg.3982
Escherichia coli 53638 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
TVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|331112.3.peg.2336
Escherichia coli HS (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
TVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|749547.3.peg.844
Escherichia coli MS 187-1 (8-843/879)
RL
RVLPCCIALAMSGSYVNAW
A
EDEIQ
---
F
D
SR
F
LEL-
--
KDDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---ASENDAS
K
TY--A
CLT
--
PEL
V
SQF
GL
K
EDVA
------
KNLQWI-
--------
HDGK
C
LKPGQ-LEGIDIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTDIN
W
D
PP
SR
WD
D
GI
S
G
LIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
A
WRLR
A
D
WQTDYLHSK
-
SN-
D
D
DVINGDD
--
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
G
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
I
N
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KL
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
H
Y
Q
S
AAL
G
V
G
R
D
LSVF
GA
V
A
FD
I
P
H
S
HTRLD
KETAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDELN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DRDEQTNYNVML
S
HYFNLGSIRNMS
V
S
M
T
GYRYEY
D
N
-----------
QTD
K
-GVYISL
S
M
P
WG-----
--
------DSSTIS
Y
NGN-
-
--YGSGSD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----NHSS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
G
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VADV
N
N
Y
YR
N
QAY
I
D
L
N
N
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
SVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NA-QNV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMI
V
S
WG
GVAH--
-----
C
DIHLPD
P
LPADLFNGLLLP
C
Q
fig|344601.3.peg.2924
Escherichia coli B171 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|344601.5.peg.3048
Escherichia coli B171 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340185.3.peg.765
Escherichia coli E22 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340185.4.peg.808
Escherichia coli E22 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585034.4.peg.2382
Escherichia coli IAI1 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585034.5.peg.2380
Escherichia coli IAI1 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|595495.4.peg.101
Escherichia coli KO11 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585395.4.peg.2934
Escherichia coli O103:H2 str. 12009 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|566546.3.peg.1300
Escherichia coli W (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|566546.4.peg.2510
Escherichia coli W (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585055.6.peg.2619
Escherichia coli 55989 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585055.8.peg.2625
Escherichia coli 55989 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340184.3.peg.583
Escherichia coli B7A (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340184.6.peg.614
Escherichia coli B7A (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|656408.3.peg.2584
Escherichia coli H591 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|679206.4.peg.1989
Escherichia coli MS 119-7 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|656443.3.peg.3047
Escherichia coli TA271 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|573235.3.peg.3396
Escherichia coli O26:H11 str. 11368 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHIEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|585396.4.peg.3179
Escherichia coli O111:H- str. 11128 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340186.3.peg.2923
Escherichia coli E110019 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|340186.5.peg.3040
Escherichia coli E110019 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|679207.4.peg.2449
Escherichia coli MS 107-1 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|409438.11.peg.2810
Escherichia coli SE11 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|331111.12.peg.2916
Escherichia coli E24377A (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
I
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
RED
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GIAH--
-----
C
DINLPA
P
LPADLFNGLLLP
C
Q
fig|331111.3.peg.368
Escherichia coli E24377A (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
I
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
RED
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLSD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GIAH--
-----
C
DINLPA
P
LPADLFNGLLLP
C
Q
fig|679205.4.peg.3358
Escherichia coli MS 124-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
L
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|749533.3.peg.1779
Escherichia coli MS 84-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
L
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|481805.3.peg.1408
Escherichia coli ATCC 8739 (16-844/881)
IALAISGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
C
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMR
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|481805.6.peg.1406
Escherichia coli ATCC 8739 (16-844/881)
IALAISGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
EEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
C
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMR
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|550672.3.peg.2052
Escherichia coli B088 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDEAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|6666666.5357.peg.592
Escherichia coli TY-2482 (16-843/880)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDAS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
NNLQWS-
--------
HDAK
C
LKSGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTYPD
W
D
PP
SR
WD
D
GI
S
G
IVAD
Y
SINAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SN-
D
D
DEFSGDE
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LR
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGEF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSGM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNVS
I
S
M
T
GYRYEY
D
N
-----------
QAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
R
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVQRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|656393.3.peg.3373
Escherichia coli H299 (16-842/879)
IALAISGSPFNVL
A
DDTIQ
---
F
D
AR
F
LEL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NHPL
-
PDEYD
I
YWY---VAENDPN
K
SY--A
CL
P
--
PEL
I
AQF
G
F
K
DDFA
------
KSLQWG-
--------
HDGQ
C
LKTDQ-IGGMEIKG
D
L
--
SQSA
L
LV
SVPQA
Y
L
EYTDDD
W
D
PP
SR
WD
E
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNMR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALF
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDTDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFRDARV
S
VYL
N
YSHHT
YW
DREDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
SQA
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHTS-
---
LDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGVSL
Q
GG
VT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMVVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|656379.3.peg.2879
Escherichia coli FVEC1302 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|656380.3.peg.2437
Escherichia coli FVEC1412 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|749549.3.peg.1250
Escherichia coli MS 198-1 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|585056.7.peg.2860
Escherichia coli UMN026 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|216592.1.peg.3162
Escherichia coli 042 (20-846/883)
IALAISGSPFNAL
A
DDIIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VA
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|216592.3.peg.2697
Escherichia coli 042 (16-842/879)
IALAISGSPFNAL
A
DDIIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
V
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VA
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|216593.1.peg.2393
Escherichia coli E2348/69 (20-846/883)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYMV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|574521.7.peg.2543
Escherichia coli O127:H6 str. E2348/69 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYMV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|656414.3.peg.2726
Escherichia coli H736 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|749538.3.peg.1930
Escherichia coli MS 116-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|749540.3.peg.3900
Escherichia coli MS 146-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|749544.3.peg.2130
Escherichia coli MS 175-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|656437.3.peg.2625
Escherichia coli TA143 (16-842/879)
IALAISGSPFNAL
A
DDTVQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|670897.3.peg.3315
Escherichia coli 2362-75 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|550677.3.peg.1886
Escherichia coli B354 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|749531.3.peg.3555
Escherichia coli MS 69-1 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNTAL-LAGTEISG
D
L
--
GQSA
L
LV
SVPQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|749548.3.peg.1883
Escherichia coli MS 196-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGVEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSIVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|358709.5.peg.841
Escherichia coli 101-1 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEHD
I
YWY---AGEDDAS
K
TY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
V
R
A
D
WQTDYQHTR
-
SND
D
D
DEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
IS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KV
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
K
N
Y
Q
S
AAL
G
I
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|413997.3.peg.2363
Escherichia coli B str. REL606 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEHD
I
YWY---AGEDDAS
K
TY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
V
R
A
D
WQTDYQHTR
-
SND
D
D
DEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
IS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KV
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
K
N
Y
Q
S
AAL
G
I
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|511693.5.peg.2375
Escherichia coli BL21 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEHD
I
YWY---AGEDDAS
K
TY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
V
R
A
D
WQTDYQHTR
-
SND
D
D
DEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
IS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KV
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
K
N
Y
Q
S
AAL
G
I
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|469008.4.peg.1360
Escherichia coli BL21(DE3) (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEHD
I
YWY---AGEDDAS
K
TY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
V
R
A
D
WQTDYQHTR
-
SND
D
D
DEFSGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
IS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KV
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
K
N
Y
Q
S
AAL
G
I
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
F
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NV
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TAQ
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QTV
G
L
V
D
D
D
G
NV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|656444.3.peg.3343
Escherichia coli TA280 (16-842/879)
IALAISGSPFNAL
A
DDTIQ
---
F
D
GR
F
LDL-
--
KGNTKI
D
-
LGR
F
SQ
--
KGYVE
PG
K
Y
N
L
R
V
H
V
N
NQPL
-
PDDYD
I
YWY---ATENDPN
K
SY--A
CL
S
--
PEL
V
AQF
GL
K
EDIA
------
KNLQWI-
--------
RDGQ
C
LNMAL-LAGTEISG
D
L
--
GQSA
L
LV
S
L
PQA
Y
L
EYTDSE
W
D
PP
SR
WD
D
GI
P
G
LIAD
Y
SINAQTRH-ENG---
-
GDD
T
NDISGNGTV
G
V
N
V
G
P
WRLR
A
D
WQSDYQHTR
-
SND
D
G
DT
---
DDSG
TQKNWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
S
LG
E
D
Y
LN
S
---
DIFD
G
FS
Y
I
G
GS
I
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
T
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
D
I
GD
S
VS-
G
T
L
H
V
R
I
E
E
Q
N
G
QVQEYD
V
S
T
A
S
M
P
F
L
T
R
P
G
QV
RY
KV
T
M
G
R
PQN-W
--
DHQVAGSF
F
SGGEAS
W
G
IANGW
S
L
YGG
A
-
LAD
E
N
Y
Q
S
AAL
G
L
G
R
D
LALL
GA
L
A
FD
V
T
H
S
RVQLD
DNSVY
GN
-
K
---
TLD
G
N
S
Y
R
VS
Y
A
K
DFDELN
S
RVTFA
GYR
F
S
EKN
Y
M
T
M
S
E
YLDA
--
------------------
-
---NDDDR
A
RTGN
D
K
EMYTV
T
YN
Q
NFTDARV
S
VYL
N
YSHHT
YW
DRQDQTNYNMML
S
HYFNLGSLRNLS
V
S
L
T
GYRYEY
D
K
-----------
SAD
K
-GVYLSL
S
L
P
WG-----
--
------DNSTIS
Y
NGN-
-
--YGSGAD
S
NQV
S
LYH
R
I
--
D
DA
-
S
H
Y
T
L
SA
G
TSE-----NHSS-
---
VDGYYS
H
DGTL
---
A
KVDL
S
A
N
YH
--
EGQYT
S
AGISL
Q
GG
AT
L
TAH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
N
VP
V
E
G
NGSAVY
T
N
MF
G
KA
V
VADV
N
D
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATK
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
EKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NQ-QQV
G
L
V
D
D
E
G
NV
Y
------
L
A
GV
KP
--
GEHMT
V
F
W
E
GESH--
-----
C
DISLPD
P
LPNDLFNGLLLP
C
Q
fig|316401.4.peg.2826
Escherichia coli ETEC H10407 (16-844/881)
IALAMSGSYSSVW
A
EDDIQ
---
F
D
SR
F
LEL-
--
KGDTKI
D
-
LKR
F
SS
--
QGYVE
PG
K
Y
N
L
Q
V
Q
LN
KQPL
-
AEEYD
I
YWY---AGEDDVS
K
SY--A
CLT
--
PEL
V
AQF
GL
K
EDVA
------
KNLQWS-
--------
HDGK
C
LKPGQ-LEGMEIKA
D
L
--
SQSA
L
VI
S
L
PQA
Y
L
EYTWPD
W
D
PP
SR
WD
D
GI
S
G
IIAD
Y
SITAQTRHEENG---
-
GDD
S
NEISGNGTV
G
V
N
L
G
P
WR
M
R
A
D
WQTNYQHTR
-
SND
D
D
DEFGGDD
--
TQKKWE
W
S--
--------
RYY
A
W
R
ALPS
L
K
-
AK
L
A
LG
E
D
Y
LN
S
---
DIFD
G
FN
Y
V
G
GS
V
ST
D
DQ
MLP
PNLR
G
Y
AP
D
I
S
G
V
A
HT
T
A
K
VT
V
S
Q
M
G
RV
IY
ETQ
VP
A
GPF
R
I
Q
DL
GD
S
VS-
G
T
L
H
I
R
I
E
E
Q
N
G
QVQEYD
IS
T
A
S
M
P
Y
L
T
R
P
G
QV
RY
KI
M
M
G
R
PQE-W
--
GHHVEGGF
F
SGAEAS
W
G
IANGW
S
L
YGG
A
-
LGD
E
N
Y
Q
S
AAL
G
V
G
R
D
LSTF
GA
V
A
FD
V
T
H
S
HTKLD
KDTAY
GK
-
G
---
SLD
G
N
S
F
R
VS
YSK
DFDQLN
S
RVTFA
GYR
F
S
EEN
L
M
T
M
S
E
YLDA
--
------------------
-
---SDSEM
V
RTGN
D
K
EMYTA
T
YN
Q
NFRDAGV
S
VYL
N
YTRHT
YW
DREEQTNYNIML
S
HYFNMGSIRNMS
V
S
L
T
GYRYEY
D
N
-----------
RAD
K
-GMYISL
S
M
P
WG-----
--
------DNSTVS
Y
NGN-
-
--YGSGTD
S
SQV
G
YFS
R
V
--
D
DA
-
T
H
Y
Q
L
NI
G
TSD-----KHTS-
---
VDGYYS
H
DGSL
---
A
QVDL
S
A
N
YH
--
EGQYT
S
AGLSL
Q
GG
AT
L
TTH
G
GA
L
H
-
RTQNMGG
T
RL
L
I
D
A
D
G
VA
D
VP
V
E
G
NGAAVY
T
N
MF
G
KA
V
VSDV
N
N
Y
YR
N
QAY
I
D
L
N
K
L
PE
-
N
A
EATQ
S
V
VQATLTE
GAI
GYRK
F
AVIS
G
QKAMAVLRLQD
G
SHPP
FGA
E
V
K
---
-N
D
NE-QTV
G
L
V
D
D
D
G
SV
Y
------
L
A
GV
KP
--
GEHMS
V
F
W
S
GVAH--
-----
C
DINLPD
P
LPADLFNGLLLP
C
Q
fig|362663.8.peg.3000
Escherichia coli 536 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|362663.9.peg.3009
Escherichia coli 536 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|340197.5.peg.2046
Escherichia coli F11 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|679207.4.peg.1590
Escherichia coli MS 107-1 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|749545.3.peg.2027
Escherichia coli MS 182-1 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|749533.3.peg.3740
Escherichia coli MS 84-1 (13-817/836)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|340197.3.peg.1936
Escherichia coli F11 (33-837/856)
ITYSLMLSLAGVPVYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SR
--
AGYIM
PG
Q
Y
Q
M
E
I
R
V
N
GQDI
S
PSAFQ
I
AFLEPPFSDSDNE
K
PLPEP
CLT
--
PEI
V
SRM
GL
T
EASQ
------
EKVTYW-
--------
NNGQ
C
ADFRQ-LSGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHQ---
-
GKQ
S
QSLNYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSA
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AN
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QV
RY
KL
V
S
G
R
SRT-Y
--
EHTMEGPV
F
AAGEAS
W
G
ISNTW
S
L
YGG
S
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSKF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GY
-
D
---
TKQ
G
K
S
W
R
LS
YSK
RFDEVN
T
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKA
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFSFKNIA
L
G
I
S
ASRSKY
L
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
NG
L
S
SY
S
L
NA
G
VNSGGGQPSQRQ-
---
MSAYYN
H
NGSL
---
T
--NL
S
A
S
FS
AV
ENGYS
S
FGMSA
SGG
AT
V
TMK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVY
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRMSD
N
SYPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|550676.3.peg.174
Escherichia coli B185 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITPLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
VA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|701177.3.peg.870
Escherichia coli O55:H7 str. CB9615 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DTQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|562.373.peg.3015
Escherichia coli 1125A (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|562.372.peg.1711
Escherichia coli 1212A (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|562.374.peg.5266
Escherichia coli 536A (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444454.5.peg.5236
Escherichia coli O157:H7 str. EC4024 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444449.5.peg.5572
Escherichia coli O157:H7 str. EC4042 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444448.5.peg.3447
Escherichia coli O157:H7 str. EC4045 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444453.5.peg.790
Escherichia coli O157:H7 str. EC4076 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444452.5.peg.3574
Escherichia coli O157:H7 str. EC4113 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444450.8.peg.906
Escherichia coli O157:H7 str. EC4115 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|444451.5.peg.4518
Escherichia coli O157:H7 str. EC4196 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|478005.5.peg.1294
Escherichia coli O157:H7 str. EC4486 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|478007.5.peg.3992
Escherichia coli O157:H7 str. EC508 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|478008.5.peg.2136
Escherichia coli O157:H7 str. EC869 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|637388.3.peg.1495
Escherichia coli O157:H7 str. FRIK2000 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|570506.3.peg.422
Escherichia coli O157:H7 str. FRIK966 (8-805/819)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|544404.4.peg.771
Escherichia coli O157:H7 str. TW14359 (5-802/816)
RLSVLSCLAMVTPPALT-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VDFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWM
GA
L
SFD
V
T
W
A
DSHFD
-----
TQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITLLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITV
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|656419.3.peg.991
Escherichia coli M718 (5-802/816)
RLSVLSCLAMVTPPALA-
-
---AE
---
FN
LN
V
LDK-
--
SIRDSV
D
-
ISL
L
NQ
--
KGVVA
PG
D
Y
F
V
S
V
T
V
N
NNKI
-
SNGQQ
I
RWQ------KSGD
K
II--P
C
I
N
--
ESL
I
ELF
GL
K
SDFR
------
KKLPA--
--------
-IKE
C
VNFSV-FPEIIFTF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
N
N
GI
P
G
FLMD
Y
NLFA-STYRPQS---
-
GSS
S
NNLNAYGTT
GLN
A
G
A
WRLR
S
D
YQ--LSQS-
-
DSG
D
N
---------
REQSGA
I
S--
--------
RTY
L
F
R
PLPQ
I
G
-
SR
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
S
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
R
PRSSM
--
SHHTEDET
F
ISHEVS
W
G
MLSNT
S
L
YGG
M
L
LAG
D
D
Y
R
S
GAL
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
DSHFD
-----
IQ
-
Q
---
DEQ
G
Y
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YIDH
--
------------------
-
KYNDA---
-
DAQD
EK
QTISL
S
FG
Q
PITPLNL
N
LYA
N
ILHQS
W
W
NADTSTTANITA
G
FNVDIGDWKDIS
V
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYFSI
S
L
P
IG-----
--
------ESGRLG
Y
DMQ-
-
--NNSNTT
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
IQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDL
T
G
T
YA
--
ANDYT
S
ASASW
SG
S
FT
A
TQH
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VG
D
IP
I
-
-
QGNIDY
T
N
RF
G
IA
V
VPFV
S
S
Y
QP
T
TVA
V
N
M
N
D
L
PD
-
G
V
TVSE
N
V
VKETWTE
GAI
GFKS
L
ASRA
G
KDLNVIISDAN
G
HFPP
L
GA
D
V
R
---
-Q
A
EGGVSV
G
M
V
G
E
N
G
HA
W
------
L
SGV
DE
--
NQQFT
V
H
WG
DQKT--
-----
C
AIHLPE
fig|656444.3.peg.4132
Escherichia coli TA280 (12-816/835)
ITYALLLSLAGAPAYA-
-
---VD
---
FN
TD
V
LDA-
--
ADRQNI
D
-
FSR
F
SQ
--
AGYIM
PG
Q
Y
Q
M
E
I
M
V
N
DQGI
S
PSAFP
V
TFLEPPVSGQDGK
K
PLPQA
CLT
--
PEM
V
SRM
GL
T
VASQ
------
EKVTYW-
--------
NNGQ
C
ADLSQ-LPGVEIRP
N
P
--
AEGM
L
YI
NM
PQA
W
L
EYSDAS
W
L
PP
SR
WD
N
GI
P
G
LLFD
Y
NING-TVNKPHK---
-
GKQ
S
QSLSYNGTA
G
A
N
F
G
A
WRLR
A
D
YQGNLNHTT
-
GSV
Q
G
---------
TDSQFT
W
S--
--------
RFY
M
Y
R
AIPR
W
R
-
AS
L
T
LG
E
N
Y
IN
S
---
E
IF
S
S
WR
Y
T
G
AS
L
ES
D
DR
MLP
PKLR
G
Y
AP
Q
V
S
GIA
DT
NA
R
V
V
I
S
Q
Q
G
RI
L
Y
DST
VP
A
GPF
T
I
Q
DL
DS
S
VR-
G
R
L
D
V
E
V
I
E
Q
DG
RKKTFQ
V
D
T
A
Y
V
P
Y
L
T
R
P
G
QI
RY
KL
V
S
G
R
SRN-Y
--
EHTTEGPV
F
AAGEAS
W
G
ISNKW
S
L
YGG
G
-
IVA
G
D
Y
N
A
LAV
G
L
G
R
D
LSEF
G
T
V
S
A
D
V
T
Q
S
VARIP
-----
GE
-
E
---
TKQ
G
K
S
W
R
LS
YSK
RFDDVN
A
DITFA
GYR
F
S
ERN
Y
M
T
M
DQ
YLNA
--
------------------
-
RYRND---
-
FTGR
EK
ELYTV
T
LN
K
NFEDWKT
S
VNL
Q
YSHQT
YW
DRRTSDYYTLSV
N
RYFDAFGFKNIS
L
G
L
S
ASRSKY
Q
N
-----------
RDN
D
-SAFVRL
S
V
P
WG-----
--
-------TGTAS
Y
SGS-
-
--MSNDRY
T
NTV
G
YSD
TL
--
N
KG
L
S
SY
S
L
NA
G
VSSGGGQPSQSQ-
---
MSAYYN
H
SSPL
---
A
--NL
S
A
N
FS
AV
ENGYT
S
FGMSA
SGG
AT
I
TAK
G
AA
L
H
-
AGGMNGG
T
RL
LV
D
TD
G
VG
G
VP
V
-
-
DGGRVS
T
N
RW
G
IG
V
VTDV
S
S
Y
YR
N
TTS
V
D
L
N
K
L
PE
-
D
M
EATR
S
V
VESVLTE
GAI
GYRE
F
EVLK
G
SRLFAVLRLAD
N
SHPP
FGA
S
V
T
---
-N
A
KG-REL
G
M
V
A
D
S
G
LA
W
------
L
SGV
NP
--
GETLN
V
G
W
D
GRTQ--
-----
C
VVD
fig|749547.3.peg.1737
Escherichia coli MS 187-1 (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITSLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|670888.3.peg.1279
Escherichia coli 1827-70 (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|413997.3.peg.720
Escherichia coli B str. REL606 (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|511693.5.peg.717
Escherichia coli BL21 (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|469008.4.peg.3039
Escherichia coli BL21(DE3) (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|358709.5.peg.3965
Escherichia coli 101-1 (5-802/816)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
MA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|585055.6.peg.714
Escherichia coli 55989 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
KT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
I
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|585055.8.peg.716
Escherichia coli 55989 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
KT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
I
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|340186.3.peg.3855
Escherichia coli E110019 (8-805/819)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|340186.5.peg.4048
Escherichia coli E110019 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|331111.12.peg.1040
Escherichia coli E24377A (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|331111.3.peg.3259
Escherichia coli E24377A (8-805/819)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|316401.4.peg.846
Escherichia coli ETEC H10407 (5-801/815)
RLSFVSCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SNYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
S
WRLR
S
D
YQ--LNNT-
-
DSE
D
S
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
R
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QYPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|481805.6.peg.3134
Escherichia coli ATCC 8739 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVTE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|331112.6.peg.743
Escherichia coli HS (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVTE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|481805.3.peg.3149
Escherichia coli ATCC 8739 (8-805/819)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVTE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|331112.3.peg.712
Escherichia coli HS (8-805/819)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVTE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|562.375.peg.1438
Escherichia coli EC4100B (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|656408.3.peg.660
Escherichia coli H591 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|679207.4.peg.1372
Escherichia coli MS 107-1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|679206.4.peg.2773
Escherichia coli MS 119-7 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|679205.4.peg.3671
Escherichia coli MS 124-1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|749533.3.peg.4665
Escherichia coli MS 84-1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|585396.4.peg.766
Escherichia coli O111:H- str. 11128 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|656443.3.peg.983
Escherichia coli TA271 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|566546.3.peg.4582
Escherichia coli W (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|566546.4.peg.772
Escherichia coli W (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|573235.3.peg.793
Escherichia coli O26:H11 str. 11368 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNDVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|749532.3.peg.2590
Escherichia coli MS 78-1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
G
T
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
GSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|550672.3.peg.961
Escherichia coli B088 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPVI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
K
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|585034.4.peg.688
Escherichia coli IAI1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPVI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
K
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|585034.5.peg.687
Escherichia coli IAI1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPVI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
K
G
HA
W
------
L
SGV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|409438.11.peg.903
Escherichia coli SE11 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
N
GV
AE
--
NQKFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|749545.3.peg.3930
Escherichia coli MS 182-1 (5-802/816)
RLSFVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
ILMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTT
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
ER
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
G
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
GSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|340184.3.peg.2282
Escherichia coli B7A (8-805/819)
RLSLVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQNFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|340184.6.peg.2396
Escherichia coli B7A (5-802/816)
RLSLVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQNFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|679204.3.peg.4494
Escherichia coli MS 145-7 (5-802/816)
RLSLVSCLVVAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWH------KNDD
K
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPL--
--------
-INQ
C
VDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSEN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
T
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
LSG
D
D
Y
H
S
AAI
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQNFT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|749548.3.peg.3226
Escherichia coli MS 196-1 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDH-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|749531.3.peg.1848
Escherichia coli MS 69-1 (5-802/816)
RLSFISCLVMAMPCALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNF
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSNHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVS
V
N
M
N
D
L
PD
-
G
V
TVAD
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
V
WG
DSQR--
-----
C
SIHLPE
fig|550677.3.peg.1123
Escherichia coli B354 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSQST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSTTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
NSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQMLT
V
V
WG
DSQH--
-----
C
SLHLPE
fig|595496.3.peg.644
Escherichia coli BW2952 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|536056.3.peg.3079
Escherichia coli DH1 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|656414.3.peg.899
Escherichia coli H736 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|749538.3.peg.653
Escherichia coli MS 116-1 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|749544.3.peg.3267
Escherichia coli MS 175-1 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|316407.3.peg.692
Escherichia coli W3110 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|316385.5.peg.781
Escherichia coli str. K-12 substr. DH10B (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|316385.7.peg.793
Escherichia coli str. K-12 substr. DH10B (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|511145.12.peg.748
Escherichia coli str. K-12 substr. MG1655 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|511145.6.peg.739
Escherichia coli str. K-12 substr. MG1655 (5-801/815)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|83333.1.peg.710
Escherichia coli K12 (8-804/818)
RLSFVSCLVMAMPCAMA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGVIA
PG
E
Y
F
V
S
V
A
V
N
NNKI
-
SNGQK
I
NWQ------KKGD
K
TI--P
C
I
N
--
DSL
V
DKF
GL
K
PDIR
------
QSLPQ--
--------
-IDR
C
IDFSS-RPEMLFNF
D
Q
--
ANQQ
L
NI
S
I
PQA
W
L
AWHSEN
W
A
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSS
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNKT-
-
DSE
D
N
---------
HDQSGG
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VTI
S
Q
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRPSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISD
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSHFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGDWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
I
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
SA
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ASDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQLFT
V
V
WG
E-QS--
-----
C
IIHLPE
fig|656379.3.peg.1492
Escherichia coli FVEC1302 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KG
--
KGGIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRTSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
N
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSNHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAD
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNLIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
E
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
I
WG
DSQR--
-----
C
SIHLPE
fig|656380.3.peg.1320
Escherichia coli FVEC1412 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KG
--
KGGIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRTSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
N
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSNHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAD
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNLIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
E
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
I
WG
DSQR--
-----
C
SIHLPE
fig|749549.3.peg.4481
Escherichia coli MS 198-1 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KG
--
KGGIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRTSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
N
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSNHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAD
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNLIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
E
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
I
WG
DSQR--
-----
C
SIHLPE
fig|585056.7.peg.1000
Escherichia coli UMN026 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KG
--
KGGIA
PG
E
Y
F
V
S
V
A
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RVNNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRTSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
N
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATN
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNDN---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSNHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGAQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAD
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNLIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
E
DSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQQFT
V
I
WG
DSQR--
-----
C
SIHLPE
fig|656393.3.peg.1403
Escherichia coli H299 (5-802/816)
RLSFISCLVMAIPSALA-
-
---VE
---
FN
LN
V
LDK-
--
SMRDRI
D
-
ISL
L
KE
--
KGGIA
PG
E
Y
F
V
S
V
T
V
N
NNQI
-
SNGQK
I
DWK------KNGD
Q
TI--P
C
I
N
--
DLL
V
DKF
GL
K
PEVR
------
QSLPR--
--------
-LNQ
C
VDFSS-RPEILFIF
D
Q
--
ASQQ
L
NI
T
I
PQA
W
L
AWHSDN
W
T
PP
ST
W
K
E
G
V
A
G
VLMD
Y
NLFA-SSYRPQD---
-
GSN
S
TNLNAYGTA
G
I
N
A
G
A
WRLR
S
D
YQ--LNQT-
-
DSD
D
N
---------
HEQSGE
I
S--
--------
RTY
L
F
R
PLPQ
L
G
-
SK
L
T
LG
E
T
D
FS
S
---
N
IFD
G
FS
Y
T
G
AA
L
AS
D
DR
MLP
WELR
G
Y
AP
Q
I
S
GIA
QT
NA
T
VT
V
S
H
S
G
RV
IY
QKK
VPPGPF
I
I
D
DL
NQ
S
VQ-
G
T
L
D
V
K
V
T
E
E
DG
RANNFQ
V
S
A
A
S
T
P
F
L
T
R
Q
G
QV
RY
KL
A
A
G
Q
PRTSM
--
SHQTENET
F
FSNEVS
W
G
MLSNT
S
L
YGG
L
L
ISG
D
D
Y
H
S
AAM
G
I
G
Q
N
MLWL
GA
L
SFD
V
T
W
A
SSQFD
-----
TQ
-
Q
---
DER
G
L
S
Y
R
FN
YSK
QVDATD
S
TISLA
A
YR
F
S
DRH
F
H
S
Y
AN
YLDH
--
------------------
-
KYNNS---
-
DAQD
EK
QTISL
S
VG
Q
PITPLNL
N
LYA
N
LLHQT
W
W
NADASTTANITA
G
FNVDIGNWRDIS
I
S
T
S
FNTTHY
E
D
-----------
KDR
D
NQIYLSI
S
L
P
FG-----
--
------NGGRVG
Y
DMQ-
-
--NSSHST
T
HRM
S
WND
TL
--
D
ER
-
N
S
W
G
M
ST
G
LQSDR-PDNGVQ-
---
VSGNYQ
H
LSSA
---
G
EWDI
S
G
T
YA
--
ANDYS
S
VSSSW
SG
S
FT
A
TQY
G
AA
F
H
-
RRSSTNE
P
RL
M
V
S
TD
G
VA
D
IP
V
-
-
QGNLDY
T
N
HF
G
IA
V
VPLI
S
S
Y
QP
S
TVA
V
N
M
N
D
L
PD
-
G
V
TVAE
N
V
IKETWIE
GAI
GYKS
L
ASRS
G
KDVNVIIRNAS
G
QFPP
L
GA
D
I
R
---
-Q
D
NSGISV
G
M
V
G
E
E
G
HA
W
------
L
SGV
AE
--
NQMLT
V
V
WG
DSQH--
-----
C
SLHLPE
Consen1
Primary consensus
QmhqvL
mp
a
---
Fn
f
--
d
-
f
--
pG
Y
v
i
lN
-
i
-
Clt
--
l
Gln
------
--------
C
d
--
L
svPQa
l
y
pP
Wd
Gina
Y
-
s
GlN
g
Wrlr
n
-
-
---------
w
--------
l
R
L
-
L
lGd
y
g
---
diFDs
f
G
l
d
MLP
gfaP
i
GiA
nA
vti
Q
G
iY
VppGpF
I
Dl
a
gdL
V
i
E
dG
vp
ss
P
l
r
G
rY
a
ge
--
F
hG
t
YGG
-
e
Y
a
G
G
n
GA
sfD
T
a
-----
-
---
G
S
r
YsK
t
gyRyS
f
t
sd
--
-
-
-
r
t
Q
s
s
yW
g
ls
s
q
-----------
d
siP
--
y
-
t
g
tl
d
-
sy
v
g
---
y
---
g
g
s
--
q
sGg
a
G
l
-
t
lv
apG
d
v
-
Td
G
v
t
Y
n
ld
n
l
-
v
av
GAi
f
g
g
fGa
v
---
-
giv
d
g
y
------
sgv
--
V
wg
-----
C
p
a
CR
Consen2
Secondary consensus
kfnm
lt
-
d
v
a
l
-
v
v
v
k
i
k
ldnkeh
n
t
i
w
s
v
g
n
l
qmh
d
d
i
a
e
d
s
nf
g
y
i
t
nyt
v
v
t
isv
v
sa
a
i
at
v
n
is
aa
m
q
k
a
s
l
s
d
av
s
g
s
aw
f
y
y
ek
n
w
s
n
e
n
w
t
s
s
r
r
w
s
gy
a
s
d
s
q
s
l
f
p
td
n
i
n
i
s
n
s
a
r
l
r
n
l
y
a
l
d
s
i
s
ir
e
id
t
c
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character