fig|1040638.4.peg.859
Escherichia coli O104:H4 str. LB226692
M
G
GVRR
D
G
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|6666666.5357.peg.2667
Escherichia coli TY-2482 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585055.6.peg.4736
Escherichia coli 55989 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585055.8.peg.4741
Escherichia coli 55989 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.371.peg.3680
Escherichia coli 1044A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.373.peg.5723
Escherichia coli 1125A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.372.peg.3378
Escherichia coli 1212A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.374.peg.2713
Escherichia coli 536A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|83334.1.peg.5117
Escherichia coli O157:H7 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|155864.1.peg.5100
Escherichia coli O157:H7 EDL933 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444454.5.peg.4222
Escherichia coli O157:H7 str. EC4024 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444448.5.peg.2434
Escherichia coli O157:H7 str. EC4045 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444453.5.peg.2071
Escherichia coli O157:H7 str. EC4076 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444452.5.peg.1091
Escherichia coli O157:H7 str. EC4113 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444450.8.peg.5512
Escherichia coli O157:H7 str. EC4115 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444451.5.peg.2175
Escherichia coli O157:H7 str. EC4196 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|444447.5.peg.2600
Escherichia coli O157:H7 str. EC4206 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|478004.5.peg.2123
Escherichia coli O157:H7 str. EC4401 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|478005.5.peg.652
Escherichia coli O157:H7 str. EC4486 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|478006.5.peg.1539
Escherichia coli O157:H7 str. EC4501 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|478007.5.peg.4907
Escherichia coli O157:H7 str. EC508 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|478008.5.peg.186
Escherichia coli O157:H7 str. EC869 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|637388.3.peg.2399
Escherichia coli O157:H7 str. FRIK2000 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|570506.3.peg.1323
Escherichia coli O157:H7 str. FRIK966 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|544404.4.peg.5324
Escherichia coli O157:H7 str. TW14359 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|502346.5.peg.1497
Escherichia coli O157:H7 str. TW14588 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|701177.3.peg.4991
Escherichia coli O55:H7 str. CB9615 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|344601.3.peg.3303
Escherichia coli B171 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|344601.5.peg.3456
Escherichia coli B171 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|331112.3.peg.4118
Escherichia coli HS (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|331112.6.peg.4289
Escherichia coli HS (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585396.4.peg.5263
Escherichia coli O111:H- str. 11128 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|573235.3.peg.5501
Escherichia coli O26:H11 str. 11368 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|358709.5.peg.1910
Escherichia coli 101-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|670888.3.peg.4571
Escherichia coli 1827-70 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|670897.3.peg.1528
Escherichia coli 2362-75 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|344610.3.peg.1444
Escherichia coli 53638 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|344610.7.peg.1963
Escherichia coli 53638 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|413997.3.peg.4251
Escherichia coli B str. REL606 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|511693.5.peg.4271
Escherichia coli BL21 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|469008.4.peg.4019
Escherichia coli BL21(DE3) (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|595496.3.peg.4236
Escherichia coli BW2952 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|536056.3.peg.4068
Escherichia coli DH1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|331111.12.peg.4924
Escherichia coli E24377A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|331111.3.peg.2317
Escherichia coli E24377A (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656414.3.peg.4724
Escherichia coli H736 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|83333.1.peg.4072
Escherichia coli K12 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|595495.4.peg.5007
Escherichia coli KO11 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749537.3.peg.4224
Escherichia coli MS 115-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749538.3.peg.2469
Escherichia coli MS 116-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|679205.4.peg.1098
Escherichia coli MS 124-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749540.3.peg.3053
Escherichia coli MS 146-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749544.3.peg.3435
Escherichia coli MS 175-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749547.3.peg.1526
Escherichia coli MS 187-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749548.3.peg.3717
Escherichia coli MS 196-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749533.3.peg.3781
Escherichia coli MS 84-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|566546.3.peg.2147
Escherichia coli W (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|566546.4.peg.4456
Escherichia coli W (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|316407.3.peg.3998
Escherichia coli W3110 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|316385.5.peg.4306
Escherichia coli str. K-12 substr. DH10B (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|316385.7.peg.4397
Escherichia coli str. K-12 substr. DH10B (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|511145.12.peg.4293
Escherichia coli str. K-12 substr. MG1655 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|511145.6.peg.4271
Escherichia coli str. K-12 substr. MG1655 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|481805.3.peg.4133
Escherichia coli ATCC 8739 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDL
I
A
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|481805.6.peg.4121
Escherichia coli ATCC 8739 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDL
I
A
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340185.3.peg.4140
Escherichia coli E22 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MP
I
RH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340185.4.peg.4367
Escherichia coli E22 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MP
I
RH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585395.4.peg.5209
Escherichia coli O103:H2 str. 12009 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MP
I
RH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656419.3.peg.5409
Escherichia coli M718 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.375.peg.2914
Escherichia coli EC4100B (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|316401.4.peg.5121
Escherichia coli ETEC H10407 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEI
S
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|409438.11.peg.4641
Escherichia coli SE11 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656393.3.peg.383
Escherichia coli H299 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749549.3.peg.3557
Escherichia coli MS 198-1 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585056.7.peg.4858
Escherichia coli UMN026 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|562.376.peg.1838
Escherichia coli WV_060327 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340184.3.peg.667
Escherichia coli B7A (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340184.6.peg.701
Escherichia coli B7A (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340186.3.peg.1091
Escherichia coli E110019 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340186.5.peg.1130
Escherichia coli E110019 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|679207.4.peg.1546
Escherichia coli MS 107-1 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|679204.3.peg.4059
Escherichia coli MS 145-7 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749545.3.peg.663
Escherichia coli MS 182-1 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749532.3.peg.895
Escherichia coli MS 78-1 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|679206.4.peg.1024
Escherichia coli MS 119-7 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
K
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585057.6.peg.4773
Escherichia coli IAI39 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656408.3.peg.4662
Escherichia coli H591 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656443.3.peg.5190
Escherichia coli TA271 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
I
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQ
Q
VIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
A
T
S
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|216593.1.peg.3839
Escherichia coli E2348/69 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTS
V
GKGRQAGSL
fig|574521.7.peg.4588
Escherichia coli O127:H6 str. E2348/69 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTS
V
GKGRQAGSL
fig|216592.1.peg.2592
Escherichia coli 042 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|216592.3.peg.4814
Escherichia coli 042 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656437.3.peg.4673
Escherichia coli TA143 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749531.3.peg.3250
Escherichia coli MS 69-1 (22-1107/1107)
A
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|525281.3.peg.1708
Escherichia coli 83972 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|655817.3.peg.4912
Escherichia coli ABU 83972 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585397.7.peg.4925
Escherichia coli ED1a (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585397.9.peg.4925
Escherichia coli ED1a (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749546.3.peg.1567
Escherichia coli MS 185-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749528.3.peg.861
Escherichia coli MS 45-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656444.3.peg.48
Escherichia coli TA280 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
K
K
TGRTLTSA
A
K
A
RQAG
G
L
fig|656417.3.peg.5359
Escherichia coli M605 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPA
N
ANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|431946.3.peg.4211
Escherichia coli SE15 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPA
N
ANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|199310.1.peg.5136
Escherichia coli CFT073 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
G
VT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|199310.4.peg.4916
Escherichia coli CFT073 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
K
I
SATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
G
VT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|753642.3.peg.3886
Escherichia coli NC101 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
V
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|405955.13.peg.4740
Escherichia coli APEC O1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|405955.9.peg.3998
Escherichia coli APEC O1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|714962.3.peg.4703
Escherichia coli IHE3034 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|685038.3.peg.4318
Escherichia coli O83:H1 str. NRG 857C (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|585035.6.peg.4641
Escherichia coli S88 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|656440.3.peg.4736
Escherichia coli TA206 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|869729.3.peg.4518
Escherichia coli UM146 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|364106.7.peg.4637
Escherichia coli UTI89 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|364106.8.peg.4636
Escherichia coli UTI89 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
Q
Q
EAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|439855.10.peg.4739
Escherichia coli SMS-3-5 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMI
S
A
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|362663.8.peg.4437
Escherichia coli 536 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQAD
S
QPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|362663.9.peg.4453
Escherichia coli 536 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQAD
S
QPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340197.3.peg.3513
Escherichia coli F11 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQAD
S
QPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|340197.5.peg.3677
Escherichia coli F11 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQAD
S
QPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749550.3.peg.559
Escherichia coli MS 200-1 (16-1107/1107)
GAYAA
TA
P
DSKQITQE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
S
NDA
L
N
Q
EILQISS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
T
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
Q
L
A
E
S
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPKPQQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQPLL
R
QIHQAD
S
QPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMIGA
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749527.3.peg.5085
Escherichia coli MS 21-1 (16-1107/1107)
GAYAA
TA
P
DSKQI
S
QE
L
EQAKAA
K
--
P
AQ
PEV
V
EA
-
L
QSA
L
NA
L
EERKGSLE
RIK
----
Q
YQEVIDNY
P
KLSATLR
A
Q
L
NNMR
D
-----
E
P
R
SVSPGM
ST
DA
L
N
Q
EILQ
V
SS
Q
L
LD
KSRQ
AQ
QEQERAREIAD
SL
NQL
P
QQ
----
QT
DA
RR
QL
NE
I
ER
RL
-
GT
LT
G
N
S
P
L
NQA
Q
NFAL
QS
DS
A
R
L
K
A
LV
D
ELELAQ
L
SA
N
N
-
R
Q
ELARLRSELAEKE
S
QQ
L
DAY
LQ
A
L
RNQL
NS
Q
R
QLEAERA
L
ES
TE
L
L
A
E
N
--
S
A
D
LPKD
I
V
A
QFKINR
ELS
AAL
NQQ
-
A
QR
MDLVA
S
QQR
Q
AAS
Q
TLQ
V
R
-------
Q
ALNTLR
EQ
SQW
L
GS
S
N
LL
GEA
L
RA
Q
VAR
LP
EMPK
L
QQLDTEM
A
Q
LR
VQRLRYED
----
L
LNKQP
Q
L
R
QIHQADGQPLTA
E
QNRI
L
EAQLRTQ
RELL
NS
L
-
L
Q
G
G
DT
L
LLELTK
L
KVSNG
QL
EDAL
K
EVNEATHRYL
FW
TSDV
RPM
TI
AW
PLEIA
Q
D
LR
RLISL
D
T
F
SQLG
K
AS
V
MM
L
TSKETILPL
F
G
A
---
L
I
L
VGCS
I
YS
R
-
RYFTRFLERS
A
AK
VG
KVTQ
D
HFWL
T
LRTLFWSILV
A
SP
LPV
LWMT
L
GY
GL
----------
REA
W
PYPLAV
AI
GDGVTATVPLL
W
V
V
M
-
I
C
ATFARP
NG
LFIA
HFG
W
PR
ERV
S
RGM
R
YYL
-
M
S
IG
L
-------
I
V
PLIMA
L
MMF
D
NLDDREFSGS
LG
R
--
LC
F
I
L
ICG
A
LA
V
VTLSLKKAGIPLYLN
KE
GSGDNITNHMLWNMMI
S
A
P
LVAILAS
A
V
GY
LA
T
AQA
L
L
A
R
-
L
ET
S
--
V
A
IW
F
LL
LVV
Y
HVIR
R
WMLIQR
RR
L
A
FD
RA
KH
RR
AEMLAQR
A
R
G
E
E
EAHHHSS
PE
GAIEVD
E
SEVD
L
DAISA
Q
S
LR
LVRSI
L
MLI
AL
LS
V
I
--
VL
WS
EIHSA
F
GF
L
EN
I
S
LW
DVT
S
T
VQ
G
VESLEPI
T
L
G
AV
L
I
AI
LVFIITTQ
L
V
RNLP
A
LLE
LAI
L
QH
L
DLTP
G
TG
YAITTI
TK
Y
LLMLI
G
GLVG
F
SMI
G
IE
W
S
KLQWL
V
AAL
G
VGLGFGLQEIF
A
NF
I
SGLIILFE
K
P
I
RIGDTVTI
RDLT
G
S
V
T
KI
NT
RATTI
S
D
W
DRKE
I
I
V
PNKAF
I
TE
QF
INWSL
S
D
SV
TR
V
V
LTIPAPADANS
E
E
V
TEI
LL
T
AA
RRCSL
V
IDN
P
A
PEVF
LVDLQQGIQIF
ELR
I
Y
AA
E
MGH
R
MPLRH
E
IHQL
I
LAGFHAHG
I
DMP
F
PPFQMR
L
ESL
NG
KQ
TGRTLTSAGKGRQAGSL
fig|749527.3.peg.730
Escherichia coli MS 21-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
Q
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
N
WIKAFPQ
S
LR
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|413997.3.peg.447
Escherichia coli B str. REL606 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRL
E
WLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.376.peg.684
Escherichia coli WV_060327 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|358709.5.peg.2090
Escherichia coli 101-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|670888.3.peg.1049
Escherichia coli 1827-70 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|481805.3.peg.3376
Escherichia coli ATCC 8739 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|481805.6.peg.3363
Escherichia coli ATCC 8739 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|511693.5.peg.452
Escherichia coli BL21 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|469008.4.peg.3298
Escherichia coli BL21(DE3) (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|595496.3.peg.386
Escherichia coli BW2952 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|536056.3.peg.3324
Escherichia coli DH1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|316401.4.peg.594
Escherichia coli ETEC H10407 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656414.3.peg.661
Escherichia coli H736 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|331112.3.peg.504
Escherichia coli HS (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|331112.6.peg.529
Escherichia coli HS (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|83333.1.peg.461
Escherichia coli K12 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749537.3.peg.3384
Escherichia coli MS 115-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749540.3.peg.3785
Escherichia coli MS 146-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749547.3.peg.2441
Escherichia coli MS 187-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|409438.11.peg.617
Escherichia coli SE11 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|316407.3.peg.450
Escherichia coli W3110 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|316385.7.peg.426
Escherichia coli str. K-12 substr. DH10B (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|511145.12.peg.483
Escherichia coli str. K-12 substr. MG1655 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|511145.6.peg.478
Escherichia coli str. K-12 substr. MG1655 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656417.3.peg.609
Escherichia coli M605 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585396.4.peg.514
Escherichia coli O111:H- str. 11128 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAM
C
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|362663.8.peg.528
Escherichia coli 536 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|362663.9.peg.528
Escherichia coli 536 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|405955.13.peg.468
Escherichia coli APEC O1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|405955.9.peg.382
Escherichia coli APEC O1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340197.3.peg.3794
Escherichia coli F11 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340197.5.peg.3972
Escherichia coli F11 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|714962.3.peg.458
Escherichia coli IHE3034 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749550.3.peg.4711
Escherichia coli MS 200-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585035.6.peg.461
Escherichia coli S88 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|869729.3.peg.3218
Escherichia coli UM146 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|364106.7.peg.599
Escherichia coli UTI89 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|364106.8.peg.597
Escherichia coli UTI89 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|216592.3.peg.526
Escherichia coli 042 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.371.peg.2721
Escherichia coli 1044A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.373.peg.3275
Escherichia coli 1125A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.372.peg.1452
Escherichia coli 1212A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.374.peg.5524
Escherichia coli 536A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|550676.3.peg.1262
Escherichia coli B185 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|550677.3.peg.903
Escherichia coli B354 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656379.3.peg.950
Escherichia coli FVEC1302 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656380.3.peg.1099
Escherichia coli FVEC1412 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656393.3.peg.1118
Escherichia coli H299 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585057.4.peg.222
Escherichia coli IAI39 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585057.6.peg.221
Escherichia coli IAI39 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656419.3.peg.673
Escherichia coli M718 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749549.3.peg.4341
Escherichia coli MS 198-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749531.3.peg.2782
Escherichia coli MS 69-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|83334.1.peg.595
Escherichia coli O157:H7 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|155864.1.peg.514
Escherichia coli O157:H7 EDL933 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|155864.8.peg.524
Escherichia coli O157:H7 EDL933 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444454.5.peg.4997
Escherichia coli O157:H7 str. EC4024 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444449.5.peg.5334
Escherichia coli O157:H7 str. EC4042 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444453.5.peg.550
Escherichia coli O157:H7 str. EC4076 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444452.5.peg.976
Escherichia coli O157:H7 str. EC4113 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444450.8.peg.671
Escherichia coli O157:H7 str. EC4115 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444451.5.peg.4635
Escherichia coli O157:H7 str. EC4196 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|478006.5.peg.937
Escherichia coli O157:H7 str. EC4501 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|478007.5.peg.869
Escherichia coli O157:H7 str. EC508 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|478008.5.peg.1611
Escherichia coli O157:H7 str. EC869 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|637388.3.peg.1006
Escherichia coli O157:H7 str. FRIK2000 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|544404.4.peg.535
Escherichia coli O157:H7 str. TW14359 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|502346.5.peg.688
Escherichia coli O157:H7 str. TW14588 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|701177.3.peg.576
Escherichia coli O55:H7 str. CB9615 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|439855.10.peg.681
Escherichia coli SMS-3-5 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
Q
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
N
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656437.3.peg.551
Escherichia coli TA143 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585056.7.peg.710
Escherichia coli UMN026 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444448.5.peg.3209
Escherichia coli O157:H7 str. EC4045 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|444447.5.peg.3383
Escherichia coli O157:H7 str. EC4206 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|478004.5.peg.1379
Escherichia coli O157:H7 str. EC4401 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|570506.3.peg.2203
Escherichia coli O157:H7 str. FRIK966 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|386585.9.peg.623
Escherichia coli O157:H7 str. Sakai (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585055.6.peg.487
Escherichia coli 55989 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585055.8.peg.488
Escherichia coli 55989 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|562.375.peg.3744
Escherichia coli EC4100B (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749538.3.peg.1371
Escherichia coli MS 116-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFG
R
P
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|431946.3.peg.441
Escherichia coli SE15 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LS
A
LSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
Q
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|595495.4.peg.402
Escherichia coli KO11 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|566546.3.peg.2318
Escherichia coli W (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|566546.4.peg.542
Escherichia coli W (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|344601.3.peg.3341
Escherichia coli B171 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|344601.5.peg.3494
Escherichia coli B171 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340184.3.peg.2597
Escherichia coli B7A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340184.6.peg.2716
Escherichia coli B7A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340186.3.peg.874
Escherichia coli E110019 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340186.5.peg.912
Escherichia coli E110019 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340185.3.peg.2550
Escherichia coli E22 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585034.4.peg.463
Escherichia coli IAI1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585034.5.peg.462
Escherichia coli IAI1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|679207.4.peg.3159
Escherichia coli MS 107-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|679205.4.peg.172
Escherichia coli MS 124-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|679204.3.peg.1934
Escherichia coli MS 145-7 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSD
I
DNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749545.3.peg.2243
Escherichia coli MS 182-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749532.3.peg.4709
Escherichia coli MS 78-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585395.4.peg.465
Escherichia coli O103:H2 str. 12009 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|573235.3.peg.506
Escherichia coli O26:H11 str. 11368 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656443.3.peg.721
Escherichia coli TA271 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656444.3.peg.943
Escherichia coli TA280 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|340185.4.peg.2690
Escherichia coli E22 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|749533.3.peg.1331
Escherichia coli MS 84-1 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|199310.1.peg.562
Escherichia coli CFT073 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
K
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|199310.4.peg.547
Escherichia coli CFT073 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
K
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|331111.12.peg.809
Escherichia coli E24377A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
H
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|331111.3.peg.3038
Escherichia coli E24377A (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
H
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|670897.3.peg.252
Escherichia coli 2362-75 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
Q
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|216593.1.peg.3134
Escherichia coli E2348/69 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
Q
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|656408.3.peg.407
Escherichia coli H591 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
G
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|679206.4.peg.82
Escherichia coli MS 119-7 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
G
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|753642.3.peg.1167
Escherichia coli NC101 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|685038.3.peg.442
Escherichia coli O83:H1 str. NRG 857C (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|344610.3.peg.1472
Escherichia coli 53638 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNL
Q
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|344610.7.peg.1209
Escherichia coli 53638 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
N
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
S
L
K
-----
DEFKSM
-
KITVNW
Q
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
A
GRWIETVYLVIIWNL
Q
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
G
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|525281.3.peg.2440
Escherichia coli 83972 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|655817.3.peg.585
Escherichia coli ABU 83972 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|749546.3.peg.3577
Escherichia coli MS 185-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|749528.3.peg.3593
Escherichia coli MS 45-1 (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
R
GDE
fig|656440.3.peg.343
Escherichia coli TA206 (38-1100/1118)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
A
QQALLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLG
C
LKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585397.7.peg.490
Escherichia coli ED1a (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQ
T
LLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAM
A
VFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
fig|585397.9.peg.492
Escherichia coli ED1a (40-1102/1120)
DLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
----
IDR
V
KEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKI
---
LSTLSLRQLETRV
-
AQALDDLQNAQNDLASYNSQLVSLQTQPERVQNAMY
N
ASQQLQQIRSRLDGTDVGETALRPSQKVLMQ
V
QQ
T
LLNAEIDQ
-
QRKSLEGNTVLQDTLQKQRDYVTANSARLEHQLQLLQEAVNSKR
-------
LTLTE
K
TAQ
E
AVSPDEAARIQANPLVKQEL
E
--
INQQLSQRLITAT
E
NGNQLMQQNIKVKNWLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFV
S
KLEEGHTNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAI
-
NLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDW
D
WIKAFPQ
T
L
K
-----
DEFKSM
-
KITVNW
E
KAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
--
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAI
----------
FWLVFGLCWKVLEKNGVAVRHFGMP
E
QQTSHWRRQIVRISLALLPIHFWSVVAELSPLHLMDDV
--------
LGQAMIFFNLLLIAFLVWPMCRES
-----
WRDKESHTMRLVTITVLSII
---
PIALMVLTATGYFYTTLRL
S
GRWIETVYLVIIWNLL
---
YQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAE
------
PPE
------
EPTIALEQVNQQTLRI
--
TMLLMFALFGVMFWAIWSDLITVFSYLDSITLWHYN
A
TEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQGASYAITTILNYIIIAVGAM
A
VFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGTVSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEVFFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNE
K
GDE
Consen1
Primary consensus
GAYAAdlPtkadlqaqLdslnkqKdlsAQdklVqqdLtdtLatLdk
----
idRiKeetvQlrqkvaeaPekmrqatAaLtalsDvdndeEtRki
---
lStlsLrQletrv
-
aQaLDdlqnAQndlasynsqlvSLqtqPervqnamydAsqQLqqIrsRLdGTdvGeTaLrpsQkvlmQsqqAlLnAeiDq
-
qrksLegNtvlQdtlqkqrdyvtanSarLehqLQlLqeavNSkR
-------
LtlTE
tAq
avSpDeaarIqAnplvkqELs
--
iNQQlsQRlitatsngnQlmqQnikVknwleralQsernikEQiavLkgSlLLsriLyqQqqtLPsadelenmtnriAdLRleqfevnqqrdaLfqsdafv
kleeghtnevnsEvhdaLlqvvdmrRELLdqLnkQlGnqLmmai
-
nLqinqqQLmsvsKnlksiltqqiFWvnsnRPMdwaWikafpQ
Lr
-----
DeFksm
-
KitVnw
kawpavfiaFlAglpLlLiaglIhwRlgwlkayqqklAsaVGslrnDsqlnTpkailidlirA
--
LPVcliiLavGLilltmqlnisellWsfskklAI
----------
fWlVfglCwkvlekNGvavrHFGmPrqqtShwrRqivriSlaLlpihfwsvVaelspLhlmDdv
--------
LGqamifFnLlliAflVwpmcres
-----
wrdKEshtmrlvtitvlsii
---
PialmvltAtGYfyTtlrL
gRwiETvylViIWnLL
---
YqtvlRglsvaaRRiAwrRAlaRRqnlvkegAeGaE
------
pPE
------
EptiaLeqvnqQtLRi
--
tmLlmfALfgVmfwaiWSdlitvFsyLdsItLWhyn
TeaGaavvknvTmGslLfAIiasmvawaLiRNLPgLLEvlvLsrLnmrqGasYAITTIlnYiiiavGamtvFgslGvsWdKLQWLaAALsVGLGFGLQEIFgNFvSGLIILFErPvRIGDTVTIgsfsGtVsKIriRATTItDfDRKEvIiPNKAFvTErlINWSLtDttTRlVirlgvaygsdlEkVrkvLLkAAtehprVmhePmPEVFftafgastldhELRlYvrElrdRsrtvdElnrtIdqlcrendIniaFnqlevhLhnenGdeTGRTLTSAGKGRQAGSL
Consen2
Secondary consensus
ta
dskqitqe
eqakaa
--
p
pev
ea
-
qsa
na
eerkgsle
v
----
yqevidny
klsatlr
q
nnmr
-----
p
svspgm
nda
n
eilqiss
l
ksrq
qeqerareiad
nql
qq
----
qtn
rr
ne
er
-
lt
n
p
nqa
nfal
ads
r
k
lv
elelaq
sa
n
-
r
elarlrselaeke
qq
day
a
rnql
q
qleaera
es
l
e
--
a
lpkd
v
qfkinr
eaal
-
a
mdlvaeqqr
aas
tlq
r
-------
alntlr
sqw
gs
n
gea
ra
var
empkpqqldtem
q
vqrlryed
----
lnkqpll
qihqadgqplta
qnri
eaqlrtq
ns
-
l
g
dt
lleltk
kvsng
edal
evneathryl
tsdv
tid
pleia
krlisl
t
sqlg
as
mm
tsketilpl
g
---
i
vgcs
ys
-
ryftrflers
ak
kvtq
hfwl
lrtlfwsilv
sp
lwmt
gy
----------
rea
pyplav
gdgvtatvpll
v
m
-
i
atfarp
lfia
w
eerv
rgm
yyl
-
m
ig
-------
i
plima
mmf
nlddrefsgs
r
--
lc
i
icg
la
vtlslkkagiplyln
gsgdnitnhmlwnmmiga
lvailas
v
la
aqa
a
-
l
s
--
a
f
lvv
hvir
wmliqr
l
fd
kh
aemlaqr
r
e
eahhhss
gaievd
sevd
daisa
s
lvrsi
mli
ls
i
--
vl
eihsa
gf
en
s
dvt
vq
veslepi
l
av
i
lvfiittq
v
a
lai
qh
dltp
tg
tk
llmli
glvg
smi
ie
s
v
g
a
i
k
i
rdlt
s
t
nt
s
w
i
v
i
qf
s
sv
v
ltipapadans
e
tei
t
rrcsl
idn
a
lvdlqqgiqif
i
aa
mgh
mplrh
ihql
lagfhahg
dmp
ppfqmr
eslk
kq
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character