fig|6666666.5357.peg.539
Escherichia coli TY-2482 (11-771/845)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|1040638.4.peg.3274
Escherichia coli O104:H4 str. LB226692
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585055.6.peg.2480
Escherichia coli 55989
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585055.8.peg.2486
Escherichia coli 55989
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|409438.11.peg.2617
Escherichia coli SE11
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|550672.3.peg.1911
Escherichia coli B088 (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340186.3.peg.1299
Escherichia coli E110019 (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|344601.3.peg.1428
Escherichia coli B171 (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|344601.5.peg.1504
Escherichia coli B171
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585395.4.peg.2792
Escherichia coli O103:H2 str. 12009
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340185.4.peg.668
Escherichia coli E22
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
G
F
GLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|331111.3.peg.232
Escherichia coli E24377A (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAME
I
FGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NL
T
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340185.3.peg.626
Escherichia coli E22 (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
G
F
GLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|562.375.peg.3177
Escherichia coli EC4100B
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
V
NNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340186.5.peg.1347
Escherichia coli E110019
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656408.3.peg.2442
Escherichia coli H591
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|679206.4.peg.2418
Escherichia coli MS 119-7
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|679204.3.peg.228
Escherichia coli MS 145-7
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656443.3.peg.2902
Escherichia coli TA271
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|331111.12.peg.2776
Escherichia coli E24377A
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STRAME
I
FGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NL
T
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340184.3.peg.414
Escherichia coli B7A (11-771/771)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHV
G
SGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|595495.4.peg.242
Escherichia coli KO11
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
E
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
A
S
TM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|566546.3.peg.1437
Escherichia coli W
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
E
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
A
S
TM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|566546.4.peg.2374
Escherichia coli W
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
E
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
A
S
TM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|679207.4.peg.4048
Escherichia coli MS 107-1
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLN
A
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|340184.6.peg.431
Escherichia coli B7A
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
K
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
ME
A
FGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHV
G
SGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656419.3.peg.2951
Escherichia coli M718 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IA
V
FA
AIEKGG
L
LEV
K
D
GG
L
A
FA
VDQ
KA
GGAIK
AT
TR
V
MEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASA
K
NF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|481805.3.peg.1562
Escherichia coli ATCC 8739 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIK
AT
TR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQAN
V
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|481805.6.peg.1554
Escherichia coli ATCC 8739 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIK
AT
TR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQAN
V
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585034.4.peg.2243
Escherichia coli IAI1 (28-788/788)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585034.5.peg.2240
Escherichia coli IAI1 (28-788/788)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
T
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
C
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|550676.3.peg.2512
Escherichia coli B185 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
S
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IA
V
FA
AIEKGG
L
LEV
K
D
GG
L
A
FA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
N
R
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656444.3.peg.3184
Escherichia coli TA280 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KV
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSD
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749531.3.peg.2763
Escherichia coli MS 69-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
S
I
LN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSD
S
AS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|573235.3.peg.3192
Escherichia coli O26:H11 str. 11368 (28-788/788)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
S
AVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585057.4.peg.2426
Escherichia coli IAI39 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
LA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VD
N
GGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
S
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
D
G
S
S
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585057.6.peg.2428
Escherichia coli IAI39 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
LA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VD
N
GGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
S
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
D
G
S
S
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749527.3.peg.204
Escherichia coli MS 21-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
LA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VD
N
GGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
S
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
D
G
G
S
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|439855.10.peg.2507
Escherichia coli SMS-3-5 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLIVEKDG
GT
V
FA
AIEKGG
L
LEV
K
EGG
F
A
LA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VD
N
GGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
S
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
D
G
S
S
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|216592.1.peg.2999
Escherichia coli 042 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
Q
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
N
S
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|216592.3.peg.2544
Escherichia coli 042 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
Q
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
N
S
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|670888.3.peg.4088
Escherichia coli 1827-70 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|344610.3.peg.1703
Escherichia coli 53638 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|344610.7.peg.4208
Escherichia coli 53638 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|413997.3.peg.2214
Escherichia coli B str. REL606 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|511693.5.peg.2226
Escherichia coli BL21 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|469008.4.peg.1509
Escherichia coli BL21(DE3) (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|316401.4.peg.2661
Escherichia coli ETEC H10407 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|331112.6.peg.2285
Escherichia coli HS (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749547.3.peg.2628
Escherichia coli MS 187-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749548.3.peg.2177
Escherichia coli MS 196-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749538.3.peg.3242
Escherichia coli MS 116-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
V
N
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749544.3.peg.2547
Escherichia coli MS 175-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
V
N
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|562.373.peg.3941
Escherichia coli 1125A (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|562.372.peg.5577
Escherichia coli 1212A (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444449.5.peg.1468
Escherichia coli O157:H7 str. EC4042 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444448.5.peg.229
Escherichia coli O157:H7 str. EC4045 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444453.5.peg.1575
Escherichia coli O157:H7 str. EC4076 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444450.8.peg.3319
Escherichia coli O157:H7 str. EC4115 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444447.5.peg.363
Escherichia coli O157:H7 str. EC4206 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|478004.5.peg.1739
Escherichia coli O157:H7 str. EC4401 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|478005.5.peg.1855
Escherichia coli O157:H7 str. EC4486 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|478008.5.peg.4122
Escherichia coli O157:H7 str. EC869 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|386585.9.peg.3214
Escherichia coli O157:H7 str. Sakai (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|544404.4.peg.3122
Escherichia coli O157:H7 str. TW14359 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|155864.8.peg.2973
Escherichia coli O157:H7 EDL933 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TX
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|701177.3.peg.2757
Escherichia coli O55:H7 str. CB9615 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMED
S
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|595496.3.peg.2160
Escherichia coli BW2952 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|536056.3.peg.1542
Escherichia coli DH1 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|331112.3.peg.2186
Escherichia coli HS (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|83333.1.peg.2164
Escherichia coli K12 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|316407.3.peg.2123
Escherichia coli W3110 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|316385.5.peg.2313
Escherichia coli str. K-12 substr. DH10B (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|316385.7.peg.2368
Escherichia coli str. K-12 substr. DH10B (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
N
NT
E
I
N
GGYQYIE
MN
G
A
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLIVEKDG
GA
V
FV
AIEKGG
L
LEV
K
EGG
F
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGT
V
T
G
V
D
K
K
AGG
K
LIVST
-
NALEVSG
P
NS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DKH
ATMQSLGKDT
---
G
T
K
VQANA
V
YDLGR
--------
S
YQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
I
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|562.371.peg.5052
Escherichia coli 1044A (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|562.374.peg.3341
Escherichia coli 536A (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|83334.1.peg.3075
Escherichia coli O157:H7 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|155864.1.peg.3080
Escherichia coli O157:H7 EDL933 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444454.5.peg.2006
Escherichia coli O157:H7 str. EC4024 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444452.5.peg.2594
Escherichia coli O157:H7 str. EC4113 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|444451.5.peg.807
Escherichia coli O157:H7 str. EC4196 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|478006.5.peg.4972
Escherichia coli O157:H7 str. EC4501 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|478007.5.peg.4979
Escherichia coli O157:H7 str. EC508 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|637388.3.peg.391
Escherichia coli O157:H7 str. FRIK2000 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|570506.3.peg.4941
Escherichia coli O157:H7 str. FRIK966 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|502346.5.peg.4301
Escherichia coli O157:H7 str. TW14588 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-SV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656379.3.peg.2727
Escherichia coli FVEC1302 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
P
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
S
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656380.3.peg.2286
Escherichia coli FVEC1412 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
P
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
S
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749549.3.peg.1836
Escherichia coli MS 198-1 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
P
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
S
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|585056.7.peg.2709
Escherichia coli UMN026 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAN
E
P
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
P
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
S
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|656437.3.peg.2463
Escherichia coli TA143 (28-788/863)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
S
I
LN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
S
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|749537.3.peg.2479
Escherichia coli MS 115-1 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
V
S
S
T
T
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
V
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IA
V
FA
AIEKGG
L
LEV
K
D
GG
L
A
FA
VDQ
KA
GGAIK
A
STR
V
MEVFGTNRLG
-
QF
E
I
KN
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VDSGGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
L
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GNV
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSY
R
MDL
Q
NGTTLEP
fig|550677.3.peg.1739
Escherichia coli B354 (1-761/836)
M
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
T
TIKGGRLI
I
EKDG
IT
V
FA
AIEKGG
L
LEV
K
EGG
L
A
FA
VDQ
KA
GGAIKT
T
TRAMEVFGTNRLG
-
QF
D
I
KD
G
I
ANNMLLENGGSLRVEE
------------------------------------------------------
ND
F
A
Y
NT
T
VD
N
GGLL
E
V
M
DGGTAT
G
V
D
K
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
A
W
T
I
IA
D
I
T
TT
NQNT
R
LNL
A
NLA
M
S
G
A
N
V
I
M
MAE-
-
-PV
T
R
S
SV
TASAENF
T
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRER
I
G
SVKG
V
NYDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|679205.4.peg.5511
Escherichia coli MS 124-1 (28-714/789)
L
A
S-
-
TV
-
EYGE
-
T
VDGVVLEKDIQ
LVY
G
T
AN
N
T
K
IN
P
GGEQHIKEFG
I
S
S
NT
E
I
N
GGYQYIE
MN
G
T
A
EY
SVLN
-
DGYQIVQ
M
GG
A
A
NQ
TTLNNGVLQVYGAAND
P
TIKGGRLIVEKDG
IT
V
LA
AIEKGG
L
LEV
K
EGG
L
A
IA
VDQ
--
-----------
-
--------
-
--
-
-
--
-
-
----------------
------------------------------------------------------
--
-
-
-
--
-
-------
-
-
-
------
-
-
-
-
K
AGG
K
LIVST
-
NALEVSGTNS
K
G
Q
-
FSI
K
D
--
GVS
K
NY
E
LD
D
GSGLIVMEDT
Q
A
I
DTIL
DEH
ATMQSLGKDT
---
G
T
R
VQANA
V
YDLGR
--------
S
DQ
NG
SITY
S
SK
A
I
SEN
MV
I
N
---
N
GRA
N
V
W
AGTM
VNV
SV
R
GN
D
G
I
L
E
VM
K
P
QI
N
Y
AP
AM
L
V
G
K
V
VVSE
GAS
F
RT
H
GAVDTS
K
ADVSL
--
E
NS
V
W
T
I
IA
D
I
T
TT
NQNT
L
LNL
A
NLA
M
S
D
A
N
V
I
M
MDE-
-
-PV
T
R
S
SV
TASAENF
I
T
---
LTTNTLSGNG
---
NFYMRTDMA
N
H
Q
SDQLNV
T
-
GQATGDFKIFVTDTGASPAAGDS
L
TLVTTGGG
-------------
DAAF
T
LGN
A
GGVVDIGTYEYTLLDNGNHSWSLAEN
--------------------------------------
--
-
RAQITPSTTDVLNMAAAQPLVFDAELDTVRERL
G
SVKG
V
S
YDT
--
AMWSSAINTRNNVTTDAGAGFEQT
-------
LTGLTLGIDSRF
S
REE
S
S
T
I
R
GL
F
--
FGYSHSDIGFDRGGK
-
GN
I
--
DSYTLGAYA
G
WEH
-
Q
NGAYVDGVVKVDRFANTIH
G
KMSNGATAFGDY
N
SNG
A
G
--
AHVESGFRW
--
VDGLWSVRPYLAFT
--
GFTT
-
DGQDYTLSNGMRADVGNTRILRAEAGTAVSYHMDL
Q
NGTTLEP
fig|595496.3.peg.2624
Escherichia coli BW2952 (271-1136/1211)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|536056.3.peg.1085
Escherichia coli DH1 (271-1136/1211)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|316385.7.peg.2835
Escherichia coli str. K-12 substr. DH10B (271-1136/1211)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|511145.12.peg.2743
Escherichia coli str. K-12 substr. MG1655 (271-1136/1211)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|511145.6.peg.2728
Escherichia coli str. K-12 substr. MG1655 (271-1136/1211)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|656419.3.peg.3416
Escherichia coli M718 (318-1181/1256)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
QIV
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKR
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPNPNPKPDPKPDPKPD
----
PKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
A
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
E
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|670888.3.peg.3667
Escherichia coli 1827-70 (318-1183/1258)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|344610.7.peg.3631
Escherichia coli 53638 (318-1183/1258)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|749544.3.peg.3054
Escherichia coli MS 175-1 (318-1183/1258)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|749540.3.peg.850
Escherichia coli MS 146-1 (318-1183/1258)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNMGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
K
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
LG
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
V
RITD
S
A
T
L
T
L
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPIPNPKPDPKPDPKPD
--
PNPKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
T
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
K
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
KS
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|216592.3.peg.2979
Escherichia coli 042 (374-1181/1256)
Y
G
L
A
TD
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
D
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
VN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
E
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
D
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTTSAKTQVNAGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
V
K
T
T
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
I
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
A
G
--
N
L
A
T
N
V
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GNG
G
A
MQ
N
LG
Q
D
F
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LL
I
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
V
V
RITD
S
A
T
L
TI
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTT
S
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPNPNPKPDPKPDPKPD
----
PKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
A
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DG
I
VK
LN
RF
E
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
R
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
E
S
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|478004.5.peg.2171
Escherichia coli O157:H7 str. EC4401 (318-1185/1260)
TVVTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANI
KG
GKQ
T
V
Y
G
L
A
TE
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
E
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
AN
G
T
A
EG
S
I
I
N
-
G
G
S
QIV
N
E
GG
L
A
EN
S
V
LN
D
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
G
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTSSDKTQVNTGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
I
K
T
T
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
V
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
S
G
--
N
L
A
T
N
M
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GKG
G
A
MQ
N
Q
G
Q
D
S
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AGT
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
A
I
RITD
S
A
T
L
TI
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
N
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTTN
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
S
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPNPNPNPNPKPDPKPDPKPDPKPDPTPEPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDS
PN
D
IP
E
G
I
A
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
E
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
H
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
E
S
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
ME
I
EP
fig|656444.3.peg.3758
Escherichia coli TA280 (547-1354/1429)
Y
G
L
A
TD
AN
I
-
E
S
GE
Q
I
VDG
--------
---
G
S
T
D
K
T
H
IN
-
GG
T
Q
TV
Q
N
Y
G
K
A
I
NT
D
I
V
S
G
L
Q
Q
I
M
VN
G
T
A
EG
S
I
I
N
-
G
G
S
Q
V
V
N
E
GG
L
A
EN
S
V
LN
E
-----------
-
-------------
--
-
--
----
GG
T
L
D
V
R
E
K
G
S
A
TG
I
Q
Q
SS
Q
GA
LVAT
TRA
T
R
V
T
GT
RAD
G
V
A
F
S
I
EQ
D
A
ANN
I
LL
A
NGG
V
L
T
VE
S
DTTSAKTQVNAGGREIVKTKATATGTTLTGGEQIVEGVANETTINDGGIQTVSA
N
G
E
A
V
K
T
T
IN
E
GG
T
L
T
V
N
D
N
G
K
AT
D
I
I
Q
N
S
G
A
A
L
QT
ST
A
N
G
I
E
I
SGT
H
Q
Y
G
T
-
FSI
A
G
--
N
L
S
T
N
A
L
L
E
N
G
G
N
L
L
V
L
AG
T
E
A
R
D
S
TV
GNG
G
A
MQ
N
LG
Q
D
F
---
A
T
K
V
N
S
GG
Q
Y
T
LGR
--------
S
KD
E
-
---F
Q
AL
A
R
A
E
D
LQ
V
A
---
G
G
T
A
I
V
Y
AG
A
L
ADA
SV
S
G
A
T
G
S
L
S
L
M
T
P
RD
N
V
T
P
VK
L
E
G
V
V
RITD
S
A
T
L
TI
G
NG
VDT
T
L
AD
L
T
AAS
R
G
S
V
W
L
N
SN
N
S
C
AG
TS
N
C
E
YR
V
N
S
L
L
L
N
D
G
D
V
Y
L
SAQT
A
APA
T
T
N
GI
------
Y
N
T
---
LTT
S
E
LSG
S
G
---
NFY
L
H
T
N
V
A
G
S
R
G
DQL
V
V
N
-
NN
ATG
N
FKIFV
Q
DTG
V
SP
Q
S
D
D
A
M
TLV
K
TGGG
-------------
DA
S
F
T
LGN
T
GG
F
VD
L
GTYEY
V
L
KS
D
GN
S
N
W
N
L
TN
D
VKPNPDPNPNPKPDPKPDPKPD
----
PKPDPTPDPTPT
PV
PEK
R
ITPST
A
A
VLNMAA
T
L
PLVFDAEL
NSI
RERL
N
I
M
K
A
S
P
HN
N
--
N
V
W
G
A
TY
NTRNNVTTDAGAGFEQT
-------
LTG
M
T
V
GIDSR
N
D
IP
E
G
I
A
T
L
G
A
F
--
M
GYSHS
H
IGFDRGG
H
-
G
S
V
--
G
SY
S
LG
G
YA
S
WEH
-
E
S
G
F
Y
L
DGVVK
LN
RF
E
S
N
V
A
G
KMS
S
G
GA
A
N
G
S
Y
R
SNG
L
G
--
G
H
I
E
T
G
M
R
F
--
T
DG
N
W
N
L
T
PY
A
S
L
T
--
GFT
A
-
D
NP
E
Y
H
LSNGM
E
S
K
SV
D
TR
S
I
YR
E
L
G
AT
L
SY
N
M
R
L
G
NG
MEV
EP
fig|749540.3.peg.3461
Escherichia coli MS 146-1 (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|585056.7.peg.5046
Escherichia coli UMN026 (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|340184.3.peg.2219
Escherichia coli B7A (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
I
H
N
A
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
V
R
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
A
S
YVKF
G
HGSAQHVR
AG
FRLGS
H
H
D
M
N
F
G
fig|340184.6.peg.2331
Escherichia coli B7A (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
I
H
N
A
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
V
R
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
A
S
YVKF
G
HGSAQHVR
AG
FRLGS
H
H
D
M
N
F
G
fig|670897.3.peg.1699
Escherichia coli 2362-75 (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
I
H
N
A
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
V
R
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
H
D
M
N
F
G
fig|656443.3.peg.2194
Escherichia coli TA271 (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
I
H
N
A
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|656419.3.peg.8
Escherichia coli M718 (208-933/1039)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SG
T
V
V
N
S
DG
W
QIV
K
N
GG
V
A
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
R
L
Q
V
D
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
N
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGS
T
QHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|753642.3.peg.40
Escherichia coli NC101 (124-934/1040)
LEGGTASDTVIRDGGGQSLNGLAVNTTLN
NRGEQWVHEGGVATGTIINRDGYQS
V
KSGGLAT
GT
IINTGAEGGPDS
D
N
SY
TG
Q
K
V
Q
G
T
A
ES
T
T
INKN
G
RQ
I
I
----------
LFS
G
I
A
R
D
T
L
I
Y
A
GG
D
Q
S
V
--
H
G
R
A
L
NT
T
L
N
GGYQY
VH
KD
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
A
V
GN
TT
V
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
R
V
H
A
GG
E
A
TA
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
G
I
NRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
L
T
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
TV
RGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
V
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
M
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
I
H
N
A
S
G
LWA
D
I
V
A
L
GT
R
HS
----
-
--
MKAS
T
DNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|656419.3.peg.3463
Escherichia coli M718 (124-843/949)
LEGGTASDTVIRDGGGQSLNGLAVNTTLN
NRGEQWVHEGGVATGTIINRDGYQS
V
KSGGLAT
GT
IINTGAEGGPDS
D
N
SY
TG
Q
K
V
Q
G
T
A
ES
T
T
INKN
G
RQ
I
I
----------
LFS
G
I
A
R
D
T
L
I
Y
A
GG
D
Q
S
V
--
H
G
R
A
L
NT
T
L
N
GGYQY
VH
KD
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
A
V
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
R
V
H
A
GG
E
A
TA
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
N
F
F
V
GN
G
M
A
D
N
VV
LENGG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
Q
K
T
R
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
H
F
DNQT
T
QEA
A
L
S
RA
V
A
K
S
N
S
P
V
T
FHK
LTT
T
N
L
T
G
Q
G
GTI
N
M
RV
SL
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
N
F
G
fig|199310.1.peg.3573
Escherichia coli CFT073 (208-936/1042)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
NG
G
T
A
TE
T
LI
N
R
DG
W
Q
V
I
K
E
GG
T
A
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
I
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
D
A
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSSR
T
GTF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
D
VAG
M
S
V
T
A
G
I
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|749531.3.peg.3947
Escherichia coli MS 69-1 (219-947/1043)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
NG
G
T
A
TE
T
LI
N
R
DG
W
Q
V
I
K
E
GG
T
A
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
S
T
N
VTL
A
-
--
S
D
A
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSSR
T
GTF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
D
VAG
M
S
V
T
A
G
I
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|199310.4.peg.3441
Escherichia coli CFT073 (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
NG
G
T
A
TE
T
LI
N
R
DG
W
Q
V
I
K
E
GG
T
A
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
I
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
D
A
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSSR
T
GTF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
D
VAG
M
S
V
T
A
G
I
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|655817.3.peg.3471
Escherichia coli ABU 83972 (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
NG
G
T
A
TE
T
LI
N
R
DG
W
Q
V
I
K
E
GG
T
A
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
I
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
T
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGI
V
R
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
W
Q
GSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|585055.6.peg.3393
Escherichia coli 55989 (205-933/1039)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
I
K
E
GG
T
T
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
LA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|585055.8.peg.3397
Escherichia coli 55989 (205-933/1039)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
I
K
E
GG
T
T
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
LA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|199310.1.peg.1230
Escherichia coli CFT073 (250-985/1091)
S
E
N
VS
TG
Q
M
V
G
G
I
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
I
A
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
R
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
R
T
L
VD
D
GG
T
L
D
V
R
N
GGTAT
A
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
T
F
T
L
A
GKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
D
I
IA
H
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|585056.7.peg.3580
Escherichia coli UMN026 (148-933/1039)
G
T
V
L
N
G
GEQW
I
H
A
GG
S
A
S
GT
V
IN
QS
GYQ
T
I
K
H
GG
Q
AT
GT
I
V
NTGAEGGP
E
S
E
N
VS
S
G
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
K
A
FSI
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGG
T
LAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
I
Q
GN
K
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NS
A
RLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|714962.3.peg.1146
Escherichia coli IHE3034 (208-842/948)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QI
I
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
TQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
IS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
S
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
S
N
GSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
H
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
N
F
G
fig|869729.3.peg.2606
Escherichia coli UM146 (208-842/948)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QI
I
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
TQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
IS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
S
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
S
N
GSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
H
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
N
F
G
fig|364106.7.peg.1218
Escherichia coli UTI89 (208-842/948)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QI
I
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
TQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
IS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
S
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
S
N
GSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
H
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
N
F
G
fig|364106.8.peg.1217
Escherichia coli UTI89 (208-842/948)
Y
G
D
A
VR
T
T
INKN
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QI
I
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
TQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
IS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
S
G
V
S
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
S
N
GSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
H
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
N
F
G
fig|216592.1.peg.5500
Escherichia coli 042 (230-867/973)
Q
F
V
R
G
N
A
VR
T
T
IN
E
N
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QIV
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
G
S
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
T
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
H
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|331111.3.peg.2475
Escherichia coli E24377A (230-867/973)
Q
F
V
R
G
N
A
VR
T
T
IN
E
N
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QIV
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
G
S
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
T
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
H
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|216592.1.peg.2454
Escherichia coli 042 (207-935/1041)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
I
K
E
GG
T
T
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
LA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
T
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
T
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|216592.3.peg.4683
Escherichia coli 042 (185-913/1019)
Q
M
V
G
G
T
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
M
A
R
D
T
L
I
Y
A
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
I
K
E
GG
T
T
AH
TT
I
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
K
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
LA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
T
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
T
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASS
G
NN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|340197.3.peg.1382
Escherichia coli F11 (208-936/1042)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
V
K
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
S
D
M
T
F
G
fig|362663.8.peg.328
Escherichia coli 536 (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
V
K
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
S
D
M
T
F
G
fig|362663.9.peg.327
Escherichia coli 536 (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
V
K
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
S
D
M
T
F
G
fig|340197.5.peg.1463
Escherichia coli F11 (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GGTAT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
I
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
S
L
T
G
N
GS
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
A
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSTR
T
GKF
V
-
-
--
------
P
A
T
---
L
KVKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
A
S
G
L
A
T
S
G
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
R
---
LQA
G
AFN
Y
S
L
NRDSDE
SW
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
V
K
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
V
T
A
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
S
D
M
T
F
G
fig|585397.7.peg.5145
Escherichia coli ED1a (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGL
L
T
ARGGSLAG
T
T
T
LN
N
G
A
T
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
T
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
T
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
T
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|585397.9.peg.5146
Escherichia coli ED1a (206-934/1040)
Q
M
V
G
G
T
A
ES
T
T
IN
N
N
G
RQ
V
I
----------
WSS
G
V
S
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
K
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VA
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
NT
R
VD
D
GG
T
L
D
V
R
N
GG
A
AT
T
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGL
L
T
ARGGSLAG
T
T
T
LN
N
G
A
T
LT
L
SGKTVNNDTLT
IR
E
G
DALL
Q
GG
T
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
T
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
DV
IA
Q
R
G
T
T
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|216592.3.peg.4997
Escherichia coli 042 (205-842/948)
Q
F
V
R
G
N
A
VR
T
T
IN
E
N
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QIV
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
G
S
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
T
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
H
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|331111.12.peg.5095
Escherichia coli E24377A (205-842/948)
Q
F
V
R
G
N
A
VR
T
T
IN
E
N
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
H
A
L
D
T
T
L
N
GGYQY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QIV
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
T
V
EN
G
N
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
W
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
D
V
T
M
T
S
G
S
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-
D
I
V
N
A
G
E
I
R
F
DNQT
T
PDA
A
L
S
RA
V
A
KG
DS
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
R
L
D
-G
S
N
T
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
T
S
G
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LY
T
SM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
H
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
L
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
S
F
G
fig|655817.3.peg.1298
Escherichia coli ABU 83972 (198-933/1039)
S
E
N
VS
TG
Q
M
V
G
G
I
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
I
A
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
R
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
R
T
L
VD
D
GG
T
L
D
V
R
N
GGTAT
A
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
T
F
T
L
A
GKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
D
I
IA
H
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|199310.4.peg.1208
Escherichia coli CFT073 (198-933/1039)
S
E
N
VS
TG
Q
M
V
G
G
I
A
ES
T
T
INKN
G
RQ
V
I
----------
WSS
G
I
A
R
D
T
L
I
Y
T
GG
D
Q
TV
--
H
G
E
A
H
NT
R
L
E
GG
N
QY
VH
KY
G
L
A
LN
T
V
I
N
E
G
G
W
Q
V
V
K
A
GG
T
A
GN
TT
I
N
Q
-----------
-
-------------
--
-
--
----
N
G
E
L
R
V
H
A
GG
E
A
SD
V
T
Q
NT
GGA
LV
TST
A
A
T
-
V
T
GTNRLG
-
A
F
S
V
VE
G
K
A
D
N
VV
LENGG
R
L
D
V
LS
------------------------------------------------------
GH
T
A
T
R
T
L
VD
D
GG
T
L
D
V
R
N
GGTAT
A
V
S
M
G
N
GG
V
L
LADS
-
-
GAA
VSGT
R
S
D
G
T
A
F
R
I
G
G
--
G
QA
D
AL
M
L
E
K
GS
SFTLNAGD
T
A
T
DT
TV
--N
GGLF
T
ARGGSLAG
T
T
T
LN
N
G
A
T
F
T
L
A
GKTVNNDTLT
IR
E
G
DALL
Q
GG
A
L
T
G
N
GR
V
E
KSG
S
G
TL
T
V
S
NT
T
L
TQK
A
V
N
L
N
E
G
T
L
T
LN
D
S
TV
T
T
D
I
IA
H
R
G
T
A
LKLT
G
ST
V
-
L
N
GA
I
D
PT
N
VTL
T
-
--
S
GA
T
W
N
I
PD
N
A
T
VQ
S
---
-
--
V
V
D
DL
S
H
A
G
Q
I
H
F
TSAR
T
GKF
V
-
-
--
------
P
T
T
---
L
Q
VKN
L
N
G
QNGTISLRV
R
P
DMA
Q
N
N
A
D
R
L
VI
D
G
G
R
ATG
KTILNLVNA
G
N
S
-----
G
T
G
L
A
TTG
K
G
IQVVEAINGATTEEG
AF
V
Q
GN
M
---
LQA
G
AFN
YTL
NRDSDE
SW
Y
L
RSE
--------------------------------------
ER
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
VAG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
V
RD
D
AGS
LG
G
Y
M
N
LT
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
R
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
R
L
QY
T
WQ
G
LSLD
DG
K
D
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
fig|656417.3.peg.5254
Escherichia coli M605 (209-842/948)
G
N
A
VR
T
T
IN
E
N
G
RQ
I
V
----------
AAE
G
T
AN
T
T
V
VY
A
GG
D
Q
TV
--
H
G
Y
A
L
D
T
T
L
N
GG
N
QY
VH
NG
G
T
A
SD
T
V
V
N
S
DG
W
QI
I
K
E
GG
L
A
DF
TT
V
N
Q
-----------
-
-------------
--
-
--
----
K
G
K
L
Q
V
N
A
GG
T
A
TN
V
TL
KQ
GGA
LV
TST
A
A
T
-
V
T
G
S
NRLG
-
N
F
A
V
EN
G
K
A
D
G
VV
LE
S
GG
R
L
D
V
L
E
------------------------------------------------------
GH
S
A
R
K
T
L
VD
D
GG
T
L
A
V
S
A
GG
K
AT
G
V
T
M
T
S
GG
A
LI
ADS
-
-
GA
T
V
E
GTN
A
S
G
K
-
FSI
D
G
TS
G
QA
S
G
L
L
L
E
N
G
GSFT
V
NAG
G
L
A
S
N
T
TV
GHR
G
T
L
TL
A
A
GGSL
S
G
R
T
Q
L
SK
G
A
S
MV
L
N
G
--------
-
--
-
-
----
-
--
-
-
---
--
-
-
---
-
---
-
-
-
----
---
--
-
--
-
-
-
-
-
--
-
-
--
-
-
DV
VS
-
-
-
-
-
---T
G
--
-
-
-
-
------
-
----
-
--
-
--
-
-
-
-
--
-
-
-
--
-
---
-
--
-
-
-NI
V
N
A
G
E
I
H
F
DNQT
T
QDA
V
L
S
RA
V
A
KGA
S
P
V
T
FHK
LTT
S
N
L
T
G
Q
G
GTI
N
M
RV
SL
D
-G
S
N
A
SDQL
VI
N
G
GQATG
KT
W
L
AF
T
N
V
G
N
S
-----
N
L
G
V
A
TTG
Q
G
I
R
VV
D
A
Q
NGATTEEG
AF
A
L
SR
P
---
LQA
G
AFN
YTL
NRDSDE
D
W
Y
L
RSE
--------------------------------------
NA
Y
RA
EV
-
P
LYASM
L
----
T
Q
AMDY
D
RI
L
AGS
R
SHQ
T
G
V
N
G
E
N
NSVRLSIQGGHLGHD
NN
GGIAR
GA
TP
E
SSGSYGFVR
L
E
G
DL
L
----
R
T
E
I
AG
M
S
L
T
T
G
V
Y
GAA
G
H
S
SV
D
VKD
D
D
G
SRA
G
T
L
RD
D
AGS
LG
G
Y
L
N
L
V
H
T
S
S
G
LWA
D
I
V
AQGT
R
HS
----
-
--
MKASSDNN
D
F
R
AR
G
W
G
WLGSL
E
T
G
LPFSIT
D
N
L
M
-
LE
P
Q
L
QY
T
WQ
G
LSLD
DGQD
----
N
AGYVKF
G
HGSAQHVR
AG
FRLGS
H
N
D
M
T
F
G
Consen1
Primary consensus
LEGGTASDTVIRDGGGQSLNGLAVNTTLNtvvtgsravdtiinangkmdvygkdVgtvlnsaGTqtiyasatsdkani
gkQ
V
G
A
-
Tv
-
eyGe
-
vdgvvlekdiq
G
an
T
in
GGeQhikefG
s
nT
i
GGyQYie
G
A
svlN
dGyQiVq
GG
A
TTlNngvlqvygaand
tikggrlivekdg
v
aiekgG
LeV
eGG
A
Vdq
GGAiktsTramevfGtNRLG
-
qF
i
G
AnNmlLENGGsLrVee
------------------------------------------------------
nd
A
nT
VDsGGlL
V
dGGtaT
V
k
aGg
Livst
-
naleVSGtns
G
-
FSI
d
--
Gvs
ny
Ld
Gsglivmedt
A
DTil
atmqslgkdt
---
T
vqana
ydLgr
--------
s
ng
s
a
sen
i
---
Gra
V
agTm
sV
gn
G
L
vm
p
n
ap
l
G
v
Gas
rt
gavDts
advsl
--
ns
W
i
d
t
nqnt
lnl
nla
s
a
v
m
-
t
s
tasaenf
T
---
LttntLsGng
---
nfymrtDmA
h
sDqLnv
-
GqATGdfkifvtdtGaSpaagds
tLvTtGgG
-------------
daAF
lGN
ggvvdiGtyeYtLldngnhsWsLaen
--------------------------------------
-
RAqitPsttdvLnmaaaQplvfDaeLdtvRerl
sVkg
nydt
--
amwssaintrNNvttdaGAgfEqt
-------
LtGltLgidsRf
ree
S
i
Gl
--
fGyShsDigfDrGgk
-
Gnv
--
DsytLGaYa
weH
-
nGayvDgVvkvdRfantih
kmsngatafgDy
snG
G
--
ahvEsGfrw
--
vDgLwsvrPyLafT
--
Gftt
-
DGqDytlsNgmradvGntrilraeAGtavsyHmDl
nGttlEP
Consen2
Secondary consensus
nrgeqwvheggvatgtiinrdgyqs
ksgglat
iintgaeggpds
n
tg
t
inkn
rq
i
----------
tr
vy
d
tv
--
h
a
d
l
n
vh
ti
g
w
v
k
q
-----------
-------------
-
----
k
a
tl
lvat
avt
-
at
s
a
v
d
vv
r
d
ls
gh
k
t
n
kv
m
a
lads
-
gaa
prq
a
g
qa
al
e
gsftlnagd
tv
gglf
arggslag
ln
gg
lt
sgktvnndtlt
e
-
q
s
gd
v
ksg
tl
nt
l
a
la
ln
s
t
dv
q
a
st
-
l
ngi
pt
vtl
-
ga
n
n
c
---
--
v
dl
g
i
f
t
v
-
------
p
kvkn
n
qngtislrvhp
v
n
a
r
vi
g
r
ktilnlvna
n
-----
g
a
s
k
iqvveaingatteeg
q
---
lqa
afn
s
nrdsden
y
rse
y
ev
-
lyasm
----
t
amdy
ri
ags
shq
g
na
snsvrlsiqgghlghd
ggiar
tp
ssgsygfvr
e
dl
----
t
vag
t
gaa
h
sv
vkd
d
sra
tird
ags
g
l
lt
t
s
lwa
i
aqgt
hs
----
--
mkassdnn
f
ar
wlgsl
t
lpfsit
n
m
-
le
q
qy
wq
lsld
k
----
agyvkf
hgsaqhvr
frlgs
n
m
f
mev
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character