fig|1040638.4.peg.5357
Escherichia coli O104:H4 str. LB226692
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
F
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
Y
Q
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
N
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQL
L
WCQK
fig|585055.6.peg.358
Escherichia coli 55989
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
F
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
Y
Q
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
N
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQL
L
WCQK
fig|585055.8.peg.359
Escherichia coli 55989
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
F
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
Y
Q
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
N
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQL
L
WCQK
fig|573235.3.peg.378
Escherichia coli O26:H11 str. 11368
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
F
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585396.4.peg.388
Escherichia coli O111:H- str. 11128
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
I
P
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
F
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656437.3.peg.422
Escherichia coli TA143
MTMIT
N
SLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
V
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKP
V
LIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
Q
D
KQ
L
IELPE
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTAD
I
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|413997.3.peg.319
Escherichia coli B str. REL606
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|511693.5.peg.324
Escherichia coli BL21
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|469008.4.peg.3424
Escherichia coli BL21(DE3)
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|536056.3.peg.3452
Escherichia coli DH1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656414.3.peg.532
Escherichia coli H736
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|83333.1.peg.341
Escherichia coli K12
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749547.3.peg.4285
Escherichia coli MS 187-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|316407.3.peg.332
Escherichia coli W3110
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|511145.12.peg.352
Escherichia coli str. K-12 substr. MG1655
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|511145.6.peg.349
Escherichia coli str. K-12 substr. MG1655
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|670888.3.peg.920
Escherichia coli 1827-70
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|316401.4.peg.463
Escherichia coli ETEC H10407
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
L
T
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749538.3.peg.1483
Escherichia coli MS 116-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
V
R
E
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|481805.3.peg.3521
Escherichia coli ATCC 8739
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPT
S
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|481805.6.peg.3504
Escherichia coli ATCC 8739
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPT
S
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|409438.11.peg.492
Escherichia coli SE11
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
Y
T
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWK
V
A
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|344610.7.peg.1335
Escherichia coli 53638 (1-1022/1048)
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWC
fig|595496.3.peg.4092
Escherichia coli BW2952 (60-1080/1080)
ITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656443.3.peg.590
Escherichia coli TA271
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
L
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWK
V
A
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|679207.4.peg.2129
Escherichia coli MS 107-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKA
T
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656408.3.peg.276
Escherichia coli H591
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|679206.4.peg.209
Escherichia coli MS 119-7
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|679205.4.peg.45
Escherichia coli MS 124-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749533.3.peg.1457
Escherichia coli MS 84-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|331112.3.peg.381
Escherichia coli HS
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
D
V
PLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|331112.6.peg.401
Escherichia coli HS
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
D
V
PLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|331111.12.peg.682
Escherichia coli E24377A
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|331111.3.peg.2916
Escherichia coli E24377A
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
S
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|439855.10.peg.548
Escherichia coli SMS-3-5
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHP
H
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWL
D
CDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
K
G
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKP
V
LIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGE
M
P
L
-
DVAP
Q
D
KQ
L
IELPE
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSV
A
L
P
S
AP
HAIP
Q
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
I
RAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AE
V
AL
LQ
CTAD
I
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656444.3.peg.812
Escherichia coli TA280
MT
I
ITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
Y
F
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERG
D
YADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
T
EACD
VGFR
EVRIEN
GL
LLL
N
GKP
V
LIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
I
M
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
H
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AE
V
AL
LQ
CTAD
I
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YT
Q
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|431946.3.peg.323
Escherichia coli SE15
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
L
Q
L
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
I
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
V
S
V
G
V
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
N
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749537.3.peg.28
Escherichia coli MS 115-1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
V
RP
S
P
QLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTV
I
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
IN
ES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656393.3.peg.991
Escherichia coli H299
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
S
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKP
V
LIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWS
V
KKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
L
AE
V
AL
LQ
CTAD
I
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
V
S
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
L
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
M
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656379.3.peg.1218
Escherichia coli FVEC1302
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADT
I
V
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
V
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
V
K
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLR
A
Q
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656380.3.peg.973
Escherichia coli FVEC1412
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADT
I
V
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
V
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
V
K
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLR
A
Q
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585056.7.peg.585
Escherichia coli UMN026
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADT
I
V
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
V
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
V
K
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLR
A
Q
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749545.3.peg.2116
Escherichia coli MS 182-1
MITDSL
V
VVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
S
T
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERG
N
YADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQ
T
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
W
K
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKA
T
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|670897.3.peg.365
Escherichia coli 2362-75
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MV
L
RDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|216593.1.peg.3252
Escherichia coli E2348/69
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
K
VTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MV
L
RDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
T
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|595495.4.peg.530
Escherichia coli KO11
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKA
T
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|566546.3.peg.2446
Escherichia coli W
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKA
T
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|566546.4.peg.420
Escherichia coli W
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SH
I
IP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
L
LSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKA
T
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|216592.1.peg.4881
Escherichia coli 042
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
A
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
V
R
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
V
K
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|216592.3.peg.401
Escherichia coli 042
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
A
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
V
R
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
LY
RA
V
V
K
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELP
V
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
W
H
Q
WR
L
A
EN
LSVTL
P
A
A
SH
S
IP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|550677.3.peg.777
Escherichia coli B354
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DE
C
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GT
T
PFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
A
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
K
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTA
N
T
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
V
T
S
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585034.4.peg.339
Escherichia coli IAI1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
V
RP
S
P
QLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTV
I
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
S
T
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEA
S
D
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
T
W
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWK
V
A
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585034.5.peg.338
Escherichia coli IAI1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
V
RP
S
P
QLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTV
I
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
S
T
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEA
S
D
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
Y
QQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
H
V
VQPNA
T
T
W
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWK
V
A
G
HY
Q
AEAAL
LQ
C
S
ADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|701177.3.peg.444
Escherichia coli O55:H7 str. CB9615
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|362663.8.peg.411
Escherichia coli 536
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
V
G
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAG
H
YHYQLVWCQK
fig|362663.9.peg.409
Escherichia coli 536
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
V
G
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAG
H
YHYQLVWCQK
fig|585395.4.peg.341
Escherichia coli O103:H2 str. 12009
MTMITDSL
V
VVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
S
T
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEA
S
D
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RW
N
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
A
N
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
I
RAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|714962.3.peg.343
Escherichia coli IHE3034
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|869729.3.peg.3338
Escherichia coli UM146
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|364106.7.peg.484
Escherichia coli UTI89
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|364106.8.peg.482
Escherichia coli UTI89
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|753642.3.peg.1285
Escherichia coli NC101
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
S
V
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
I
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
V
G
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|83334.1.peg.474
Escherichia coli O157:H7
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|155864.1.peg.393
Escherichia coli O157:H7 EDL933
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|155864.8.peg.391
Escherichia coli O157:H7 EDL933
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|637388.3.peg.876
Escherichia coli O157:H7 str. FRIK2000
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|570506.3.peg.1884
Escherichia coli O157:H7 str. FRIK966
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|386585.9.peg.492
Escherichia coli O157:H7 str. Sakai
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|502346.5.peg.871
Escherichia coli O157:H7 str. TW14588
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|679204.3.peg.2063
Escherichia coli MS 145-7
MITDSL
V
VVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
S
T
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
S
T
Q
I
S
DF
H
V
A
T
H
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEA
S
D
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
A
N
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
I
RAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|405955.13.peg.353
Escherichia coli APEC O1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPH
L
WR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|405955.9.peg.288
Escherichia coli APEC O1
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPH
L
WR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|685038.3.peg.330
Escherichia coli O83:H1 str. NRG 857C
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
I
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
V
S
V
G
V
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|655817.3.peg.465
Escherichia coli ABU 83972
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|199310.1.peg.438
Escherichia coli CFT073
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|550676.3.peg.1133
Escherichia coli B185
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
I
P
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
I
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
S
ES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
T
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYG
S
HQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQL
L
WCQK
fig|444449.5.peg.5201
Escherichia coli O157:H7 str. EC4042
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|444448.5.peg.3077
Escherichia coli O157:H7 str. EC4045
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|444450.8.peg.539
Escherichia coli O157:H7 str. EC4115
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|444447.5.peg.3247
Escherichia coli O157:H7 str. EC4206
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|544404.4.peg.404
Escherichia coli O157:H7 str. TW14359
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585057.4.peg.362
Escherichia coli IAI39
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
N
V
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGY
V
D
L
V
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
Q
D
KQ
V
IELPE
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEA
R
H
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585057.6.peg.361
Escherichia coli IAI39
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
K
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
N
V
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGY
V
D
L
V
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
Q
D
KQ
V
IELPE
-
LPQ
PESTGQLW
L
TV
H
V
VQPNA
T
AW
SEA
R
H
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656417.3.peg.481
Escherichia coli M605
MTMITDSLAVVLQRRDWEN
P
S
V
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
S
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TSE
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
V
S
V
G
V
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
N
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749550.3.peg.1823
Escherichia coli MS 200-1
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
V
G
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAG
H
YHYQLVWCQK
fig|585035.6.peg.346
Escherichia coli S88
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FL
H
A
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|478004.5.peg.2674
Escherichia coli O157:H7 str. EC4401
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
V
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
H
HS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656440.3.peg.222
Escherichia coli TA206
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
I
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585397.7.peg.367
Escherichia coli ED1a
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
E
D
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SE
G
GH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|585397.9.peg.368
Escherichia coli ED1a
MTMITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
E
D
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
S
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SE
G
GH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|525281.3.peg.2555
Escherichia coli 83972
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|749528.3.peg.1518
Escherichia coli MS 45-1
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656419.3.peg.538
Escherichia coli M718
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
V
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQ
L
F
PAV
P
KWSIKKWLSLPGE
L
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
L
E
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELP
G
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
V
S
VDV
E
VAS
N
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPL
P
D
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|199310.4.peg.431
Escherichia coli CFT073
MITDSLAVVLQRRDWEN
PGV
T
QL
NRLA
AHPP
F
A
S
WRNSEE
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
A
E
NPTG
C
Y
SLT
F
N
I
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
S
X
L
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
Q
V
T
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
E
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQ
Y
FQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NE
F
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DV
G
P
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
A
SHAIP
Q
LT
TS
G
T
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGD
E
KQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
H
G
EM
V
I
N
VDV
A
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQV
S
ER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|478006.5.peg.2483
Escherichia coli O157:H7 str. EC4501
MTMITDS
S
AVVLQRRD
R
EN
PGV
T
Q
V
NRLA
AHPP
F
A
S
WRNSEE
ART
--
N
RP
S
QQLRS
L
N
G
E
W
Q
F
V
W
F
PA
P
EA
VPE
SWLECDLP
D
ADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
L
S
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
L
F
NDDFSR
A
V
L
EA
EV
QM
Y
G
--
ELRD
E
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
L
G
LN
VE
N
P
K
L
WSAE
I
P
N
I
Y
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
S
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
M
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
L
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
R
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
T
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
V
IELPE
-
LP
R
L
ESTGQLW
L
TV
H
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
S
AP
HAIP
Q
LT
TSE
T
DFCIEL
D
NKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITT
V
HAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
T
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|316385.7.peg.1378
Escherichia coli str. K-12 substr. DH10B
MTMITDSLAVV
-------------------------------
ART
--
D
RP
S
QQLRS
L
N
G
E
W
R
F
AW
F
PA
P
EA
VPE
SWLECDLPEADTVV
VP
SN
WQM
H
G
YDAPI
YT
NVTY
P
ITVNP
PFVP
T
E
NPTG
C
Y
SLT
F
N
V
DES
W
L
Q
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
RW
VG
YGQD
SRL
PS
EFD
L
SA
FLRA
G
E
N
R
L
A
V
M
V
LR
W
S
D
GS
Y
L
EDQDMW
RMS
GIFRDV
S
LL
H
K
P
T
T
Q
I
S
DF
H
V
A
T
R
F
NDDFSR
A
V
L
EA
EV
QM
C
G
--
ELRD
Y
LRVTVS
L
WQ
GE
T
Q
V
A
S
GTAPFGGEI
ID
ERGGYADRV
T
---
LRLN
VE
N
P
K
L
WSAE
I
P
N
LY
RA
V
VE
L
HT
ADG
TLI
E
AEACD
VGFR
EVRIEN
GL
LLL
N
GKPLLIR
GVNRH
EHHPLH
G
QVMDEQTMVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
H
P
LW
Y
T
LCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
TDDP
R
W
L
P
AMS
ER
VT
R
MVQRDR
NHPS
V
IIWSLGNESG
H
G
A
N
HD
A
L
Y
RWI
K
SV
D
PS
R
P
V
Q
YE
GGGA
D
T
T
AT
DII
CP
MY
A
RV
DEDQP
F
PAV
P
KWSIKKWLSLPGE
T
R
P
L
I
L
CEYAHAMGN
SL
GG
FAK
Y
WQA
F
RQYPRL
QG
GF
VW
D
W
V
D
QSLIKY
D
E
NGN
P
W
SAY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VFA
D
R
TP
H
P
A
L
T
E
A
K
HQQQFFQFRLSG
Q
T
---
IE
V
TSEYL
F
RHS
D
NEL
LH
W
M
V
A
L
D
G
KP
LA
SGEVP
L
-
DVAP
QGKQ
L
IELPE
-
LPQ
PES
A
GQLW
L
TV
R
V
VQPNA
T
AW
SEAGH
IS
A
WQ
Q
WR
L
A
EN
LSVTL
P
A
A
SHAIP
H
LT
TSE
M
DFCIELGNKRW
--
Q
F
NR
Q
SG
FLSQMWIGDKKQ
L
LTPLRDQ
F
TRAPL
DN
DIGVSEATRIDPNAWVERWKAA
G
HY
Q
AEAAL
LQ
CTADT
L
A
D
A
V
LITTAHAW
Q
HQ
G
KT
L
F
IS
-------------
R
K
TY
--
RI
DG
S
GQ
M
A
I
T
VDV
E
VAS
D
T
PH
PAR
-
IG
L
N
CQLAQVAER
V
NWL
G
L
GP
Q
ENY
P
D
RLT
A
ACF
D
R
W
DLPLSD
M
YTP
Y
V
FP
SE
NG
L
R
CGT
R
ELNYGPHQWR
G
-------
DFQ
F
NIS
RY
S
Q
QQLMETS
H
RHL
L
HAEEGTW
LN
I
D
GFHM
G
IG
G
DD
SW
SPS
VSAEFQLSAGRYHYQLVWCQK
fig|656440.3.peg.3093
Escherichia coli TA206 (12-1026/1047)
I
L
S
R
E
DW
Q
N
QN
I
TH
L
NRL
P
AHP
S
F
S
S
WR
D
V
S
A
AR
D
--
N
RP
S
DRR
R
R
L
D
G
E
W
Q
F
S
Y
ARS
P
F
QV
DA
SWL
LH
DLP
D
SRPTP
VP
SN
WQMEG
YDAPI
Y
S
NV
R
Y
P
I
GTM
P
P
R
VP
E
DNPTG
L
Y
SL
L
L
T
V
DE
A
W
RT
EG
QT
R
I
I
FDGV
NSA
F
HLWC
NG
E
W
VG
Y
S
QD
SRL
P
A
A
FD
L
S
PY
LR
Q
GDN
R
LCV
M
VM
R
W
S
A
G
T
W
L
EDQDMW
RMS
GIFR
S
V
W
LL
H
K
P
QQ
H
L
S
D
V
H
L
T
P
Q
P
D
D
LCR
DA
Q
L
QVS
L
S
V
S
AEP
E
VLP
A
L
E
V
E
VS
L
W
E
GE
T
R
I
A
GERR
P
P
G
TPV
ID
ERG
S
Y
S
E
R
AV
---
F
S
L
D
V
R
R
P
R
L
WSAE
Q
P
H
C
Y
RA
V
V
S
L
WHE
D
-
TL
LE
S
EA
W
D
I
GFR
R
V
E
I
RDGL
LLL
N
GKPLLIR
GVNRH
EHH
HR
R
G
QV
I
S
E
E
D
MVQ
D
IL
LMKQ
N
N
F
N
A
VR
CS
HYPN
A
PR
W
YELCD
R
YGL
Y
V
VD
E
ANI
E
T
HG
MV
---
PMN
R
L
S
DDP
D
W
L
A
A
Y
S
A
RI
T
R
MVQ
S
N
R
NHPSIIIWSLGNESG
C
G
D
N
H
Q
AMY
Q
W
L
K
HN
D
PS
R
P
V
Q
YE
GGGA
D
S
S
AT
DI
L
CP
MY
A
RV
ER
DQP
F
PAV
P
KWSIKKW
I
SLPGE
Q
R
P
L
I
L
CEYAHAMGN
SL
G
N
FA
D
Y
WQA
F
R
D
YPRL
QG
GF
I
W
D
W
A
D
LA
I
E
K
TFP
D
G
S
V
G
W
AY
GGD
F
GD
T
PN
DRQ
FC
MN
GL
VF
P
D
R
RA
H
P
S
L
I
E
A
R
H
A
QQ
Y
F
R
F
A
L
L
G
Q
NPLR
I
A
V
TSEYL
F
R
S
T
D
NE
TL
R
W
R
V
E
C
A
GET
I
A
SG
D
L
T
L
-
A
L
P
P
QG
R
A
E
L
T
L
S
SN
L
AL
P
RG
AR
D
V
W
L
Q
L
D
V
I
QP
Q
A
T
AW
S
D
P
GH
R
V
A
WQ
Q
L
PL
A
TP
L
VICEA
S
A
GGP
A
P
V
LT
T
E
P
T
IYQV
S
V
G
S
W
RW
--
IID
R
Q
SG
H
LS
E
W
YA
G
GE
Q
Q
L
LTPL
E
DQ
F
V
RAPL
DN
D
T
GVSEA
GH
IDPNAW
T
ERWK
RS
GL
Y
Q
LT
A
RC
VE
C
R
A
Q
Q
L
ERE
V
I
I
DSHWHYL
H
G
GE
T
V
IIS
H
-----------
W
R
M
T
F
--
TA
D
-
-
G
RL
R
L
T
IN
G
K
R
A
ETL
P
P
L
AR
-
IG
L
RF
Q
V
A
D
QH
T
E
V
S
WL
G
L
GP
H
ENY
P
D
R
K
S
S
ACF
S
R
WR
LPLS
E
M
S
TP
Y
I
FP
T
E
NG
L
R
C
DC
K
A
L
DW
G
CW
H
VT
G
-------
Q
F
H
FS
VQP
Y
S
T
E
QLM
T
T
D
H
W
H
R
M
TP
E
K
G
V
W
I
T
LD
G
Q
HM
G
V
G
G
DD
SW
I
PS
V
LP
QW
L
L
LDT
QW
HYQ
I
V
fig|439855.10.peg.3530
Escherichia coli SMS-3-5 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLS
E
GW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340197.3.peg.2355
Escherichia coli F11 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPI
N
VPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340197.5.peg.2470
Escherichia coli F11 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPI
N
VPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749550.3.peg.994
Escherichia coli MS 200-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPI
N
VPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|550677.3.peg.3640
Escherichia coli B354 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLA
S
SPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKL
C
DVAPNSEA
S
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656379.3.peg.3815
Escherichia coli FVEC1302 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656380.3.peg.3734
Escherichia coli FVEC1412 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749549.3.peg.5150
Escherichia coli MS 198-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585056.7.peg.3735
Escherichia coli UMN026 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|562.376.peg.3945
Escherichia coli WV_060327 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585057.4.peg.3696
Escherichia coli IAI39 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISR
V
TDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHA
V
KALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585057.6.peg.3705
Escherichia coli IAI39 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISR
V
TDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHA
V
KALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|216592.1.peg.3995
Escherichia coli 042 (55-1031/1081)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANII
E
IWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|216592.3.peg.3502
Escherichia coli 042 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANII
E
IWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656437.3.peg.3494
Escherichia coli TA143 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANII
E
IWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|6666666.5357.peg.4456
Escherichia coli TY-2482 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585055.6.peg.3525
Escherichia coli 55989 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585055.8.peg.3528
Escherichia coli 55989 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|344601.3.peg.2000
Escherichia coli B171 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|344601.5.peg.2088
Escherichia coli B171 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340185.3.peg.440
Escherichia coli E22 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340185.4.peg.480
Escherichia coli E22 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656408.3.peg.3488
Escherichia coli H591 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585034.4.peg.3158
Escherichia coli IAI1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGM
H
CTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585034.5.peg.3156
Escherichia coli IAI1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGM
H
CTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|595495.4.peg.853
Escherichia coli KO11 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|679206.4.peg.1442
Escherichia coli MS 119-7 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|409438.11.peg.3525
Escherichia coli SE11 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|566546.3.peg.528
Escherichia coli W (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|566546.4.peg.3306
Escherichia coli W (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656443.3.peg.3957
Escherichia coli TA271 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|1040638.4.peg.2230
Escherichia coli O104:H4 str. LB226692 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSK
A
SRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|481805.3.peg.663
Escherichia coli ATCC 8739 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|481805.6.peg.660
Escherichia coli ATCC 8739 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749537.3.peg.1154
Escherichia coli MS 115-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTF
T
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|344610.3.peg.2747
Escherichia coli 53638 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRD
I
APNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|344610.7.peg.3168
Escherichia coli 53638 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
S
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRD
I
APNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749531.3.peg.2494
Escherichia coli MS 69-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDIS
T
MVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
A
TTLEY
S
LFDGE
R
VVHSS
S
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGN
I
LEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKL
C
DVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585395.4.peg.3995
Escherichia coli O103:H2 str. 12009 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLR
G
F
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|331112.3.peg.3045
Escherichia coli HS (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLS
Y
EVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSS
T
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
Y
P
QENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|331112.6.peg.3182
Escherichia coli HS (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLS
Y
EVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSS
T
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
Y
P
QENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|362663.8.peg.3196
Escherichia coli 536 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAK
V
LDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|362663.9.peg.3207
Escherichia coli 536 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
L
VHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAK
V
LDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSD
D
EVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWRYTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|685038.3.peg.3127
Escherichia coli O83:H1 str. NRG 857C (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
Q
PQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFV
I
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|562.375.peg.716
Escherichia coli EC4100B (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQE
N
EGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585396.4.peg.4035
Escherichia coli O111:H- str. 11128 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQE
N
EGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|573235.3.peg.4276
Escherichia coli O26:H11 str. 11368 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQE
N
EGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656393.3.peg.4130
Escherichia coli H299 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
L
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAK
V
LDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQ
A
VP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|670888.3.peg.3177
Escherichia coli 1827-70 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSS
T
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLN
F
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
Y
P
QENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|753642.3.peg.4885
Escherichia coli NC101 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFV
I
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|431946.3.peg.3050
Escherichia coli SE15 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSA
R
F
D
FTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKL
C
DVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|550672.3.peg.3320
Escherichia coli B088 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340186.3.peg.460
Escherichia coli E110019 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340186.5.peg.482
Escherichia coli E110019 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|679207.4.peg.3792
Escherichia coli MS 107-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|679204.3.peg.802
Escherichia coli MS 145-7 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749545.3.peg.521
Escherichia coli MS 182-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749532.3.peg.591
Escherichia coli MS 78-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656440.3.peg.3312
Escherichia coli TA206 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|656444.3.peg.4335
Escherichia coli TA280 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDIS
T
MVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSS
T
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKD
T
DGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749547.3.peg.4021
Escherichia coli MS 187-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHA
S
QHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|331111.12.peg.3812
Escherichia coli E24377A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
E
I
SSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|331111.3.peg.1228
Escherichia coli E24377A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
E
I
SSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
T
H
GELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|679205.4.peg.424
Escherichia coli MS 124-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NL
S
ASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749533.3.peg.3555
Escherichia coli MS 84-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NL
S
ASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340184.3.peg.1962
Escherichia coli B7A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
K
TLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|340184.6.peg.2054
Escherichia coli B7A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
K
TLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
V
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585397.7.peg.3741
Escherichia coli ED1a (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585397.9.peg.3738
Escherichia coli ED1a (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|714962.3.peg.3544
Escherichia coli IHE3034 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|585035.6.peg.3405
Escherichia coli S88 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|869729.3.peg.194
Escherichia coli UM146 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|364106.7.peg.3479
Escherichia coli UTI89 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|364106.8.peg.3480
Escherichia coli UTI89 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|701177.3.peg.3815
Escherichia coli O55:H7 str. CB9615 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|637912.3.peg.2287
Escherichia coli OP50 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTP
R
PGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656417.3.peg.3914
Escherichia coli M605 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLA
T
SPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSA
R
F
D
FTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKL
C
DVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|525281.3.peg.3607
Escherichia coli 83972 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|655817.3.peg.3655
Escherichia coli ABU 83972 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|199310.1.peg.3747
Escherichia coli CFT073 (63-1039/1089)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|199310.4.peg.3609
Escherichia coli CFT073 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749546.3.peg.2147
Escherichia coli MS 185-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749528.3.peg.434
Escherichia coli MS 45-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSA
R
FAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
N
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|413997.3.peg.3084
Escherichia coli B str. REL606 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|511693.5.peg.3093
Escherichia coli BL21 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|469008.4.peg.681
Escherichia coli BL21(DE3) (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
S
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|155864.8.peg.3885
Escherichia coli O157:H7 EDL933 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAF
X
SELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENY
X
FPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|83333.1.peg.3023
Escherichia coli K12 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSA
T
FAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWR
QA
VDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|670897.3.peg.4241
Escherichia coli 2362-75 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|216593.1.peg.5061
Escherichia coli E2348/69 (63-1039/1089)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|574521.7.peg.3438
Escherichia coli O127:H6 str. E2348/69 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQL
P
GL
-
GSNSWGSEV
fig|562.371.peg.2378
Escherichia coli 1044A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|562.373.peg.1854
Escherichia coli 1125A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|562.372.peg.2574
Escherichia coli 1212A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|562.374.peg.4649
Escherichia coli 536A (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|83334.1.peg.3932
Escherichia coli O157:H7 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|155864.1.peg.3958
Escherichia coli O157:H7 EDL933 (16-992/1042)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444454.5.peg.3009
Escherichia coli O157:H7 str. EC4024 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444449.5.peg.2468
Escherichia coli O157:H7 str. EC4042 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444448.5.peg.1221
Escherichia coli O157:H7 str. EC4045 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444453.5.peg.2644
Escherichia coli O157:H7 str. EC4076 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444452.5.peg.2348
Escherichia coli O157:H7 str. EC4113 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444450.8.peg.4304
Escherichia coli O157:H7 str. EC4115 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444451.5.peg.974
Escherichia coli O157:H7 str. EC4196 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|444447.5.peg.1378
Escherichia coli O157:H7 str. EC4206 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|478004.5.peg.905
Escherichia coli O157:H7 str. EC4401 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|478005.5.peg.3398
Escherichia coli O157:H7 str. EC4486 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|478006.5.peg.2498
Escherichia coli O157:H7 str. EC4501 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|478007.5.peg.528
Escherichia coli O157:H7 str. EC508 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|478008.5.peg.1277
Escherichia coli O157:H7 str. EC869 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|637388.3.peg.5169
Escherichia coli O157:H7 str. FRIK2000 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|570506.3.peg.773
Escherichia coli O157:H7 str. FRIK966 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|386585.9.peg.4130
Escherichia coli O157:H7 str. Sakai (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|544404.4.peg.4114
Escherichia coli O157:H7 str. TW14359 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|502346.5.peg.3299
Escherichia coli O157:H7 str. TW14588 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDA
I
FENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|358709.5.peg.4567
Escherichia coli 101-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|595496.3.peg.3054
Escherichia coli BW2952 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|536056.3.peg.654
Escherichia coli DH1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|316401.4.peg.3798
Escherichia coli ETEC H10407 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656414.3.peg.3542
Escherichia coli H736 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749538.3.peg.3830
Escherichia coli MS 116-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749540.3.peg.1355
Escherichia coli MS 146-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|749548.3.peg.2481
Escherichia coli MS 196-1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|316407.3.peg.2961
Escherichia coli W3110 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|316385.7.peg.3273
Escherichia coli str. K-12 substr. DH10B (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|511145.12.peg.3170
Escherichia coli str. K-12 substr. MG1655 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|511145.6.peg.3155
Escherichia coli str. K-12 substr. MG1655 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDD
H
GNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHA
R
D
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGH
P
IATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEVLIISRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|656419.3.peg.4048
Escherichia coli M718 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFLPLSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVP
C
DNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYL
V
GK
H
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
R
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDA
N
GNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDV
K
SHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
M
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDF
A
V
------
EQSDGEV
F
IISRTVIAPPVFDFGMRCTYIWRI
T
A
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|405955.13.peg.3505
Escherichia coli APEC O1 (4-980/1030)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWA
N
STYVED
L
DMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
fig|405955.9.peg.2909
Escherichia coli APEC O1 (63-1039/1089)
WENIQLTHENRLAPRAYFFSYDSVAQARTFA
R
ETSSLFL
L
LSGQW
N
FHFFDHPLQVPEAFTSELMADWGHITVPAMWQMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGW
-
QGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEFDISAMVKTGDNLLCVRVMQWA
N
STYVED
L
DMWWSAGIFRDVYL
I
GK
Q
LTHINDF
T
V
R
T
D
FDEAYCDATLSCEVVL
E
NLAASPV
V
TTLEYTLFDGE
H
VVHSSA
-------
IDHLA
--
IEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDADGNVLEVVPQRVGFRDIKVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDVESHGFANVGDISRITDDPQWE
K
VYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYE
-
EDRDA
E
VVDIISTMYTRVPLMNEFGEYP
------------
H
P
KPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDNGNVWYKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHALD
L
TRGELKVENKLWFTTLDDYTLHA
E
VR
A
EGETLATQQIKLRDVAPNSEA
P
LQIT
--
LPQLD
-
AREAFLNI
T
VTKDSRTRYSEAGHSIATYQFPLKENTAQPVP
F
APNNAR
P
LTLED
D
RLSCTVRGYNFAITFSK
T
SGKPTSWQVNGESLLTREPKINFFKPMIDN
----------------
HKQEYEGLWQPNH
--
LQIMQEHLRDFVV
------
EQSD
D
EVL
LV
SRTVIAPPVFDFGMRCTYIWRIAA
D
GQV
N
VALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADSQQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAW
H
YTQENIHAAQHCNELQRSDDITLNLDHQLLGL
-
GSNSWGSEV
Consen1
Primary consensus
MTMITDSLAVVLQRRDWENiqlTheNRLAprayFfSydsvaqARTfa
etSslflpLsGqW
FhfFdhPlqVPEaftselmadwghitVPamWQMeGhgklqYTdegfPfpidvPFVPsdNPTGaYqriFtlsdgW
-
QgkQTlIkFDGVetyFevyvNGqyVGfskgSRLtaEFDiSAmvktGdNlLcVrVmqWaDstYvEDQDMWwsaGIFRDVyLlgK
lThInDF
V
T
FdeaycdAtLscEVvl
nlaaspv
ttleytLfdGE
vVhSsa
-------
IDhla
--
ieklTsasfaftVEqPqqWSAEsPyLYhlVmtLkdAdGnvlEvvpqrVGFRdikvrdGLfwiNnryvmlhGVNRHdndhrkGravgmdrvekDlqLMKQhNiNsVRtaHYPNdPrfYeLCDiYGLfVmaEtdvEsHGfanvgdisRiTDDPqWe
vyvERivRhihaqkNHPSiIIWSLGNESGyGcNirAmYhaaKalDdtRlVhYE
-
edrDa
vvDIIstMYtRVplmneFgeyP
------------
h
kPrIiCEYAHAMGNgpGGlteYqnvFykhdciQGhyVWeWcDhgiqaqDdNGNvWykfGGDyGDyPNnynFCldGLiysDqTPgPgLkEyKqviapvkihald
TrgelkVenklwFttlDdytLHa
Vr
eGetLAtqqikLrDVaPnsea
lqit
--
LPQld
-
areafLni
VtkdsrTrySEAGHsiAtyQfpLkENtaqpvP
Apnnar
LTled
rlsctvrgynfaitFsk
SGkptswqvngeslLtrepkinFfkpmiDN
----------------
hkqeyeGlwQpnh
--
LQimqehLrDfvv
------
eQsdgevLiISrtviappvfdfgmRcTYiwRIaa
Gqv
valsgErygDyPHiipcIGfTmgingeydqVayyGrGPgENYaDsqqAniiDiWrstvdamfenYpFPqnNGnRqhvRwtaltnrhgnGllvvpqrpinFsawrYtQenihaaqHcneLqrsdditLNlDhqllGl
-
GsnSWgseVSAEFQLSAGRYHYQLVWCQK
Consen2
Secondary consensus
pgv
ql
ahpp
a
wrnsee
--
rp
qqlrs
n
e
aw
pa
ea
swlecdlpeadtvv
sn
h
ydapi
nvty
itvnp
e
c
slt
n
des
l
eg
r
i
nsa
hlwc
rw
ygqd
ps
l
flra
e
r
a
m
lr
s
gs
l
rms
s
h
t
q
s
nddfsr
v
ea
qm
g
--
elrd
lrvtvs
wq
q
a
gtapfggei
erggyadrv
---
lrln
n
l
i
n
ra
ve
ht
n
tli
aeacd
evrien
lll
gkpllir
ehhplh
qvmdeqtmvq
il
n
f
a
cs
h
lw
t
r
y
vd
ani
t
mv
---
pmn
l
r
l
ams
vt
mvqrdr
v
h
a
hd
l
rwi
sv
ps
p
q
ggga
t
at
cp
a
dedqp
pav
kwsikkwlslpge
r
l
l
sl
fak
wqa
rqyprl
gf
d
v
qsliky
e
p
say
f
t
drq
mn
vfa
r
h
a
t
a
hqqqffqfrlsg
---
ie
tseyl
rhs
nel
w
a
d
kp
sgevp
-
g
qgkq
ielpe
pestgqlw
tv
vqpna
aw
is
wq
wr
a
lsvtl
shaip
tse
dfcielgnkrw
--
q
nr
flsqmwigdkkq
ltplrdq
trapl
digvseatridpnawverwkaa
hy
aeaal
ctadt
a
aalittahaw
hqdkt
f
-------------
k
--
dg
em
i
vdv
vas
t
par
-
l
cqlaqvaer
nwl
l
q
p
rlt
acf
r
dlplsdiytp
v
se
l
cgt
elnygphqwr
-------
dfq
nish
s
qqlmets
rhl
haeegtw
i
gfhm
ig
dd
sps
Legend: Consensus 1 (when a gap) | Conservative difference | Consensus 2 (when a gap) | Nonconservative difference | Other character