fig|1040638.4.peg.476
Escherichia coli O104:H4 str. LB226692
MSADKQTFALHCEAKNDKVRKRLGIKGGFFW
T
EARKLS
V
A
V
SRC
I
AAMDDAGYDEDDFKKPVRVNFPVVNDLPPEGVFDTEFCNRYEKGGNDGITMMAI
P
F
D
D
N
I
N
G
E
DA
T
TA
GD
DN
D
NLDG
TIPDDVEK
S
E
S
PD
S
D
DD
C
SECE
----
IPVAT
L
SLTHRFLH
L
F
L
FS
KDEDGKYR
HHA
T
P
E
QR
NN
V
I
RME
MDTED
S
Y
LQSL
----------
LTA
VR
AAHH
E
--
LDKLTN
Y
HLSRLA
E
S
VG
KAFPHSANH
RI
SPAEF
DK
F
I
S
T
WMK
T
DY
L
DQGLLTKEWQ
N
GN
Y
VSGITRT
P
SGANAGGGN
I
TDRGEGF
K
HD
K
TSLARDVATGVLARSMDVDIYNLHPAHAKRVEEI
V
SENKPPFSVFRDKFIAMPGGLDYSRAIVVASVKEAPIGIEAIPA
R
VTEYLNKVLTETDH
T
NPDPEIV
E
IACGRSSAPMPQRGTAEGKH
G
DEEKQQ
A
SD
T
MANEQAA
P
ESVE
EIPV
KH
N
E
DTQSL
E
---
N
VSSV
ET
KYQELR
E
EL
N
KAR
E
NIPPKNPVDADKLLAASRGEFVEGISNPADPKWVKGIQ
-
TRDTEDQNQSKVEQIAP
E
AGQNSPDTQQNGPEEQQPGPV
M
-
Q
Q
E
V
EK
V
C
TT
C
S
Q
N
GG
G
H
CPDCG
P
VMGD
E
TY
A
ETF
G
ENDAA
DG
EDSAQTEEKIIQENAVDAAQE
GE
T
VV
Q
N
E
P
GSDTSGDDANSEPVTLDWKRQLVIAAVYGLCANPACIATAPAIPDIA
I
MIANRLENFGGDKS
fig|585397.7.peg.1101
Escherichia coli ED1a
MSADKQTFALHCEAKNDKVRKRLGIKGGFFW
T
EARKLS
V
A
V
SRC
I
AAMDDAGYDEDDFKKPVRVNFPVVNDLPPEGVFDTEFCNRYEKGGNDGITMMAI
P
F
D
D
N
I
N
G
E
DA
T
TA
GD
DN
D
NLDG
TIPDDVEK
S
E
S
PD
S
D
DD
C
SECE
----
IPVAT
L
SLTHRFLH
L
F
L
FS
KDEDGKYR
HHA
T
P
E
QR
NN
V
I
RME
MDTED
S
Y
LQSL
----------
LTA
VR
AAHH
E
--
LDKLTN
Y
HLSRLA
E
S
VG
KAFPHSANH
RI
SPAEF
DK
F
I
S
T
WMK
T
DY
V
DQGLL
A
KEWQKGN
Y
V
T
GITRT
P
SGANAGGGNLTDRGEGF
T
H
NQA
SLARD
I
ATGVLARSMDVDIYNLHPAHAKRVEEI
V
SENKPPFSVFRDKFIAMPGGLDYSRAIVVASVKEAPIGIEAIPA
R
VTEYLNKVLTETDH
S
NPDPEIV
E
IACGRSSAPMPQRGTAE
EI
HDDEEKQQ
T
SDAM
S
NEQAA
P
ESVE
EIPV
KH
NA
DTQSL
E
---
N
VSSV
ET
KYQELR
E
EL
N
KAR
E
NIPPKNPVDADKLLAASRGEFVEGISNPADPKWVKGIQ
-
TRDTEDQNQSKVEQIAP
E
AGQNSPDTQQNGPEEQQPGPV
A
-
Q
P
E
L
EK
N
C
RV
CGQTGG
G
NCPDC
S
AVMGD
S
TY
T
ETF
G
ENDAA
DG
EDSAQTEEKIIQEN
S
VDAAQE
GE
T
VV
Q
N
E
P
GSDTSGDDANSEPVTLDWKRQLVIAAVYGLCANPACIATAPAIPDIA
I
MIANRLENFGGDKS
fig|585397.9.peg.1100
Escherichia coli ED1a
MSADKQTFALHCEAKNDKVRKRLGIKGGFFW
T
EARKLS
V
A
V
SRC
I
AAMDDAGYDEDDFKKPVRVNFPVVNDLPPEGVFDTEFCNRYEKGGNDGITMMAI
P
F
D
D
N
I
N
G
E
DA
T
TA
GD
DN
D
NLDG
TIPDDVEK
S
E
S
PD
S
D
DD
C
SECE
----
IPVAT
L
SLTHRFLH
L
F
L
FS
KDEDGKYR
HHA
T
P
E
QR
NN
V
I
RME
MDTED
S
Y
LQSL
----------
LTA
VR
AAHH
E
--
LDKLTN
Y
HLSRLA
E
S
VG
KAFPHSANH
RI
SPAEF
DK
F
I
S
T
WMK
T
DY
V
DQGLL
A
KEWQKGN
Y
V
T
GITRT
P
SGANAGGGNLTDRGEGF
T
H
NQA
SLARD
I
ATGVLARSMDVDIYNLHPAHAKRVEEI
V
SENKPPFSVFRDKFIAMPGGLDYSRAIVVASVKEAPIGIEAIPA
R
VTEYLNKVLTETDH
S
NPDPEIV
E
IACGRSSAPMPQRGTAE
EI
HDDEEKQQ
T
SDAM
S
NEQAA
P
ESVE
EIPV
KH
NA
DTQSL
E
---
N
VSSV
ET
KYQELR
E
EL
N
KAR
E
NIPPKNPVDADKLLAASRGEFVEGISNPADPKWVKGIQ
-
TRDTEDQNQSKVEQIAP
E
AGQNSPDTQQNGPEEQQPGPV
A
-
Q
P
E
L
EK
N
C
RV
CGQTGG
G
NCPDC
S
AVMGD
S
TY
T
ETF
G
ENDAA
DG
EDSAQTEEKIIQEN
S
VDAAQE
GE
T
VV
Q
N
E
P
GSDTSGDDANSEPVTLDWKRQLVIAAVYGLCANPACIATAPAIPDIA
I
MIANRLENFGGDKS
fig|656417.3.peg.2430
Escherichia coli M605 (1-662/672)
MS
T
DKQ
VYP
L
YY
EAKNDKVRKRLGIKGGF
Y
WAEA
K
KLSIAISR
G
A
V
A
I
DDAGYDEDDFKKPVRVN
L
PVV
D
DLPPEGVFDTEFCNRYEKGG
E
DGITM
VF
IA
P
S
P
S
A
QGK
P
AST
-
--
DNTN
VN
G
--------
-
-
-
--
-
-
E
DM
T
E
I
E
ENMLL
P
I
SG
Q
E
L
PI
R
W
L
-
-
-
-
--
-------
A
Q
H
G
S
E
K
PV
THVSR
N
E
-----
-
-
LQ
A
L
HIARAEELPAV
T
S
LA
VS
H
K
T
SL
LD
P
L
EI
R
D
L
HK
L
V
R
D
TD
R
V
FP
NPG
N
S
--
S
LGL
M
TAFF
E
A
Y
M
D
ADYTD
R
GLLTKEW
M
KGNRVS
R
ITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDVDIYNLHPAHAKRVEEII
A
ENKPPFSVFRDKFI
T
MPGGLDYSRAIVVASVKEAPIGIE
V
IPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQR
V
T
E
E
E
K
Q
DDEEK
L
QPS
C
AMA
D
EQA
T
AE
T
VEPDAT
E
HHQDTQ
P
LD
AQS
QV
N
SVDAKY
L
K
LRAELH
E
ARKNIPPKNPVDADKLLAASRGEFVEGIS
D
P
N
DPKWVKGIQ
-
TRD
S
V
Y
QNQ
P
E
T
E
HNDQ
KA
E
QN
D
P
N
TQQN
E
PE
T
K
QP
E
PV
V
Q
Q
Q
E
T
EK
V
C
TA
CGQ
S
GGDNCPDCGAVMGDATYQETFD
D
E
N
--
--
Q
V
E
V
R
E
N
E
P
E
K
M
E
G
A
EHPH
K
ENA
G
SD
P
H
R
D
C
SD
ET
G
E
A
A
A
S
SLEK
LDWKRQ
V
VIAAVYGLCANPA
G
IA
S
AP
L
IP
G
IAMMIAN
K
LENFG
fig|344610.3.peg.3329
Escherichia coli 53638 (1-636/646)
MS
T
DKQ
VYP
L
YY
EAKNDKVRKRLGIKGGF
Y
WAEA
K
KLSIAISR
G
A
V
A
I
DDAGYDEDDFKKPV
H
VN
L
PVV
D
DLPPEGVFDTEFCNRYEKGG
E
DGITM
VF
IA
S
S
P
S
V
Q
D
K
P
AST
-
--
DNTN
VN
G
--------
-
-
-
--
-
-
E
DM
T
E
I
E
ENMLL
PV
SG
Q
E
L
PI
R
W
L
-
-
-
-
--
-------
A
Q
H
G
S
E
K
PV
THVSR
DG
-----
-
-
LQ
A
L
HIARAEELPAV
TALA
VS
H
K
T
SL
LD
P
L
EI
R
D
L
HK
L
V
R
D
TDK
V
FP
NPG
N
S
--
S
LGLI
TAFF
E
A
YLN
ADYTD
R
GLLTKEW
M
KGNRVS
H
ITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMD
L
DIYNLHPAHAKR
I
EEII
A
ENKPPFSVFRDKFI
T
MPGGLDYSRAIVVASVKEAPIGIE
V
IPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQR
V
T
E
EGK
Q
DDEEK
P
QPS
C
AMA
D
EQA
T
AE
T
VEPDAT
E
HHQDTQ
P
LD
AQS
QV
NA
VDAKYQELRAELH
E
ARKNIPPKNPVDADKLLAASRGEFVEGIS
D
P
N
DPKW
IP
G
HHISSNEVKKTENE
V
R
Q
TED
K
QH
QN
------
SE
PEE
GT
P
---
-
-
-
-
-
-
--
-
-
--
C
N
QTG
E
DNCPDCGAVMGDATYQETFDE
KK
--
--
P
D
E
AQ
E
EE
PKKT
E
K
A
D
D
LLP
ENA
G
SDQH
N
D
SNNET
G
E
------
DE
L
N
WK
K
Q
VL
IAAVYGLCANPACIA
Y
APAIPDIAMMIAN
K
LENFG
fig|344610.7.peg.4102
Escherichia coli 53638 (1-636/646)
MS
T
DKQ
VYP
L
YY
EAKNDKVRKRLGIKGGF
Y
WAEA
K
KLSIAISR
G
A
V
A
I
DDAGYDEDDFKKPV
H
VN
L
PVV
D
DLPPEGVFDTEFCNRYEKGG
E
DGITM
VF
IA
S
S
P
S
V
Q
D
K
P
AST
-
--
DNTN
VN
G
--------
-
-
-
--
-
-
E
DM
T
E
I
E
ENMLL
PV
SG
Q
E
L
PI
R
W
L
-
-
-
-
--
-------
A
Q
H
G
S
E
K
PV
THVSR
DG
-----
-
-
LQ
A
L
HIARAEELPAV
TALA
VS
H
K
T
SL
LD
P
L
EI
R
D
L
HK
L
V
R
D
TDK
V
FP
NPG
N
S
--
S
LGLI
TAFF
E
A
YLN
ADYTD
R
GLLTKEW
M
KGNRVS
H
ITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMD
L
DIYNLHPAHAKR
I
EEII
A
ENKPPFSVFRDKFI
T
MPGGLDYSRAIVVASVKEAPIGIE
V
IPAHVTEYLNKVLTETDHANPDPEIVDIA
Y
GRSSAPMPQR
V
T
E
EGK
Q
DDEEK
P
QPS
C
AMA
D
EQA
T
AE
T
VEPDAT
E
HHQDTQ
P
LD
AQS
QV
NA
VDAKYQELRAELH
E
ARKNIPPKNPVDADKLLAASRGEFVEGIS
D
P
N
DPKW
IP
G
HHISSNEVKKTENE
V
R
Q
TED
K
QH
QN
------
SE
PEE
GT
P
---
-
-
-
-
-
-
--
-
-
--
C
N
QTG
E
DNCPDCGAVMGDATYQETFDE
KK
--
--
P
D
E
AQ
E
EE
PKKT
E
K
A
D
D
LLP
ENA
G
SDQH
N
D
SNNET
G
E
------
DE
L
N
WK
K
Q
VL
IAAVYGLCANPACIA
Y
APAIPDIAMMIAN
K
LENFG
Consen1
Primary consensus
MSaDKQtfaLhcEAKNDKVRKRLGIKGGFfWaEArKLSiAiSRcaaAmDDAGYDEDDFKKPVrVNfPVVnDLPPEGVFDTEFCNRYEKGGnDGITMmaIafsdsiqgkdAsTa
--
DNtNldG
--------
s
-
s
--
s
-
dDmsEcE
----
iPVatqsLthRfLh
-
f
-
fs
-------
ahHaspkqrthVsRme
-----
s
-
LQsL
----------
lTAlaaaHht
--
LDkLtnrhLsrLarstdKaFPhsaNh
--
SpaeftaFfsawmkaDYtDqGLLtKEWqKGNrVsgITRTaSGANAGGGNLTDRGEGFvHdltSLARDvATGVLARSMDvDIYNLHPAHAKRvEEIisENKPPFSVFRDKFIaMPGGLDYSRAIVVASVKEAPIGIEaIPAhVTEYLNKVLTETDHaNPDPEIVdIACGRSSAPMPQRgTaEgkhDDEEKqQpSdAManEQAaaEsVEpdatkHhqDTQsLd
---
qVssVdaKYQELRaELhkARkNIPPKNPVDADKLLAASRGEFVEGISnPaDPKWvkGiq
-
trdtedqnqskVeQiapkagQNspdtqqngPEEqqPgpv
-
q
e
ek
c
CgQtGgdNCPDCgAVMGDaTYqETFdEndaa
--
eDsAQtEEkiiqEnavDaaqEnatsdQhedgsdtsGddansepvtLdWKrQlvIAAVYGLCANPACIAtAPAIPDIAmMIANrLENFGGDKS
Consen2
Secondary consensus
t
vyp
yy
y
t
k
v
v
giv
i
h
l
d
e
vf
psdpnvndep
t
-
gd
d
vn
tipddvek
-
e
-
pd
-
de
ct
i
enmll
sgle
pi
w
-
l
-
l
--
kdedgkyrq
gteepvnn
i
dgmdted
-
y
a
hiaraeelpav
vrvs
kesl
p
eiyd
hk
vedvg
v
npg
sri
lglidk
ietylnt
v
r
a
m
y
th
p
t
nqa
i
l
i
va
t
v
r
s
e
v
e
eiq
p
t
c
sd
tp
t
eipve
na
p
eaqsn
na
et
e
ne
e
d
n
ip
hhissnevkktene
r
tedeqh
------
se
gt
---
-
-
--
-
n
eg
s
s
t
g
kk
--
dgp
e
e
pkkt
ksd
llp
gegvv
nnpsnnet
e
------
de
n
k
vl
y
i
k
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character