fig|1040638.4.peg.5505
Escherichia coli O104:H4 str. LB226692
MFR
L
PT
P
RLFS
G
LKSALRPAMPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|6666666.5357.peg.2172
Escherichia coli TY-2482
MFR
L
PT
P
RLFS
G
LKSALRPAMPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|585055.6.peg.222
Escherichia coli 55989
MFR
L
PT
P
RLFS
G
LKSALRPAMPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|585055.8.peg.222
Escherichia coli 55989
MFR
L
PT
P
RLFS
G
LKSALRPAMPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|585034.4.peg.221
Escherichia coli IAI1
MFR
L
PT
P
RL
L
S
G
LKSALRPAMPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVK
E
LN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|331111.12.peg.553
Escherichia coli E24377A
MPRFK
I
SA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|573235.3.peg.224
Escherichia coli O26:H11 str. 11368
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQF
I
TRYGLQSYFVKQRDELVELTAMDSWVLNLTR
N
VKYSDADRAEIQ
H
QLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|331112.6.peg.222
Escherichia coli HS
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKALN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|409438.11.peg.338
Escherichia coli SE11
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKALN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|656443.3.peg.368
Escherichia coli TA271
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKALN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|585034.5.peg.221
Escherichia coli IAI1
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVK
E
LN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|679206.4.peg.3181
Escherichia coli MS 119-7
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVK
E
LN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|679204.3.peg.4970
Escherichia coli MS 145-7
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
A
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVK
E
LN
A
APPESEEKLAVLRVMRMLEDKSGRNN
Q
VVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|595495.4.peg.4437
Escherichia coli KO11
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGD
L
PLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|566546.3.peg.4470
Escherichia coli W
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGD
L
PLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|566546.4.peg.216
Escherichia coli W
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGD
L
PLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|656414.3.peg.346
Escherichia coli H736
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQR
Y
LDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
R
T
SLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|749538.3.peg.2641
Escherichia coli MS 116-1
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQR
Y
LDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
R
T
SLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|670888.3.peg.804
Escherichia coli 1827-70
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQR
Y
LDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
R
T
SLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAE
H
QAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|749548.3.peg.5086
Escherichia coli MS 196-1
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQR
Y
LDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
R
T
SLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
A
S
PESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEF
T
PENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|550676.3.peg.16
Escherichia coli B185
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEE
H
WLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
I
DP
F
SVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLL
H
EGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLV
G
TAPYFTRSLFPQALLAEPNLATESRAWL
M
RSRRRLTVFS
T
TGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
EALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|340184.6.peg.2207
Escherichia coli B7A
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
M
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGD
L
PLQRALTVLRDNTQPGVFSEKLSAKEREEALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDN
S
VIIREDIIAQL
K
TAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTL
V
G
A
N
G
G
APRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|656419.3.peg.389
Escherichia coli M718
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQQLEK
M
QKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWY
I
VIGPAGSGKTTLLREGFPSDIIYAPEGARG
T
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQL
Q
VYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
M
RSRRRLTVFS
T
TGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQP
S
VFSEKLSAKER
D
EALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|749531.3.peg.4578
Escherichia coli MS 69-1
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVR
M
MKRLQQLEK
M
QKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESR
T
WL
M
RSRRRLTVFS
T
TGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPP
L
GEDD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPV
A
T
AQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
E
T
LAEPDYQLLTRLGHEFAPENSTL
T
VQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNP
H
SAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTL
V
G
AR
G
G
APRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|701177.3.peg.223
Escherichia coli O55:H7 str. CB9615
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIALVWLTVRVMKRLQ
L
LEKQQKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
T
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKR
H
REHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMV
T
QTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
M
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
EALAEPDYQLLTRLGHEFAPEN
I
TLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|344601.5.peg.2035
Escherichia coli B171
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEKQQKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEG
S
RG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEH
T
LGWLKEKRARQPLNGIILTLDLPDLLTADKRRRE
Y
LLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESR
D
WL
M
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQG
K
DD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRW
M
PYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
EALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDN
S
VIIREDIIAQL
K
TAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVD
A
GAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|585395.4.peg.221
Escherichia coli O103:H2 str. 12009
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEKQQKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEG
S
RG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEH
T
LGWLKEKRARQPLNGIILTLDLPDLLTADKRRRE
Y
LLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESR
D
WL
M
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQG
K
DD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRW
M
PYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
EALAEPDYQLLTRLGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDN
S
VIIREDIIAQL
K
TAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVD
A
GAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|340185.4.peg.1624
Escherichia coli E22
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEKQQKQQREEA
I
DPLSVELNAQQRYLDRWLLRLQRHLDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEG
S
RG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEH
T
LGWLKEKRARQPLNGIILTLDLPDLLTADKRRRE
Y
LLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
RASLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESR
D
WL
M
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQG
K
DD
Y
GNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRW
M
PYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKER
D
EALAEPDYQLLT
L
LGHEFAPENSTLAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRLADQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDN
S
VIIREDIIAQL
K
TAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVD
A
GAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|749540.3.peg.1995
Escherichia coli MS 146-1
MPRFKVSA
F
WLL
I
LAWIFLLVWIWWKGP
T
WTLYEEQWLKPLANRWLATAAWGIIAL
M
WLTVRVMKRLQQLEK
M
QKQQREEA
V
DPLSVELNAQQRYLDRWLLRLQR
Y
LDNRRFLWQLPWYMVIGPAGSGKTTLLREGFPSDIIYAPEGARG
A
EQRLYLTPHVGKQAVIFDIDGTLCAPADADILHRRLWEHALGWLKEKRARQPLNGIILTLDLPDLLTADKRRREHLLQ
T
LRSRLQDIRQHLHCQLPVYVVLTRLDLL
Q
GFAALFQSLNRQDRDAILGVTFTRRAHENDDWRTELNAFWQTWVDRMNLALPDLMVAQTHT
--
R
T
SLFSFSRQMQGSREPLVSLLEGLLDGENMNVMLRGVYLTSSLQRGQMDDIFTQSAARQYRLGNNPLASWPLVDTAPYFTRSLFPQALLAEPNLATESRAWL
I
RSRRRLTVFSATGGVAALLLITGWHHYYNGNYQSGITVLKQAKAFMDVPPPQGEDDFGNLQLPLLNPVRDATLAYGDWGDRSRLADMGLYQG
R
RIGPYVEQTYLQLLEQRYLPSLFNGLVKA
M
N
A
APPESEEKLAVLRVMRMLEDKSGRNNEVVKQYMAKRWSEKFHGQRDIQAQLM
S
HLDYALAHTDWHAERQAGDGDAISRWTPYDKPVVSAQ
K
ELSKLPVYQRVYQSLKTRALGVLPADLNLRDQVGPTFDQVFTSADDNKLVVPQFLTRYGLQSYFVKQRDELVELTAMDSWVLNLTRSVKYSDADRAEIQRQLTEQYISDYTATWRAGMDNLNIRNFESI
G
QLTGALEQVISGDQPLQRALTVLRDNTQPGVFSEKLSAKEREEA
----------------------
LAVQKDKESTMQAVYQQLTELHRYLLAIQNAPVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGRL
T
DQAWHVVMVEAVHYMEVDWRDSVVKPFNEQLANNYPFNPRSAQDASLDAFERFFKPDGILDTFYQQNLKLFIDNDLSLEDGDNNVIIREDIIAQLETAQKIRDIFFSKQNGLGTSFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSGNAPRSISFSGPWAQFRLFGAGQLTGVQDGNFTVRFSVDGGAMTYRVHTDTEDNPFSGGLFSQFGLSDTLY
fig|216592.1.peg.5070
Escherichia coli 042
MFRFPTSRLFSTL
R
SALRPAMPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|364106.7.peg.358
Escherichia coli UTI89
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|364106.8.peg.356
Escherichia coli UTI89
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|405955.9.peg.193
Escherichia coli APEC O1
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
T
V
QKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|340197.3.peg.1542
Escherichia coli F11
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|340197.5.peg.1633
Escherichia coli F11
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|749550.3.peg.3575
Escherichia coli MS 200-1
MF
K
FPTSRLFSTLKSALRPAMPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VE
P
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|216592.3.peg.225
Escherichia coli 042
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|656437.3.peg.285
Escherichia coli TA143
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|714962.3.peg.224
Escherichia coli IHE3034
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|585035.6.peg.227
Escherichia coli S88
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|869729.3.peg.3471
Escherichia coli UM146
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|405955.13.peg.226
Escherichia coli APEC O1
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
T
V
QKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|656379.3.peg.811
Escherichia coli FVEC1302
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQ
Y
SSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|656380.3.peg.714
Escherichia coli FVEC1412
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQ
Y
SSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|749549.3.peg.3850
Escherichia coli MS 198-1
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQ
Y
SSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|585056.7.peg.405
Escherichia coli UMN026
MPRF
R
VSA
A
WLL
A
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PL
T
NRWLATA
V
WG
L
IAL
I
WLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
SA
LLREGFPSDIIY
T
PE
SI
RG
T
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
S
P
GGD
D
L
LHRRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
T
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
DDWR
S
EL
G
AFWQTWV
QQV
NLAL
S
DLM
L
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FSA
C
G
AAL
A
A
LL
VGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQL
S
LLNPVRDATLAYGD
YR
DR
GF
LADMGLYQG
V
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
N
APPESEEKLAVLRV
L
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
A
HLDYAL
E
HTDWHA
Q
RQ
S
GD
S
DA
V
SRWTPYDKPV
IN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQ
Y
SSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
AP
H
SI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|585397.7.peg.226
Escherichia coli ED1a
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WT
I
YE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
L
S
WL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|585397.9.peg.226
Escherichia coli ED1a
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WT
I
YE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
L
S
WL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPYDKP
IIN
AQ
Q
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYFVKQR
EG
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQYISDYTATWRAGMDNLN
V
R
DY
E
A
M
S
A
LT
D
ALEQ
I
ISGDQP
F
QRALT
A
LRDNT
HALTL
S
G
KL
DD
K
A
RE
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PVPGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GR
V
L
IREDI
RQ
QL
D
TAQKIRDIFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESKLTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
Y
Q
VH
V
DTEDNPF
T
GGLFS
L
F
R
L
P
DTLY
fig|753642.3.peg.1485
Escherichia coli NC101
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVPPPQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNPR
AT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|362663.8.peg.228
Escherichia coli 536
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|362663.9.peg.228
Escherichia coli 536
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
R
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQVGPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
fig|685038.3.peg.219
Escherichia coli O83:H1 str. NRG 857C
MPRFKVSA
T
WLL
T
LAWIFLLVWIWW
Q
GP
K
WTLYE
QH
WL
A
PLANRWLATA
V
WG
L
IALVWLT
W
RVMKRLQ
K
LEKQQKQQREE
E
K
DPL
T
VEL
HR
QQ
Q
YLD
H
WLLRL
R
RHLDNRR
Y
LWQLPWYMVIGPAGSGK
S
TLLREGFPSDI
V
Y
T
PE
SI
RG
V
E
YHPLI
TP
R
VG
N
QAVIFD
V
DG
V
L
T
T
P
GGD
D
L
L
-
RRL
R
EH
W
LGWL
MQT
RARQPLNG
L
ILTLDLPDLLTADK
S
RRE
T
L
V
Q
N
LR
QQ
LQ
E
IRQ
S
LHC
R
LPVYVVLTRLDLL
N
GFAALF
H
SL
DKK
DRDAILGVTFTRRAHE
S
D
G
WR
S
EL
G
AFWQTWV
QQV
NLAL
S
DL
VL
AQT
GAAP
R
SAV
FSFSRQMQG
TG
E
IVTA
LL
AA
LLDGENM
D
VMLRGV
W
LTSSLQRGQ
V
DDIFTQSAARQY
G
LGN
SS
LA
T
WPLV
E
T
T
PYFTR
R
LFP
EV
LLAEPNLA
G
E
NSV
WL
N
S
SRRRLT
A
FS
TC
G
AAL
A
A
L
MVGS
WHHYYN
Q
N
W
QSG
VN
VL
A
QAKAFMDVP
L
PQG
T
D
E
FGNLQLPLLNPVRDATLAYGD
YR
D
HGF
LADMGLYQG
A
R
V
GPYVEQTY
I
QLLEQRYLPSL
M
NGL
IRD
LN
I
APPESEEKLAVLRV
V
RM
M
EDKSGRNNE
A
VKQYMA
R
RWS
NE
FHGQRDIQAQLM
V
HLDYAL
E
HTDWHA
Q
RQ
SS
D
S
DA
V
SRWTPY
N
KPV
IN
AQ
H
ELSKLP
I
YQRVYQ
T
L
R
T
K
AL
S
VLPADLNLRDQ
I
GPTFD
N
VF
VAGN
D
E
KLV
I
PQFLTRYGLQSYF
I
KQ
H
D
G
LVELTA
L
DSWVLNLT
Q
SV
A
YS
E
ADR
E
EIQR
HI
TEQY
L
SDYTATWRAGMDNLN
V
R
DY
E
T
M
P
A
LT
D
ALEQ
I
ISGDQP
FL
RALT
A
LRDNT
HALTL
S
G
KL
DD
K
AK
E
A
A
IN
E
M
DY
R
LL
S
RLGHEFAPENS
A
L
EE
QKDK
A
ST
L
QAVYQQLTELHRYLLAIQN
S
PV
S
GKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVG
K
LADQAWHVVMVEAV
R
YMEVDWRD
N
VVKPFNEQLA
D
NYPFNP
HAT
QDASLD
S
FERFFKPDGILD
N
FY
KN
NL
R
LF
LE
NDL
TFG
D
-
D
GRML
IREDI
RQ
QL
D
TAQKIR
N
IFFS
Q
QNGLG
AQ
FAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNES
Q
LTLIGTSG
R
APRSI
A
FSGPWAQFRLFGAGQLT
N
V
TSDT
F
N
VRF
N
VDGGAM
V
YRVH
V
DTEDNPF
T
GGLFSQF
R
L
P
DTLY
Consen1
Primary consensus
MFrfPTsRLFStLKSALRPAMPRFkVSA
WLL
LAWIFLLVWIWWkGP
WTLYEeqWLkPLaNRWLATAaWGiIALvWLTvRVMKRLQqLEKqQKQQREEa
DPLsVELnaQQrYLDrWLLRLqRHLDNRRfLWQLPWYMVIGPAGSGKttLLREGFPSDIiYaPEgaRG
EqrlylTPhVGkQAVIFDiDGtLcaPadaDiLhRRLwEHaLGWLkekRARQPLNGiILTLDLPDLLTADKrRREhLlQ
LRsrLQdIRQhLHCqLPVYVVLTRLDLL
GFAALFqSLnrqDRDAILGVTFTRRAHEnDdWRtELnAFWQTWVdrmNLALpDLmvAQTht
--
RaslFSFSRQMQGsrEplvsLLegLLDGENMnVMLRGVyLTSSLQRGQmDDIFTQSAARQYrLGNnpLAsWPLVdTaPYFTRsLFPqaLLAEPNLAtEsraWL
rSRRRLTvFSatGgvaAlLlitgWHHYYNgNyQSGitVLkQAKAFMDVPpPQGeDdfGNLQLpLLNPVRDATLAYGDwgDrsrLADMGLYQG
RiGPYVEQTYlQLLEQRYLPSLfNGLvkalN
APPESEEKLAVLRVmRMlEDKSGRNNevVKQYMAkRWSekFHGQRDIQAQLM
HLDYALaHTDWHAeRQagDgDAiSRWTPYdKPvvsAQ
ELSKLPvYQRVYQsLkTrALgVLPADLNLRDQVGPTFDqVFtsadDnKLVvPQFLTRYGLQSYFvKQrdeLVELTAmDSWVLNLTrsVkYSdADRaEIQRqlTEQYiSDYTATWRAGMDNLNiRnfEsi
qLTgALEQvISGDQPlqRALTvLRDNTqpgvfSeKLsaKereeAlaEpDYqLLtRLGHEFAPENStLavQKDKeSTmQAVYQQLTELHRYLLAIQNaPVpGKSALKAVQLRLDQNSSDPIFATRQMAKTLPAPLNRWVGrLaDQAWHVVMVEAVhYMEVDWRDsVVKPFNEQLAnNYPFNPrsaQDASLDaFERFFKPDGILDtFYqqNLkLFidNDLsleDgDnnviIREDIiaQLeTAQKIRdIFFSkQNGLGtsFAVETVSLSGNKRRSVLNLDGQLVDYSQGRNYTAHLVWPNNMREGNESkLTLIGTSGnAPrSIsFSGPWAQFRLFGAGQLTgVqdgnFtVRFsVDGGAMtYrVHtDTEDNPFsGGLFSqFgLsDTLY
Consen2
Secondary consensus
kl
p
g
r
q
qh
a
t
v
l
w
k
m
e
t
hr
q
h
r
y
sa
v
t
si
yhpli
r
n
v
v
t
ggd
l
r
r
w
mqt
l
s
t
v
qq
e
s
r
h
dkk
s
g
s
g
qqv
s
vl
gaap
sav
tg
ivta
aa
d
w
v
g
ss
t
e
t
r
ev
g
nsv
s
a
tc
aal
a
mvgs
q
w
vn
a
l
t
ey
s
yr
hgf
v
i
m
irdm
m
qa
r
ne
e
q
ss
s
v
n
iin
i
t
r
k
s
n
vagn
e
i
i
heg
l
qn
a
e
e
hi
l
v
dy
m
a
d
i
fl
a
haltl
g
dd
akda
in
m
r
s
a
ee
a
l
s
s
k
t
r
n
d
hat
s
n
kn
r
le
tfg
-
grml
rq
d
n
q
aq
q
r
h
a
n
tsdt
n
n
v
q
v
t
l
r
p
Consensus 1
(when a gap)
Conservative difference
Consensus 2
(when a gap)
Nonconservative diff.
Other character