|
|
|
|
|
|
[1]
|
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Copeland A.,
Lucas S.,
Lapidus A.,
Glavina del Rio T.,
Dalin E.,
Tice H.,
Bruce D.,
Goodwin L.,
Pitluck S.,
Kiss H.,
Brettin T.,
Detter J.C.,
Han C.,
Kuske C.R.,
Schmutz J.,
Larimer F.,
Land M.,
Hauser L.,
Kyrpides N.,
Mikhailova N.,
Ingram L.,
Richardson P.;
"Complete sequence of Escherichia coli C str. ATCC 8739.";
Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
|
|
|
|
|
|
|
|
|
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Length: 457 AA [This is the length of the unprocessed precursor] |
Molecular weight: 49951 Da [This is the MW of the unprocessed precursor] |
CRC64: DDDE0DC90C5CF2F4 [This is a checksum on the sequence] |
|
10 20 30 40 50 60
MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLDAGARLTV NALAFIPQFT AWADAGMLTL
70 80 90 100 110 120
VEGPFDESLL DTCWLAIAAT DDDALNQRVS EAAEARRIFC NVVDAPKAAS FIMPSIIDRS
130 140 150 160 170 180
PLMVAVSSGG TSPVLARLLR EKLESLLPLH LGQVAKYAGQ LRGRVKQQFA TMGERRRFWE
190 200 210 220 230 240
KLFVNDRLAQ SLANNDQKAI TETTEQLINE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ
250 260 270 280 290 300
QADVVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK
310 320 330 340 350 360
GGDPFIFGRG GEELETLCNA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLITGHL
370 380 390 400 410 420
KTGGELDWEN LAAEKQTLVF YMGLNQAATI QQKLIEHGMP GEMPVAIVEN GTAVTQRVID
430 440 450
GTLTQLGELA QQMNSPSLII IGRVVGLRDK LNWFSNH
|
B1IP96 in FASTA format |
|