Home  |  Contact

ProtParam

User-provided sequence:

        10         20         30         40         50         60 
MVGERYRSHH AVDPSDAQEK EVLDHYELNR EKYNDERDNV QKKTFTKWVN KHLSKTDHKI

70 80 90 100 110 120
DDLFVDLRDG YALIALLEAL TGERIQKENG YTRFHRIQNV QYCLDFLKKK NIKLVNIRPE

130 140 150 160 170 180
DIVEGNGKLT LGLIWTIILN FQVSVIRQRL LLESSQHEQM SAKHTTTNSQ VSLHGSDATS

190 200 210 220 230 240
ARDALLQWAR RVTAGYPRVN VNNFSSSWRD GLAFNAILHR YRSSAIDWNK ISSDSVSNTE

250 260 270 280 290 300
RLNNAFAAAD REFGVERLLD AEDVDTNNPD EKSIITYVSS LYNALPHEPE MSRLQKVQEE

310 320 330 340 350 360
YIEEAYEWRE WVVRAIQLVD DRHLQGTASE LIYELQRFRE DDLPPREEQK RRLTLVYEHL

370 380 390 400 410 420
EKVMRSTELF AIPHELSAPE LQRVWNELQN SIDRRFDVLE RHRIQEGNSN DLLSRLARGI

430 440 450 460 470 480
GITNEKLDLI LKRIEDVEAR VDTSPPAAVE RTVSEIVDDL NALESPIARF FEDVEELKSM

490 500 510 520 530 540
QHPEANDFYK QVYGLHQRRT TYLDRLTNQI LVRLGVRTDS LHKENQQRLE NMRKTSFSRV

550 560 570 580 590 600
EECIEWVRVR MEKLTTMEFL EDLETLEHVF EQHKFDNRDI QDFRQNVDEC IARQAEVSAE

610 620 630 640 650 660
DTYEYCELLR VLESEYQQLR DLSAGRMLDL DSLIAFVRAA QLELIWVSER ESIEVTRNWS

670 680 690 700 710 720
DIKQLDLPML TNYYKQLLHE MELREKQYND VHNQGAALLN QGHPAIRVIE VYLRQMQSQW

730 740 750 760 770 780
DWLLALSKCL EEHLRDALNL KSFMEEASDA EAWIQEQSVR LENNYNRTDF SLEEGERFLR

790 800 810 820 830 840
ELDEIKEILN KYHQVLMALT ERCASISPLW QRGERIPHPI KVTALCDYSD ENVTIKAGDD

850 860 870 880 890 900
VYLLDNSDLI KWTIRDISGA EGQVPSVVFR IPPTDARLTA LLNRLLQQFE KLKKLWDKKH

910 920 930 940 950 960
RMVRFNMVLN TMRTIQGWDL DTFNSIDPDQ RDAIMKALND DANKLLSELD PNDPLALRLR

970 980 990 1000 1010 1020
EELRRTNEHF WNLLNASQKP PEPDWASQYD QKMAELLKKL EEAWRELNDA VGKPISRSPE

1030 1040 1050 1060 1070 1080
DLERVIHAHK RFEDALQALD SDVANVKELF RQLPNPTPTQ RVNHDRLNGL WDDLWDLSRM

1090 1100 1110 1120 1130 1140
YVERIKVLES VLNGMVEVAD IVRQHEITLN SFDDLPAALD KLRGHHSQLL EINMVLKVSK

1150 1160 1170 1180 1190 1200
LGAQNEHFSQ QQTVIDQLNR NVALLRQHVS RTRINEGHHP DVDAIEDEVQ KLNVRWENVN

1210 1220 1230 1240 1250 1260
SQIASRLLAV ESALQIQMVY RSEYETEMSW LDTVEETINR LRKPEELRPE QYQQQLDMLI

1270 1280 1290 1300 1310 1320
AEYTNLQEHT QAIEHVNKEG GRFIHEAKIF DAKLGQYSDG IVGIHGPGIK SEFRRTKPQP

1330 1340 1350 1360 1370 1380
KNGSQIVTEE LELLNRRFAQ LSSLILERRN TMQVLIQNWK RQKQENVTQV VSFREAEVSG

1390 1400 1410 1420 1430 1440
MMTDLTRFRQ EIFTTHLTFN SNPESIDAAT KNVQNVKQSL DSWRDRIKER LDEIDRLCTE

1450 1460 1470 1480 1490 1500
EGDSLTPEQY SALREMRRQL ADEYDTVLRT VEGIHTRLNI LSALLIEFSS VTSSMQSWMT

1510 1520 1530 1540 1550 1560
DRTRLAGDIR HKSGDPMRID EARFEAKSLM DEVIREESRL KTIGASVLKI EQEISAMRDD

1570 1580 1590 1600 1610 1620
VRASGSTDDV GISVDEVYET RRRVEDDYMQ LLRQCQDLIS FQNRLHAMND EHSEQARRAD

1630 1640 1650 1660 1670 1680
EWLQMLQNDV EDVDQDPRFQ RDEDRIQRIE ELNRMAAGGS SQLDDAEQAS RRLLTALEGT

1690 1700 1710 1720 1730 1740
NVANDVRARH EELANLRRGK HQKVIDRLSQ NMMEAASRKA EAEGVKQAVE NLRQWSEQTA

1750 1760 1770 1780 1790 1800
QRTRQPVQLP LTELDLHEAR KDEQVLHGEI ENRLALIEEL EKKAADVGDH ASLAELQECK

1810 1820 1830 1840 1850 1860
MKLKRSNSDL KGLRDNIFDA INGLQTVNSE GETLSRAVDS AGAKIRSARL PEAQSEVEAL

1870 1880 1890 1900 1910 1920
QDQADNLERI TNNLCNIPNV TRTEPVIQKS KDLRKRVDSC AQELDARMGK LAELESLDAE

1930 1940 1950 1960 1970 1980
FDGAKNKLSS FIGAFDDELK GLEKVSIDKE KLAEQRRQTQ DLVDKHSEGN AILDDVEAIA

1990 2000 2010 2020 2030 2040
QKVTAEDPSK TGSAQKSVGE LGARLQRQAS ELKARGDKIN KLDSKATSFA ESEAAVLGYI

2050 2060 2070 2080 2090 2100
EKQKDQLSTG FPVPATKEGV KSQLLDLERM NKTGKEEQRR VDDARHSARE LAREASVEKE

2110 2120 2130 2140 2150 2160
VQDMNQREKK LLDEWEDLAD QFDAVRSRAN KAEQVLNECA QMEKYIGAKK NMLEGIGAPS

2170 2180 2190 2200 2210 2220
TEPGVAKANR AQIQSMKAET EGEKSALEHV NSLANELIAD GGANVEELMK KMDRLNRKWH

2230 2240 2250 2260 2270 2280
SLESGLDENA GRVEEAAKLG QELKDIQKEL RKELGELESN VEKASAMSSN DIGDQLATLD

2290 2300 2310 2320 2330 2340
SLKSRFGGVD KALEKLKGIL EATEELEVDA TNRAEIQEQL ETTQKKADEL ERKIENVKKA

2350 2360 2370 2380 2390 2400
ALNAQNEGLE LEKKLDELIG TVNSAENELE LAAPIAAESL KLADELKRAE ELFQKLIENE

2410 2420 2430 2440 2450 2460
GDVSLIRAKV AEELKKKPDA ELKKKLELLY QKWPKALGAA RDRKDLVSKA GDLVKQFGDQ

2470 2480 2490 2500 2510 2520
VQALEQRLQG DQAELDELLA SDKAHDPEVC DALKLVELTM ARRLADVDAL NAVMNRIESS

2530 2540 2550 2560 2570 2580
APGPDANRLR RRADTLSDDA KGMAKKARTA ADLAQRKQGL AKKFERLCDE VSQFTENQKA

2590 2600 2610 2620 2630 2640
EIQDAIEKDL LNAERVQSKL NKIDDFWSSN SRELKNVGDE IKIDATPEDA QAVDTKLAEL

2650 2660 2670 2680 2690 2700
QAGIDGLLAT LQEQNVHLEE KREQANRVQS ESQKAAGKIN SLVAEIADLD PIGRSRDELQ

2710 2720 2730 2740 2750 2760
KQKKEVVELA GDLGSAQTKM LELGAEWEAA LGAGIVAQPV FEMNRAATDE LNKLAARAGK

2770 2780 2790 2800 2810 2820
RLAQREKKIT ETEDEIDKLH ADADQIVGAL EAIAKDEALQ GAPSQLLDPK QVSEKVRQLK

2830 2840 2850 2860 2870 2880
ESLKPVGEKM DAFNTDCKLL IKTAGPESDT KELDSLLKKV GDAYSDVVGK VSDKEMSVDA

2890 2900 2910 2920 2930 2940
AVQQQGKVED AYRALLNWLE ETEEMMENRK KPSADAKVAK AQLHDYEVLM KHVEDKKPSV

2950 2960 2970 2980 2990 3000
DGFKAMIEKI VAEASSDEEK KALGNKNAQI EDRYKDLLNS AVDRQRKLLD AVDLAERLQE

3010 3020 3030 3040 3050 3060
VTIPLDSWLQ SADKRLQALA KVPITVEKAE EMIGEQEALQ DELEHKSDDL KDVLEIAPML

3070 3080 3090 3100 3110 3120
ASLVSVEDAN SISGQVNQLE ARARALDAGI TNMRPLLESF LQQIQDFTLD AEDMTQFVGE

3130 3140 3150 3160 3170 3180
TEVKLGELDE LPIEPDDLVE QTNILAEIAV SIADRDEMMA NIFEVGKQLA IQGEPEEALI

3190 3200 3210 3220 3230 3240
AQKKLDDLKF RYADLMTSAD EKIALLAKAI PLSEGFHEGF DTVMQVLEDM DRDLQTIDEE

3250 3260 3270 3280 3290 3300
DPETQAELIF LLEEDISQKM RPSVDELTAL SNQLQVLCSA DKADELQTNT IAMNKLVNSV

3310 3320 3330 3340 3350 3360
ADRVARRAER IEMASKQSRA VLDDLQYLIE WFSAARERIL EGAPPSLDLE VLKSQLKHQR

3370 3380 3390 3400 3410 3420
ITNEEASANK VQFRNVAGEA KKVARQLGME GNEANEKISD TVDEGKELVE EVMALCADRT

3430 3440 3450 3460 3470 3480
ETLERALALM EQLTSQFDEL NKWLDQMDAE LQASPSVTTA TPAAELREMH DHNEELARMV

3490 3500 3510 3520 3530 3540
AAYRPIIEGF KSDVGSLHEV LAEDQAPLLE SVAGELVQGY EEVREAVRAR GHAIDNMMGA

3550 3560 3570 3580 3590 3600
TIGFGERLET LVANLQGAAD RLRENEGISA DPSVLESRLA ENRSIVESLR DKQNAYDALK

3610 3620 3630 3640 3650 3660
QTASELLASA PEGDAAAGDV ENKLNRLEKL WKEIEREAVD RGVLLEDVLD KAKHFWSELD

3670 3680 3690 3700 3710 3720
SCQKAVDDLR NRLELVEPAT GHPEQLADQQ EIMAQVASEM ERARPRIEAL SIAGKQLADY

3730 3740 3750 3760 3770 3780
VPDDEKAVIE NQVANVRGGF STITGLFAEK KRDLIAAMEE AMTFHGDLQE LLKWLDMAEQ

3790 3800 3810 3820 3830 3840
KLLKMSPVEH AKHMTEIEQL LKELHTFKDE VHERGVAKEQ VVATALQLAA DAPPHLAATV

3850 3860 3870 3880 3890 3900
RQPVADLNTR WSRLNAALAE REHKLENLML QMGKLASTIA QLTAWMDKTR ATLKDIAPPK

3910 3920 3930 3940 3950 3960
NAVNLRDIEI AQCKLVVLSN DIHAHQDSVN AVNRAAQKYI QTSGALDAET SDSLKSMNLK

3970 3980 3990 4000 4010 4020
WEDIQKVLES LAFDMEVAKK EAENVGGEVE KWQRWLEETE SALLSTKPTG GLPETAEFQL

4030 4040 4050 4060 4070 4080
DEFKALKLDV EHNASPLEAH LHATEQHLKE EPQDADTWLS KTHGAMKTKW NKVKELLVDR

4090 4100 4110 4120 4130 4140
EKKLQVAYEQ AVALESALND MEDWIIAAER KLTDQPSISR LPDVIEKQLA EHESWMEEVA

4150 4160 4170 4180 4190 4200
GRKMAMTKHQ ASGVHMQYYC EKKDAIPIKN RLVSLKHRVE KISGRTAERA KQLAVTRDEV

4210 4220 4230 4240 4250 4260
ATWQDGLHDL EHFISDVLVK IAPEPNTTSS LEKLKAKLEE VKEAQRDVTA KQTLFDVTRK

4270 4280 4290 4300 4310 4320
RGIGLAERAT RSEYKQISMT NEKMSKKWAE MLKKLRDRLR EAEQAVLEGG AFEESMNDLE

4330 4340 4350 4360 4370 4380
SWVDDELERY QKAEHEPVFA DIDGVRALVD EESRRSAERK TKENGVKTVV KKADALMASG

4390 4400 4410 4420 4430 4440
VDEKDSIAQA KERLVEKWNQ VEEAARHRGN SIKEAEQAAE EFDAKTHALL DWLAVEEQKL

4450 4460 4470 4480 4490 4500
KASGLDEVEG VKQEMDEAKG RYQECLKKGE EILSKCQPAA EPILRNWMRV VEARWKEVSE

4510 4520 4530 4540 4550 4560
KVDEREFTLL EQEQKAKEQN EQIEKLAKFA AQKREELNRM IEQPPAQDLD TMEQNICDFA

4570 4580 4590 4600 4610 4620
NLDSELREQQ PEVDAACKSA KKGARNPAAE MLSTEWKKLW LDAMGLQSSL DNQKALLEEM

4630 4640 4650 4660 4670 4680
KRLEGWKWED WKERYVEWND HAKARVNDLF RRIDRLHTGN VPRQVFIDGI IGSKFPTSRL

4690 4700 4710 4720 4730 4740
EMAKVADRFD KGDGMINAKE FINALRFDAS NRNAKPQTDT EKITHEIELQ KKTCSCCTPY

4750 4760 4770 4780 4790 4800
QIEKISENHY RFGDTHIKRM VRILRSTVMV RVGGGWESLD EFLHKHDPCR AKGRLNINMF

4810 4820 4830 4840 4850 4860
PEARPIHALD SMRSFTKNRH GKQLPTTGTP GPIMKIREKT DRSVPMSGGL GGTAGYTVTT

4870 4880 4890 4900 4910 4920
DSHRHTDARP SRIPRAPSDM SAGRLSRVGS VSNSKNSIVD SSTPSRPESR ASSDAGDRQT

4930 4940
RIPSLRARKG QRYIPQGPSS SSSK


References and documentation are available.

Number of amino acids: 4944 Molecular weight: 561720.73 Theoretical pI: 5.11
Amino acid composition: 
Ala (A) 473 9.6% Arg (R) 344 7.0% Asn (N) 211 4.3% Asp (D) 368 7.4% Cys (C) 29 0.6% Gln (Q) 281 5.7% Glu (E) 550 11.1% Gly (G) 200 4.0% His (H) 104 2.1% Ile (I) 223 4.5% Leu (L) 538 10.9% Lys (K) 353 7.1% Met (M) 120 2.4% Phe (F) 100 2.0% Pro (P) 132 2.7% Ser (S) 295 6.0% Thr (T) 201 4.1% Trp (W) 65 1.3% Tyr (Y) 64 1.3% Val (V) 293 5.9% Pyl (O) 0 0.0% Sec (U) 0 0.0% (B) 0 0.0% (Z) 0 0.0% (X) 0 0.0%
Total number of negatively charged residues (Asp + Glu): 918 Total number of positively charged residues (Arg + Lys): 697 Atomic composition: Carbon C 24354 Hydrogen H 39428 Nitrogen N 7094 Oxygen O 7833 Sulfur S 149 Formula: C24354H39428N7094O7833S149 Total number of atoms: 78858 Extinction coefficients: Extinction coefficients are in units of M-1 cm-1, at 280 nm measured in water. Ext. coefficient 454610 Abs 0.1% (=1 g/l) 0.809, assuming all pairs of Cys residues form cystines Ext. coefficient 452860 Abs 0.1% (=1 g/l) 0.806, assuming all Cys residues are reduced Estimated half-life: The N-terminal of the sequence considered is M (Met). The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo). Instability index: The instability index (II) is computed to be 41.52 This classifies the protein as unstable. Aliphatic index: 86.78 Grand average of hydropathicity (GRAVY): -0.666