ProtParam

User-provided sequence:

        10         20         30         40         50         60 
MVGERYRSHH AVDPSDAQEK EVLDHYELNR EKYNDERDNV QKKTFTKWVN KHLSKTDHKI

70 80 90 100 110 120
DDLFVDLRDG YALIALLEAL TGERIQKENG YTRFHRIQNV QYCLDFLKKK NIKLVNIRPE

130 140 150 160 170 180
DIVEGNGKLT LGLIWTIILN FQVSVIRQRL LLESSQHEQM SAKHTTTNSQ VSLHGSDATS

190 200 210 220 230 240
ARDALLQWAR RVTAGYPRVN VNNFSSSWRD GLAFNAILHR YRSSAIDWNK ISSDSVSNTE

250 260 270 280 290 300
RLNNAFAAAD REFGVERLLD AEDVDTNNPD EKSIITYVSS LYNALPHEPE MSRLQKVQEE

310 320 330 340 350 360
YIEEAYEWRE WVVRAIQLVD DRHLQGTASE LIYELQRFRE DDLPPREEQK RRLTLVYEHL

370 380 390 400 410 420
EKVMRSTELF AIPHELSAPE LQRVWNELQN SIDRRFDVLE RHRIQEGNSN DLLSRLARGI

430 440 450 460 470 480
GITNEKLDLI LKRIEDVEAR VDTSPPAAVE RTVSEIVDDL NALESPIARF FEDVEELKSM

490 500 510 520 530 540
QHPEANDFYK QVYGLHQRRT TYLDRLTNQI LVRLGVRTDS LHKENQQRLE NMRKTSFSRV

550 560 570 580 590 600
EECIEWVRVR MEKLTTMEFL EDLETLEHVF EQHKFDNRDI QDFRQNVDEC IARQAEVSAE

610 620 630 640 650 660
DTYEYCELLR VLESEYQQLR DLSAGRMLDL DSLIAFVRAA QLELIWVSER ESIEVTRNWS

670 680 690 700 710 720
DIKQLDLPML TNYYKQLLHE MELREKQYND VHNQGAALLN QGHPAIRVIE VYLRQMQSQW

730 740 750 760 770 780
DWLLALSKCL EEHLRDALNL KSFMEEASDA EAWIQEQSVR LENNYNRTDF SLEEGERFLR

790 800 810 820 830 840
ELDEIKEILN KYHQVLMALT ERCASISPLW QRGERIPHPI KVTALCDYSD ENVTIKAGDD

850 860 870 880 890 900
VYLLDNSDLI KWTIRDISGA EGQVPSVVFR IPPTDARLTA LLNRLLQQFE KLKKLWDKKH

910 920 930 940 950 960
RMVRFNMVLN TMRTIQGWDL DTFNSIDPDQ RDAIMKALND DANKLLSELD PNDPLALRLR

970 980 990 1000 1010 1020
EELRRTNEHF WNLLNASQKP PEPDWASQYD QKMAELLKKL EEAWRELNDA VGKPISRSPE

1030 1040 1050 1060 1070 1080
DLERVIHAHK RFEDALQALD SDVANVKELF RQLPNPTPTQ RVNHDRLNGL WDDLWDLSRM

1090 1100 1110 1120 1130 1140
YVERIKVLES VLNGMVEVAD IVRQHEITLN SFDDLPAALD KLRGHHSQLL EINMVLKQQQ

1150 1160 1170 1180 1190 1200
TVIDQLNRNV ALLRQHVSRT RINEGHHPDV DAIEDEVQKL NVRWENVNSQ IASRLLAVES

1210 1220 1230 1240 1250 1260
ALQIQMVYRS EYETEMSWLD TVEETINRLR KPEELRPEQY QQQLDMLIAE YTNLQEHTQA

1270 1280 1290 1300 1310 1320
IEHVNKEGGR FIHEAKIFDA KLGQYSDGIV GIHGPGIKSE FRRTKPQPKN GSQIVTEELE

1330 1340 1350 1360 1370 1380
LLNRRFAQLS SLILERRNTM QVLIQNWKRQ KQEEEDRRRA EEEEKRRAFE AARLKALEDA

1390 1400 1410 1420 1430 1440
ERLRREREDA EARRRAKDDA DRARRMAEDA ERARREAEEA ERRRREEEER RRREAEDRKR

1450 1460 1470 1480 1490 1500
RQDEEDRRRR EEEERRRREE EEKRRKKPDF DLKIQKPTMN IEPVVTSAGD EWEIVDPIGD

1510 1520 1530 1540 1550 1560
RAKISEVEDE MQTFAEETIT NTQFYEMEGN LNKKTGEVLT FIEAIRQGNL TAAGQFFDIP

1570 1580 1590 1600 1610 1620
SASVMSLEEA AKYGLVEQDL PTVLNTTWGI HHPETRQPIT LSEAIRIGLY DSNIRQFRDI

1630 1640 1650 1660 1670 1680
HTGEILSQSD LMSKGIANWE TVLKLIKEKI MKLPPTSLAN ALEKRMVDPQ TGVFKGRTTD

1690 1700 1710 1720 1730 1740
MELQLWAAIY HGYLSIENPE HVTTIGTSLT DAIENGFINA NNAEFNDRNS NDSFKLRDAV

1750 1760 1770 1780 1790 1800
SKRTGLINND VVEIVQMTDD EKKRVPLGAA LVRNVIGVKD GKYTNLASRQ SMSLKEAHQD

1810 1820 1830 1840 1850 1860
GLIGKALTLE EAAQNKLVDS SGYFVDRGIL GQRYTLLEAI VAGLIDAEVR HIVDPNENDV

1870 1880 1890 1900 1910 1920
ISISEAMERG LLLPNGKIVL EKDEKEFTIP EAVHEGLLTK RVRHSIFNIR GIRNTETSQN

1930 1940 1950 1960 1970 1980
LSFNEAVEAG VIIPNAERVV DLQTQESFLI TDDRAKNLIE VALHELLTTP VGIKNDRGAY

1990 2000 2010 2020 2030 2040
ELNLVRAVSS GIIDPVKGVF FNKNTKHELS TKEAYEQGFI TLRGALKVFG LLNVPPALVT

2050 2060 2070 2080 2090 2100
PAKKLDRTRR IGRPGQEGGA DGNHVTVTLQ DAMRQGLVGN QRYVHGKVDL SLEEALNRGL

2110 2120 2130 2140 2150 2160
LDPNSQWIMP DQSKGDGPTI EEKTTETMTE TGQQLAPKYF PDKNIEESVT TVKRVRTTET

2170 2180 2190 2200 2210 2220
TALGGPGGVS VYRSIAGGKG ALEVPSRGYH IYEAERKGLI DLTNGKISAP NVDRVLSFAE

2230 2240 2250 2260 2270 2280
GIELGIIDAT TIQVSTGGRS VSVKEALEKK IMESDGSVNG RNIEQAVEAK TIVIDAEPLV

2290 2300 2310 2320 2330 2340
PYNNQSKNII QIPSGNGPII SFRQVGQPVV EESTQSWEFD TQQGVFVDNL TGEKLSLERA

2350 2360 2370 2380 2390 2400
LATGKVAPED ISVRDGLTGR EMPLEEAEKW GIIDSKNRYY FDKSQNKRMS YTEAAQQHLM

2410 2420 2430 2440 2450 2460
YLTGGVPENA SDAVHTTVKV QTRTAVAKKE ALSSGLPLSD YSLGKALSLG WYDQSAGTFT

2470 2480 2490 2500 2510 2520
HPETQKQLTL KEAIMKGLFD PYDTTIVDKK AGKEISLLEA IREEIVDDNA GTVKDTQTGR

2530 2540 2550 2560 2570 2580
VHNFLEAGNL GLVKGKNFGD TLDSSLFSGR LDLGSGSYTR PSGGSSMPIH EAINRNYVDR

2590 2600 2610 2620 2630 2640
SSVSVRDPSS GQQYSYQEAV DRRIVDADRG LIHGGGDSTS FSQGLTSGHL VSSGSSRPTS

2650 2660 2670 2680 2690 2700
GNQSRLVEQR LQLTPFAPNG GAGRSRDGRH ELVDLGGGQQ VQVKVVRGEG GVEKGEYTDP

2710 2720 2730 2740 2750 2760
KSGMKFTIQM HGDPVVTETK TSVKSTSQVH SVELEPHAEF VGIDRVRDKR NNHVYSLEEA

2770 2780 2790 2800 2810 2820
RRLGLAKVDK KGKQMTRSYQ AFRSNIDNAV NNGVVDSHNQ KISLENAIHA RIIDIRNLSY

2830 2840 2850 2860 2870 2880
HHPTHGSVDL KTAASQGLVD VTLSEVLPKG IIHPGTGERI DIKRGIELRI IDAATGKVRD

2890 2900 2910 2920 2930 2940
PRDNKTITWI DILKPVYQAI ASEGVFDPTK GHHVPVTTAL NDGLINASTG NYRNSITGED

2950 2960 2970 2980 2990 3000
VPLSDTVDRG LIDRSTYETI TKPFFTDYRS NRKLNLVEAV RERLIDPKNR TIQLSRQSIV

3010 3020 3030 3040 3050 3060
PIAKAVQDGR IPMEIGEKLR KVDKLNFAEA LGKGLIDSKQ NVFTDPDTGR QMSIAQAIQE

3070 3080 3090 3100 3110 3120
GFIDTGSVQG IEGNDESNLF NVLQSSDFDE TSGRIYDKKS SLHLTFNDAV RRGVIDGDSL

3130 3140 3150 3160 3170 3180
LHLQATGDLL TLRDALHQNK IDSNGKFVEG ANRLKISDAV KSGRLTVIAS PSEAVQAVTE

3190 3200 3210 3220 3230 3240
GVKRRDAEGY KFRISEYDDS QAQRQSVPKF RETVTVSKLT PQYNEPGLSV RMRQSTTSIG

3250 3260 3270 3280 3290 3300
DRASKFLEDP SQLAEIQQDF LSSLEAAQFD TDEKVIEHPH TRQRVSVREA AETGLLDVQT

3310 3320 3330 3340 3350 3360
GEIVNPDNGR RFSIPRAVHM KLVGGDAAKR IMEQLNMPVE EVAYATQTIT SSTHRAPSPV

3370 3380 3390 3400 3410 3420
FATASVSQPS GGATTTREFT RTINWHGQPS ELRNSQTDPL AAYTTVSSNV TESTEDAPAW


ARKQ


Number of amino acids: 3424
Molecular weight: 388253.00
Theoretical pI: 5.56

Amino acid composition:
Ala (A)  229   6.7%
Arg (R)  271   7.9%
Asn (N)  171   5.0%
Asp (D)  231   6.7%
Cys (C)    7   0.2%
Gln (Q)  173   5.1%
Glu (E)  313   9.1%
Gly (G)  200   5.8%
His (H)   83   2.4%
Ile (I)  202   5.9%
Leu (L)  322   9.4%
Lys (K)  183   5.3%
Met (M)   56   1.6%
Phe (F)   91   2.7%
Pro (P)  119   3.5%
Ser (S)  226   6.6%
Thr (T)  201   5.9%
Trp (W)   37   1.1%
Tyr (Y)   76   2.2%
Val (V)  233   6.8%
Pyl (O)    0   0.0%
Sec (U)    0   0.0%

    (B)    0   0.0%
    (Z)    0   0.0%
    (X)    0   0.0%
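
The percentages in the composition table follow directly from the residue counts. A minimal Python sketch, assuming the counts are typed in by hand from the table above (the name composition_percent is illustrative and is not part of ProtParam):

```python
# Residue counts copied from the amino acid composition table.
counts = {
    "Ala": 229, "Arg": 271, "Asn": 171, "Asp": 231, "Cys": 7,
    "Gln": 173, "Glu": 313, "Gly": 200, "His": 83, "Ile": 202,
    "Leu": 322, "Lys": 183, "Met": 56, "Phe": 91, "Pro": 119,
    "Ser": 226, "Thr": 201, "Trp": 37, "Tyr": 76, "Val": 233,
}


def composition_percent(counts):
    """Return each residue's share of the total as a percentage (one decimal)."""
    total = sum(counts.values())
    return {res: round(100.0 * n / total, 1) for res, n in counts.items()}


total_residues = sum(counts.values())
percent = composition_percent(counts)

for res, n in counts.items():
    print(f"{res}  {n:4d}  {percent[res]:4.1f}%")
print("Total residues:", total_residues)
```

Summing the counts is a quick consistency check against the reported number of amino acids.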
Total number of negatively charged residues (Asp + Glu): 544
Total number of positively charged residues (Arg + Lys): 454

Atomic composition:
Carbon    C  16944
Hydrogen  H  27175
Nitrogen  N   4967
Oxygen    O   5360
Sulfur    S     63

Formula: C16944H27175N4967O5360S63
Total number of atoms: 54509

Extinction coefficients:
Extinction coefficients are in units of M^-1 cm^-1, at 280 nm measured in water.

Ext. coefficient 317115
Abs 0.1% (=1 g/l) 0.817, assuming all pairs of Cys residues form cystines

Ext. coefficient 316740
Abs 0.1% (=1 g/l) 0.816, assuming all Cys residues are reduced

Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
                            >20 hours (yeast, in vivo).
                            >10 hours (Escherichia coli, in vivo).

Instability index:
The instability index (II) is computed to be 43.20
This classifies the protein as unstable.

Aliphatic index: 86.11
Grand average of hydropathicity (GRAVY): -0.622
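
The two extinction coefficients and Abs 0.1% values above can be reproduced from the Tyr, Trp and Cys counts. A minimal Python sketch, assuming the commonly cited molar extinction values at 280 nm (Tyr 1490, Trp 5500, cystine 125 M^-1 cm^-1); the function names are illustrative and are not ProtParam's own code:

```python
# Assumed per-residue molar extinction values at 280 nm, in M^-1 cm^-1.
EXT_TYR = 1490
EXT_TRP = 5500
EXT_CYSTINE = 125  # per disulfide-bonded Cys pair


def extinction_coefficient(n_tyr, n_trp, n_cys, cystines=True):
    """Estimate the molar extinction coefficient at 280 nm.

    With cystines=True every complete pair of Cys residues is counted
    as one cystine; with cystines=False all Cys are treated as reduced.
    """
    pairs = n_cys // 2 if cystines else 0
    return n_tyr * EXT_TYR + n_trp * EXT_TRP + pairs * EXT_CYSTINE


def abs_0_1_percent(ext_coefficient, molecular_weight):
    """Absorbance of a 1 g/l solution: extinction coefficient / molecular weight."""
    return ext_coefficient / molecular_weight


# Counts and molecular weight taken from the report above.
N_TYR, N_TRP, N_CYS = 76, 37, 7
MOL_WEIGHT = 388253.00

ext_cystines = extinction_coefficient(N_TYR, N_TRP, N_CYS, cystines=True)
ext_reduced = extinction_coefficient(N_TYR, N_TRP, N_CYS, cystines=False)

print(ext_cystines, round(abs_0_1_percent(ext_cystines, MOL_WEIGHT), 3))
print(ext_reduced, round(abs_0_1_percent(ext_reduced, MOL_WEIGHT), 3))
```

With 7 Cys residues only 3 complete pairs can form, which accounts for the 375 M^-1 cm^-1 difference between the two reported coefficients.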