ProtParam
User-provided sequence:
10 20 30 40 50 60
MDPSGVKVLE TAEDIQERRQ QVLDRYHRFK ELSTLRRQKL EDSYRFQFFQ RDAEELEKWI
70 80 90 100 110 120
QEKLQVASDE NYKDPTNLQG KLQKHQAFEA EVQANSGAIV KLDETGNLMI SEGHFASETI
130 140 150 160 170 180
RTRLMELHRQ WELLLEKMRE KGIKLLQAQK LVQYLRECED VMDWINDKEA IVTSEELGQD
190 200 210 220 230 240
LEHVEVLQKK FEEFQTDLAA HEERVNEVNQ FAAKLIQEQH PEEELIKTKQ EEVNAAWQRL
250 260 270 280 290 300
KGLALQRQGK LFGAAEVQRF NRDVDETIGW IKEKEQLMAS DDFGRDLASV QALLRKHEGL
310 320 330 340 350 360
ERDLAALEDK VKALCAEADR LQQSHPLSAN QIQVKREELI TNWEQIRTLA AERHARLDDS
370 380 390 400 410 420
YRLQRFLADF RDLTSWVTEM KALINADELA NDVAGAEALL DRHQEHKGEI DAHEDSFKSA
430 440 450 460 470 480
DESGQALLAA GHYASDEVRE KLSILSEERA ALLELWELRR QQYEQCMDLQ LFYRDTEQVD
490 500 510 520 530 540
NWMSKQEAFL LNEDLGDSLD SVEALLKKHE DFEKSLSAQE EKITALDEFA TKLIQNNHYA
550 560 570 580 590 600
MEDVATRRDA LLSRRNALHE RAMHRRAQLA DSFHLQQFFR DSDELKSWVN EKMKTATDEA
610 620 630 640 650 660
YKDPSNLQGK VQKHQAFEAE LSANQSRIDA LEKAGQKLID VNHYAKEEVA ARMNEVISLW
670 680 690 700 710 720
KKLLEATELK GVKLREANQQ QQFNRNVEDI ELWLYEVEGH LASDDYGKDL TNVQNLQKKH
730 740 750 760 770 780
ALLEADVAAH QDRIDGITIQ ARQFQDAGHF DAENIKKKQE ALVARYEALK EPMVARKQKL
790 800 810 820 830 840
ADSLRLQQLF RDVEDEETWI REKEPIAAST NRGKDLIGVQ NLLKKHQALQ AEIAGHEPRI
850 860 870 880 890 900
KAVTQKGNAM VEEGHFAAED VKAKLSELNQ KWEALKAKAS QRRQDLEDSL QAQQYFADAN
910 920 930 940 950 960
EAESWMREKE PIVGSTDYGK DEDSAEALLK KHEALMSDLS AYGSSIQALR EQAQSCRQQV
970 980 990 1000 1010 1020
APMDDETGKE LVLALYDYQE KSPREVTMKK GDILTLLNST NKDWWKVEVN DRQGFVPAAY
1030 1040 1050 1060 1070 1080
VKKLDPAQSA SRENLLEEQG SIALRQGQID NQTRITKEAG SVSLRMKQVE ELYQSLLELG
1090 1100 1110 1120 1130 1140
EKRKGMLEKS CKKFMLFREA NELQQWINEK EAALTSEEVG ADLEQVEVLQ KKFDDFQKDL
1150 1160 1170 1180 1190 1200
KANESRLKDI NKVAEDLESE GLMAEEVQAV QQQEVYGMMP RDEADSKTAS PWKSARLMVH
1210 1220 1230 1240 1250 1260
TVATFNSIKE LNERWRSLQQ LAEERSQLLG SAHEVQRFHR DADETKEWIE EKNQALNTDN
1270 1280 1290 1300 1310 1320
YGHDLASVQA LQRKHEGFER DLAALGDKVN SLGETAQRLI QSHPESAEDL KEKCTELNQA
1330 1340 1350 1360 1370 1380
WTSLGKRADQ RKAKLGDSHD LQRFLSDFRD LMSWINGIRG LVSSDELAKD VTGAEALLER
1390 1400 1410 1420 1430 1440
HQEHRTEIDA RAGTFQAFEQ FGQQLLAHGH YASPEIKEKL DILDQERTDL EKAWVQRRMM
1450 1460 1470 1480 1490 1500
LDHCLELQLF HRDCEQAENW MAAREAFLNT EDKGDSLDSV EALIKKHEDF DKAINVQEEK
1510 1520 1530 1540 1550 1560
IAALQAFADQ LIAVDHYAKG DIANRRNEVL DRWRRLKAQM IEKRSKLGES QTLQQFSRDV
1570 1580 1590 1600 1610 1620
DEIEAWISEK LQTASDESYK DPTNIQSKHQ KHQAFEAELH ANADRIRGVI DMGNSLIERG
1630 1640 1650 1660 1670 1680
ACAGSEDAVK ARLAALADQW QFLVQKSAEK SQKLKEANKQ QNFNTGIKDF DFWLSEVEAL
1690 1700 1710 1720 1730 1740
LASEDYGKDL ASVNNLLKKH QLLEADISAH EDRLKDLNSQ ADSLMTSSAF DTSQVKEKRD
1750 1760 1770 1780 1790 1800
TINGRFQKIK SMATSRRAKL SESHRLHQFF RDMDDEESWI KEKKLLVSSE DYGRDLTGVQ
1810 1820 1830 1840 1850 1860
NLRKKHKRLE AELAAHEPAI QGVLDTGKKL SDDNTIGQEE IQQRLAQFVE HWKELKQLAA
1870 1880 1890 1900 1910 1920
ARGQRLEESL EYQQFVANVE EEEAWINEKM TLVASEDYGD TLAAIQGLLK KHEAFETDFT
1930 1940 1950 1960 1970 1980
VHKDRVNDVC TNGQDLIKKN NHHEENISSK MKGLNGKVSD LEKAAAQRKA KLDENSAFLQ
1990 2000 2010 2020 2030 2040
FNWKADVVES WIGEKENSLK TDDYGRDLSS VQTLLTKQET FDAGLQAFQQ EGIANITALK
2050 2060 2070 2080 2090 2100
DQLLAAKHIQ SKAIEARHAS LMKRWTQLLA NSATRKKKLL EAQSHFRKVE DLFLTFAKKA
2110 2120 2130 2140 2150 2160
SAFNSWFENA EEDLTDPVRC NSLEEIKALR EAHDAFRSSL SSAQADFNQL AELDRQIKSF
2170 2180 2190 2200 2210 2220
RVASNPYTWF TMEALEETWR NLQKIIKERE LELQKEQRRQ EENDKLRQEF AQHANAFHQW
2230 2240 2250 2260 2270 2280
IQETRTYLLD GSCMVEESGT LESQLEATKR KHQEIRAMRS QLKKIEDLGA AMEEALILDN
2290 2300 2310 2320 2330 2340
KYTEHSTVGL AQQWDQLDQL GMRMQHNLEQ QIQARNTTGV TEEALKEFSM MFKHFDKDKS
2350 2360 2370 2380 2390 2400
GRLNHQEFKS CLRSLGYDLP MVEEGEPDPE FEAILDTVDP NRDGHVSLQE YMAFMISRET
2410 2420 2430 2440 2450 2460
ENVKSSEEIE SAFRALSSEG KPYVTKEELY QNLTREQADY CVSHMKPYVD GKGRELPTAF
2470
DYVEFTRSLF VN
References and
documentation are available.
Number of amino acids: 2472
Molecular weight: 284637.50
Theoretical pI: 5.20
Total number of negatively charged residues (Asp + Glu): 461
Total number of positively charged residues (Arg + Lys): 346
Atomic composition:
Carbon C 12414
Hydrogen H 19705
Nitrogen N 3585
Oxygen O 3963
Sulfur S 64
Formula: C
12414H
19705N
3585O
3963S
64
Total number of atoms: 39731
Extinction coefficients:
Extinction coefficients are in units of M
-1 cm
-1, at 280 nm measured in water.
Ext. coefficient 281965
Abs 0.1% (=1 g/l) 0.991, assuming all pairs of Cys residues form cystines
Ext. coefficient 281090
Abs 0.1% (=1 g/l) 0.988, assuming all Cys residues are reduced
Estimated half-life:
The N-terminal of the sequence considered is M (Met).
The estimated half-life is: 30 hours (mammalian reticulocytes, in vitro).
>20 hours (yeast, in vivo).
>10 hours (Escherichia coli, in vivo).
Instability index:
The instability index (II) is computed to be 42.70
This classifies the protein as unstable.
Aliphatic index: 79.34
Grand average of hydropathicity (GRAVY): -0.789