GlycoProtDB ID - GPDB0001559


Protein Name Alstrom syndrome protein 1 homolog
Protein Accession Number A0A1D5RMI8
Gene Alms1
Length 3720
Status
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__: Potential Sequon, N: Identified Site)

Each row below holds 100 residues in blocks of 10; rows start at positions 1, 101, 201, ..., 3701.
MEPEDLPWPD ELEEEEEEEE EEGEEEEGKK EVENASAAAT EEALTSEESG RLEEFEEAGP DLDFNYESQR QESSDEEEDE LAKAWLQAHP DRPGSAFSLP
PPTPPPPPPP LSPRLRYTPV EHLGKTEVVP LTCRVWQQSS YQDNSRAQFS NSSTMLLETG VRWGSEEDQR TESWHCLPQE RDSSQTLAMS QTEIGRVEGT
EVPDLPSQEG GLPAQSQCPG KKPKLNVLCS PLLVIQDNFA APDLPLLTCL IQDQEEVEPD SLFQQSELEF APLRGIPDKS EDSEWLARPS EVSEALIQAT
SETSSDLANS CFSISQHPLT GSTTFGSQHS LFMFEQENNG ETVPSNAPDE LKSPKDSDHD VLCSYMSWPS EKDTQQPEGN LADKDQVSTS TLSDISDESI
AAKRNNSFGV SCSCNQYWNQ KALQQTEACL ASKDHASFPC KITPLPNKVP YSFLQRMLKR GKEAPLIADN HLENVCLRTP AEDEMSLKNE SLEGFEGNED
LQKELSQFFK QKETRNTWAS SQVHTRYPEI KYIGTVAVPD SKKQVAEPYI LYRGLDISDC PLPTVKSNLD ETSEQLYFQE ERNQKAASTF DEKNVDATEN
VAFEAIIASI ESSNQESLIS ETQTDNTELL ADDSEEKEPL KPDLSPLNCK NEISFIDKLK YLKYQSTPGV FEPADSKSLL YKEDDASSLY LYPDLKSSIN
APQNMLNKPV SVTLELKSSL SEKACNQAVG LETQSPSFQY GISNLDLTEK VGSSSISFAM KIATSESWVL EDLKQQTFEE VADSSDYKLE GLQGKAESGV
LTRCGDAKYS SLYENLGAQS ERIAVLQREV GCSNLGISQA SPSSLPSFVP QEPTSEPEYH SSNLRMLRVS PDTLLTTHTH SAGSADQKIG AAVVSSAYSQ
EIKPGSFHQE ELPDRHLNEE IRKVSPALRT AGQKPEMLPV QSSSYSKGMK SIFYQHPVSH GHQGKEPLSV SAVCGSAGNK AFHQLSTLSD SLLTEETWPV
SVIPGLGNQK TPLPSEFSLS YSHRGKNLPE DVVKVSTDSG SAHKKADILT ASSRTYQHKM KPANIYHQEL PDSRVPIGTR KVAFESGPAG QKSGVSHPYG
EMPSVFYQQG LPDRHSAKSP TKTFIPGPAD QKTDLSPVPP TSSSHAEKPV SPYQLTLPGS HLPEDVFKAS SVCKSSDELS GITALTSASY SYKGRPNSSY
QQKFPDSHLN EEAQKILGTT GTVDQKTVTP TMSSSFLQKE KPSIFYQQTL PDGGLSEEDL QVSAVPWPAD QNIAIPTVTS AAFSQREKPR IFYQQTLSVD
RLPGEPLNVL GTSGPPDQNT GAPTVTPSSY FPGEESIIFY QAGFPGNTLS AMSFKVPRIS GSTEQTNVTT GSSSSYSVGE KSIIFYHQAL PDGRLPQEAS
PAPADLNTGE PPMYLASCSV GVKPIIFYQQ PMSDSQRTKG HKESDVPGPT DQKTGIATVH STSQSYIGRR TVSYQKEFPD LSEKALKVLG DVGSTEQKTQ
IPVVSSALLH KEGPSAYQED LPDLTEEPLQ ILGVSEEVSS SSYQRKLPDH IEVFLKSVGS GSADRKTGAQ IVSSSREKSS GFHQQELPNT GGDAVDAFHP
EPVVQEVRKV QTPGAPAGPS SSHFHKEKLS DYQKASPHRD LTESSLKAST VPGLSDQKKK PAVSSGFCLH KEKHEISASA LLNCQTAELL TVTQRSCLHR
EDPAISTVIK PDDQKIPLPT TFHGSSDQKV KPVIFVQKQL RDRDQSEDIP KISTVSEPTV VNTVLPVLLP GSYSHREKSD SFYPQELPDG HLTEVDLKVS
SGLGQADQIS GLPTGIPGTY SHSEKHQLIS EHVQELMDNL NSSESSCLSV DSMPLNSQID DGVIICKPES LGFANAGCEE MQNIDRGSKT LKEIQTLLME
AENMALKRCN FSVPLVPFRD VNDVSFIRSK KVVCFKESST TDVCTQRESF VEEVPHIEYV QKDIGTQTNL KYQRGVGNWE FISSATFRSP LQEAEGTARM
AYDETFRQYK AARSVMRSEP EGCSTGIGNK MIIPMMTIIK SDSSSDVSDG CCSWDNNLPE SLESVSDVFL NFFPYTSPKT SITDSREEEW LSESEDGYGS
TDSLAAHVKY LLQCETSLNQ AKQILKNAEE EEYRVRTQAW NLKFNLGRDR GYSISELNED DRRKVEEIKA KLFGHGRATH MSEGLRSPQG IGCLPEAVCS
RIIIESHEKG CFRTLTAEQP RPDSCHCAFR SVEPSDLIRG HRSPSSWRGR HINLSRSIEQ SNPCFKVGSS FQLQSHPPFQ KLLPDDIKIS KGVGMPVHAY
MDPQPSELVE PTCVPAKEMD FPSSSQILPP EPKKQFTTAI TFSSHEHSEC ISDSSGCKVG VTADSQCSGP SLGVFKPHIP EEQISPRDLK QKTSFQSSLE
RHGSTPVTIL ADGSRQRQKL PVDFEHSHQK EKLLQRLGFK VSHSEPNVST NVSNFKGVQF SGKDTIVSQD KLTSTVEVKE KNVTVTPDLP SCIFLEQPEL
FEESHTPHTD LQMRKYPSPS CPEIASRIFL EQPKLSEQSK APHVDREIRE DHSFFPKCQD YIVADPSPDF PDQQQCKPPD VVGHTRKQNS LLSEGQDYEL
EEVQHIPQSY FSNMVNVEAK VSDAISQSAP DHCTAASTPP SNRKALSCVR ITLCPKTSSK LDSGTLGERF HSLDPASKTR INSEFNSDLR IISSRSLEPT
SKLLTCKPVA QDQESLVFLG PKSPLDLQVA QSSLPDSKTI FQDLKTKPPQ NSQIVTSRQT QVNISHLEGY SKPEGTPVSA DGSQEQSKVS FTTSFGKLSS
DAITQITTES PEKTTFSSEI FIHADDRGQG ILDPMAQKPS RFASSSSVQQ IPASHGKDAQ PVLLPYKPSG SSKMYYVPLL KRVPSYLDSK SDTTVESSHS
GSNDAIAPDF PPQMLGTRDD DLSNTVNIKH KEGIYSKRAA TKGKNPSQKG DAAAPVQMPI TWDENVLDEN QEEVISRGVV IKMAGPEEMS SLEKDLAGPS
DITVQDRKTE NLPDTKSIKQ KEGSLEIESE CHSAFENTAH SVFRSAKFYF HHPVHLPHEQ DFCHESLGRS VFMQHSWKDF FHHHSGHSCL PPPGPSSDKL
DKTKMDYTRI KSLSINLNLG EHEKIHTIKN QARDPKGKRQ ANEQKKDQKV TPELTTECPV SLNELWNRYQ ERQKQQNPSG ACDTKELSLV ERLDRLAKLL
QNPITHSLRA SESAQDDSRG GHRAREWTGR RQQKQKGKQH RKWSKSLERG QSTGDFRKSK VFSPHQGGKS SQFKIEQIKL DKYILRKEPG FNNVSNTSLD
SRPSEESVSL TDSPNIFSST DSPVDSDVLT PTDRDMPLNE RSSSISTIDT VRLIQAFGQD RLSLSPRRIK LYSTVTSQRR RYLEQPCKHN RKALNTACPQ
MTSEHSRRRH IQVANHMTSS DSVSSPGSLL SLDSALSNEE TVRMVSKGVQ AGNLEIVAGV KKYTQDVGVT FPTPSSSEAR LEEDSDVTSS SEEKAKEKKF
LSNYLQTKNL RKNKPNPCAG VSWFVPVESG QSGSKKENLP KIYRPVISWF EPVTKTKPWR EPLREQNWQA QCMNSRGSLG GPGRDSGQVS LRPFVRATLQ
ESLQLHRPDF ISHSGERIKR LKLLVQERKL QSLFQSEREA LFHSARPLPR RVLLAVQKNK PIGKKEMIQR TRRIYEQLPE VKKKREEEKR KSEYKSYWLR
AQHYKMKVTN HLLGRKVPWD
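The row layout above (100 residues per row, split into blocks of 10) can be mapped back to 1-based protein positions with a little arithmetic. The following is a minimal sketch, not part of GlycoProtDB; the helper names `clean_row`, `row_start`, and `residue_at` are hypothetical, and the row width of 100 is taken from the listing. It uses the first four blocks of the first row and the full last row of this entry.

```python
def clean_row(row: str) -> str:
    """Drop the spaces that separate the 10-residue blocks."""
    return "".join(row.split())


def row_start(row_index: int, row_width: int = 100) -> int:
    """1-based position of the first residue in a row (row_index is 0-based)."""
    return row_index * row_width + 1


def residue_at(row: str, row_index: int, position: int) -> str:
    """Residue at a 1-based protein position that falls inside the given row."""
    cleaned = clean_row(row)
    offset = position - row_start(row_index)
    if not 0 <= offset < len(cleaned):
        raise ValueError(f"position {position} is not in row {row_index}")
    return cleaned[offset]


# First four blocks of the first row and the complete last row of this entry.
first_row_prefix = "MEPEDLPWPD ELEEEEEEEE EEGEEEEGKK EVENASAAAT"
last_row = "AQHYKMKVTN HLLGRKVPWD"

# The last row is the 38th (index 37) and starts at position 3701.
total_length = row_start(37) - 1 + len(clean_row(last_row))

print(residue_at(first_row_prefix, 0, 34))  # residue at the first listed sequon
print(total_length)
```

The computed total agrees with the Length field of the entry (3720), and position 34 is the Asn of the first listed sequon (NAS).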

N-glycosylation sites

Potential sites        Identified Peptide*1
Position  Sequon       Position     Sequence  Unique or Shared
34        NAS
151       NSS
309       NSC
405       NNS
474       NVC
489       NES
1197      NSS
1367      NVT
1841      NSS
1910      NFS
2253      NLS          2249 - 2256  GRHINLSR  Shared ( 2 )
2447      NVS
2451      NVS
2482      NVT
2763      NIS
3293      NVS
3296      NTS
Testis
ConA
*1: _: Potential Sequon; other marked residues: Asn (glycosylated), Gln (deaminated: pyroGlu), Met (oxidized), Cys (carbamidomethylated and deaminated)
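
The potential sites listed above can be reproduced by scanning the sequence for sequon motifs. The sketch below assumes the pattern N-X-[S/T/C] with X not Pro; this is inferred from the sequons in the table (which include NSC and NVC) and is not a documented GlycoProtDB rule. The function name `find_sequons` is hypothetical. The fragment is residues 2241 to 2270 of this entry, which contain the identified peptide GRHINLSR.

```python
import re

# Assumed sequon pattern: Asn, any residue except Pro, then Ser/Thr/Cys.
# The lookahead lets overlapping candidates (e.g. "NNS") be reported too.
SEQUON = re.compile(r"(?=(N[^P][STC]))")


def find_sequons(fragment: str, start: int = 1) -> list[tuple[int, str]]:
    """Return (1-based position of Asn, sequon) pairs found in the fragment.

    `start` is the protein position of the first residue of the fragment.
    """
    return [(start + m.start(), m.group(1)) for m in SEQUON.finditer(fragment)]


# Residues 2241-2270 of this entry (blocks 5 to 7 of the row starting at 2201).
fragment_start = 2241
fragment = "HRSPSSWRGRHINLSRSIEQSNPCFKVGSS"

hits = find_sequons(fragment, start=fragment_start)
print(hits)

# Slice out the identified peptide using its listed coordinates (2249 - 2256).
peptide_start, peptide_end = 2249, 2256
peptide = fragment[peptide_start - fragment_start : peptide_end - fragment_start + 1]
print(peptide)
```

In this fragment the scan reports only the NLS sequon at position 2253; the nearby NPC is skipped because the middle residue is Pro. The sliced peptide matches the GRHINLSR entry in the table.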

Publications

Kaji H, Shikanai T, Sasaki-Sawa A, et al. Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB. J Proteome Res. 2012;11(9):4553-4566. PMID: 22823882

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05