GlycoProtDB ID - GPDB0006031


Protein Name CUB and sushi domain-containing protein 3
Protein Accession Number Q80T79
Gene Csmd3
Length 3707
Status
Reviewed
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__:Potential Sequon , N:Identified Site)

1
101
201
301
401
501
601
701
801
901
1001
1101
1201
1301
1401
1501
1601
1701
1801
1901
2001
2101
2201
2301
2401
2501
2601
2701
2801
2901
3001
3101
3201
3301
3401
3501
3601
3701
MKGSRKGESR AKESKPREPG TRRCAKCGRL DFILKKKMGI KSGFTFWNLV FLLTLSCVKG FIYTCGGTLK GLNGTIESPG FPYGYPNGAN CTWVIIAEER
NRIQIVFQSF ALEEEYDYLS LYDGHPHPTN FRTRLTGFHL PPPVTSTKSV FSLRLTSDFA VSAHGFKVYY EELQSSSCGN PGVPPKGVLY GTRFDVGDKI
RYSCVTGYIL DGHPQLTCIA NSVNTASWDF PVPICRAEDA CGGTMRGSSG IISSPGFPNE YHNNADCTWT IVAEPGDTIS LIFTDFQMEE KYDYLEIEGS
EPPTIWLSGM NIPPPIISNK NWLRLHFVTD SNHRYRGFSA PYQGSSPLTL TASIGELEEH IRTATGAIDV ASTPADVTVS SVTAVTSHRL SEEQRVQVRS
LSDSGLDPNT PEDQLSPHQA DTQSTSRRPR NAEQIERTKE LAVVTHRVKK AIDFKSRGFK LFPGKDNSNK FSLLNEGGIK TASNLCPDPG EPENGKRFGS
DFSLGSTVQF SCDEDYVLQG AKSITCQRIA EVFAAWSDHR PVCKVKTCGS NLQGPSGTFT SPNFPFQYDS NAQCVWVITA VNTNKVIQIN FEEFDLEIGY
DTLTIGDGGE VGDPRTVLQV LTGSFVPDLI VSMRSQMWLH LQTDESVGSV GFKVNYKEIE KESCGDPGTP LYGIREGDGF SNRDVLRFEC QFGFELIGEK
SIVCQENNQW SANIPICIFP CLSNFTAPMG TVLSPDYPEG YGNNLNCIWT IISDPGSRIH LSFNDFDLES QFDFLAVKDG DSPDSPILGT FTGAEVPSHL
TSNSHILRLE FQADHSMSGR GFNITYNTFG HNECPDPGIP INARRFGDNF QLGSSISVIC EEGFIKTQGT ETITCILMDG KVMWSGPIPR CGAPCGGHFS
APSGVILSPG WPGYYKDSLN CEWVIEAEPG HSIKITFERF QTELNYDVLE VHDGPNLLSP LLGSYNGTQV PQFLFSSSNF IYLLFTTDNS RSNNGFKIHY
ESVTVNTYSC LDPGIPVHGR RYGHDFSIGS TVSFSCDPGY RLSHEEPLLC EKNHWWSHPL PTCDALCGGD VRGPSGTILS PGYPEFYPNS LNCTWTVDVT
HGKGVQFNFH TFHLEDHHDY LLITENGSFT QPLARLTGSE LPSTINAGLY GNFRAQLRFI SDFSISYEGF NITFSEYNLE PCEDPGIPQY GSRVGFSFGV
GDTLTFSCSL GYRLEGSSEI ICLGGGRRVW SAPLPRCVAE CGASATNNEG ILLSPNYPLN YENNHECIYS LQVQAGKGIN ISARTFHLAQ GDVLKIYDGK
DKTTHLLGAF TGASMRGLTL SSTSNQLWLE FNSDSEGTDE GFQLVYTSFE LSHCEDPGIP QFGYKISDQG HFAGSTIIYG CNPGYTLHGS SLLKCMTGER
RAWDYPLPSC IAECGGRFKG ESSGRILSPG YPFPYDNNLR CMWMIEVDPG NIVSLQFLAF DTEASHDILR VWDGPPENEM LLKEVSGSLI PDGIHSTLNI
VTIQFDTDFY ISKSGFAIQF SSSVATACRD PGVPMNGTRN GDGREPGDTV VFQCDPGYEL QGQERITCIQ VENRYFWQPS PPVCIAPCGG NLTGSSGFIL
SPNFPHPYPH SRDCDWTISV NTDYVISLAF ISFSIEPNYD FLYIYDGPDS NSPLIGSFQD SKLPERIESS SNTMHLAFRS DGSVSYTGFH LEYKAKLRES
CFDPGNIMNG TRLGMDYKLG STVTYYCDAG YVLQGYSTLT CIMGDDGRPG WNRVLPSCHA PCGSRSTGSE GTVLSPNYPK NYSVDHNCVY SIAVPKEFVV
FGQFVFFQTS LHDVVEVFDG PTQQSPLLSS LSGSHSGESL PLSSGNQITI RFTSVGPITA KGFHFVYQAV PRTSSTQCSS VPEPRFGRRI GNDFAVGSLV
LFECNPGYIL HGSRAIRCET VPNSLAQWND SLPTCIVPCG GILTKRKGTI LSPGYPEPYD NNLNCVWKIT VPEGAGIQVQ VVSFATEHNW DSLDFYDGGD
NNAPRLGSYS GTTIPHLLNS TSNNLYLNFQ SDISVSAAGF HLEYTAIGLD SCPEPQTPSS GIKVGDRYMV GDVVSFQCDQ GYSLQGHSHI TCMPGPVRRW
NYPIPICLAQ CGGAMSDFSG VILSPGFPGN YPSSLDCTWT IKLPIGFGVH LQFVNFSTET IHDYLEVRSG SSEISTVIGR LSGPQIPSSL FSTTHETSLY
FHSDYSQNKQ GFHIVYQAYQ LQSCPDPRPF RNGFVIGNDF TVGQTISFEC FPGYTLIGNS ALTCLHGVSR NWNHPLPRCE ALCGGNITAM NGTIYSPGYP
DEYPNFQDCF WLVRVPPGNG IYINFTVLQT EPIYDFITVW DGPDQNSPQI GQFSGNTALE SVYSTSNQIL IKFHSDFTTS GFFVLSYHAY QLRVCQPPPP
VPNAEILTED DEFEIGDIIR YQCLPGFTLV GNAILTCRLG ERLQMDGAPP VCQVLCPANE LRLDSTGVIL SPGYPDSYPN LQMCAWSISV EKGYNISMFV
EFFQTEKEFD VLQVYDGPNI QSPVLISLSG DYSAAFNVTS NGHEVFLQWS ADHGNNKKGF RIRYIAFYCS TPESPPHGYI ISQTGGQLNS VVRWACDRGF
RLVGKSSAVC RKSSYGYHSW DAPVPACQAI SCGIPKAPTN GGILTTDYLV GTRVTYFCND GYRLSSKELT TATCQSDGTW SNHNKTPRCV VVTCPSINSF
TLDHGRWRIV NGSHYEYKTK VVFSCDPGYH GLGPASIECL PNGTWSWRTE RPYCQIISCG ELPTPPNGNK IGTQTSYGST AIFTCDLGFM LVGSAVRECL
SSGLWSGSET RCLAGHCGIP ELIVNGQVIG ENYGYRDTVV YQCNPGFRLI GSSVRICQQD HNWSGQLPSC VPVSCGHPGS PIYGRTSGNG FNFNDVVTFS
CNIGYLMQGP TKAQCQANRQ WSHPPPVCKV VNCSDPGIPA NSKRESKIEH GNFTYGTVVF YDCNPGYFLF GSSVLICQPN GQWDKPLPEC IMIDCGHPGI
PPNAVLSGEK YTFGSTVHYS CTGKRSLLGQ ASRTCQLNGH WSGSQPHCSG DTTGTCGDPG TPGHGSRQES DFRTKSTVRF ACDTGYILYG SEERTCLSNG
S
WTGRQPECK AVQCGNPGTT ANGKVFRIDG TTFSSSVIYS CLEGYILSGP SVRQCTANGT WSGSLPNCTI ISCGDPGIPA NGLRYGDDFV VGQNVSYMCQ
PGYTIELNGS RVRTCTTNGT WSGVMPTCRA VTCSTPPQIS NGRLEGTNFD WGFSISYICS AGYELSFPAV LTCVGNGTWS GEVPQCLPKF CGDPGIPSQG
KREGKSFIYQ SEVSFSCNSP FILVGSSTRL CQTDGTWSGS SPHCIEPTRT SCENPGVPRH GSQNNTFGFQ VGSVVQFHCK KGHLLQGSTT RTCLPDLTWS
GIQPECIPHS CKQPESPAHA NVVGMDLPSH GYTLIYTCQP GFFLAGGTEH RVCRSDNTWT GKVPVCEAGS KILVKDPRPA LGTPSPKLSV PDDVFAQNYI
WKGSYNFKGR KQPMTLTVTS FNASTGRVNA TLSNSDMELL LSGVYKSQEA RLMLHIYLIK VPAHASVKKM KEENWAMDGF VSAEPDGATY VFQGFIKGKD
YGQFGLQRLG LNTSEGSNSS NQPHGTNSSS VAIAILVPFF ALIFAGFGFY LYKQRTAPKT QYTGCSVHEN NNGQAAFENP MYDTNAKSVE GKAVRFDPNL
NTVCTMV

N-glycosylation sites

Potential sites Identified Peptide*1
Position Sequon Position Sequence Unique or Shared
73 NGT
90 NCT
484 NLC
724 NFT
823 NIT 821 - 844
GFNITYNTFGHNECPDPGIPINAR
Shared ( 3 )
832 NEC
966 NGT
1092 NCT
1126 NGS
1171 NIT
1280 NIS
1536 NGT
1591 NLT
1709 NGT
1781 NYS
1929 NDS
2019 NST
2155 NFS
2286 NIT
2291 NGT
2324 NFT
2495 NIS
2537 NVT
2684 NKT
2711 NGS 2709 - 2718
IVNGSHYEYK
Shared ( 3 )
2742 NGT
2862 NWS
2932 NCS
2952 NFT
3099 NGS 3095 - 3105
TCLSNGSWTGR
Shared ( 4 )
3158 NGT
3167 NCT
3194 NVS
3208 NGS
3218 NGT
3276 NGT
3364 NNT
3522 NAS 3511 - 3527
KQPMTLTVTSFNASTGR
Shared ( 5 )
3529 NAT
3612 NTS
3618 NSS
3627 NSS
Brain
ConA
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
*1:   _:Potential Sequon     :Asn (glycosylated)     :Gln (deaminated:pyroGlu)     :Met (oxidized)     :Cys (carbamidomethylated and deaminated)

External Links

Publications

Kaji H, Shikanai T, Sasaki-Sawa A, et al Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB J Proteome Res 2012;11(9):4553-4566 PMID:22823882
Kaji H, Yamauchi Y, Takahashi N, Isobe T Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site--specific stable isotope tagging Nat Protocols 2007;1(6):3019-3027 PMID:17406563

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05