GlycoProtDB ID - GPDB0007171


Protein Name CUB and sushi domain-containing protein 1
Protein Accession Number Q923L3
Gene Csmd1
Length 3564
Status
Reviewed
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__:Potential Sequon , N:Identified Site)

1
101
201
301
401
501
601
701
801
901
1001
1101
1201
1301
1401
1501
1601
1701
1801
1901
2001
2101
2201
2301
2401
2501
2601
2701
2801
2901
3001
3101
3201
3301
3401
3501
MTAWRKFKSL LLPLVLAVLC AGLLTAAKGQ NCGGLVQGPN GTIESPGFPH GYPNYANCTW IIITGERNRI QLSFHTFALE EDFDILSVYD GQPQQGNLKV
RLSGFQLPSS IVSTGSLLTL WFTTDFAVSA QGFKAMYEVL PSHTCGNPGE ILKGVLHGTR FNIGDKIRYS CLSGYILEGH AILTCIVSPG NGASWDFPAP
FCRAEGACGG TLRGTSGSIS SPHFPSEYDN NADCTWTILA EPGDTIALVF TDFQLEEGYD FLEISGTEAP SIWLTGMNLP SPVISSKNWL RLHFTSDSNH
RRKGFNAQFQ VKKAIELKSR GVKMLPSKDS SHKNSVLTQG GVSLISDMCP DPGIPDNGRR AGSDFRVGAN VQFSCEDNYV LQGAKGITCQ RVTETLAAWN
DHRPICRART CGSNLRGPSG VITSPNYPVQ YEDNAHCVWV ITTTDPDKVI KLAFEEFELE RGYDTLTVGD AGKVGDTRSV LYVLTGSSVP DLIVSMSNQM
WLHLQSDDSI GSPGFKAVYQ EIEKGGCGDP GIPAYGKRTG SSFLHGDTLT FECQAAFELV GERVITCQKN NQWSGNKPSC VFSCFFNFTA PSGIILSPNY
PEEYGNNMNC VWLIISEPGS RIHLIFNDFD VEPQFDFLAV KDDGISDITV LGTFSGNEVP AQLASSGHIV RLEFQSDHST TGRGFNITYT TFGQNECHDP
GIPVNGRRFG DRFLLGSSVS FHCDDGFVKT QGSESITCIL QDGNVVWSST VPRCEAPCGG HLTASSGVIL PPGWPGYYKD SLNCEWVIEA KPGHSIKITF
DRFQTEVNYD TLEVRDGPTS SSPLIGEYHG TQAPQFLIST GNYMYLLFTT DSSRASVGFL IHYESVTLES DSCLDPGIPV NGQRHGSNFG IRSTVTFSCD
PGYTLSDDEP LVCEKNHQWN HALPSCDALC GGYIHGKSGT VLSPGFPDFY PNSLNCTWTI EVSHGKGVQM NFHTFHLESS HDYLLITEDG SFSEPVARLT
GSVLPHTIKA GLFGNFTAQL RFISDFSISY EGFNITFAEY DLEPCDDPGV PAFSRRIGFQ FGVGDTLAFT CFQGYRLEGA TKLTCLGGGR RVWSAPLPRC
VAECGASVKG NEGTLLSPNF PSHYDNNHEC IYKIETEAGK GIHLRARTFQ LFEGDTLKVY DGKDSSSRSL GVFTRSEFMG LVLNSTSNHL RLEFNTNGSD
TAQGFQLTYT SFDLVKCEDP GIPNYGYRIR DDGHFTDTVV LYSCNPGYAM HGSSTLTCLS GDRRVWDKPM PSCVAECGGL VHAATSGRIL SPGYPAPYDN
NLHCTWTIEA DPGKTISLHF IVFDTETAHD ILKVWDGPVD SNILLKEWSG SALPEDIHST FNSLTLQFDS DFFISKSGFS IQFSTSIAST CNDPGMPQNG
T
RYGDSREPG DTITFQCDPG YQLQGPAKIT CVQLNNRFFW QPDPPSCIAA CGGNLTGPAG VILSPNYPQP YPPGKECDWR IKVNPDFVIA LIFKSFSMEP
SYDFLHIYEG EDSNSPLIGS FQGSQAPERI ESSGNSLFLA FRSDASVGLS GFAIEFKEKP REACFDPGNI MNGTRIGTDF KLGSTVTYQC DSGYKIVDPS
SIECVTGADG KPSWDRALPA CQAPCGGQYT GSEGVVLSPN YPHNYTAGQM CIYSITVPKE FVVFGQFAYF QTALNDLAEL FDGTHPQARL LSSLSGSHSG
ETLPLATSNQ ILLRFSAKSG ASARGFHFVY QAVPRTSDTQ CSSVPEPRYG RRIGSEFSAG SIVRFECNPG YLLQGSTAIR CQSVPNALAQ WNDTIPSCVV
PCSGNFTQRR GTILSPGYPE PYGNNLNCVW KIIVSEGSGI QIQVISFATE QNWDSLEIHD GGDMTAPRLG SFSGTTVPAL LNSTSNQLCL HFQSDISVAA
AGFHLEYKTV GLAACQEPAL PSNGIKIGDR YMVNDVLSFQ CEPGYTLQGR SHISCMPGTV RRWNYPSPLC IATCGGTLTS MSGVILSPGF PGSYPNNLDC
TWKISLPIGY GAHIQFLNFS TEANHDYLEI QNGPYHSSPM MGQFSGPDLP TSLLSTTHET LIRFYSDHSQ NRQGFKLSYQ AYELQNCPDP PAFQNGFMIN
SDYSVGQSIS FECYPGYILL GHPVLTCQHG TDRNWNYPFP RCDAPCGYNV TSQNGTIYSP GFPDEYPILK DCLWLVTVPP GHGVYINFTL LQTEAVNDYI
AVWDGPDQNS PQLGVFSGNT ALETAYSSTN QVLLKFHSDF SNGGFFVLNF HAFQLKRCPP PPAVPQADLL TEDEDFEIGD FVKYQCHPGY TLLGSDTLTC
KLSSQLLFQG SPPTCEAQCP ANEVRTESSG VILSPGYPGN YFNSQTCAWS IKVEPNFNIT LFVDTFQSEK QFDALEVFDG SSGQSPLLVV LSGNHTEQSN
FT
SRSNHLYL RWSTDHATSK KGFKIRYAAP YCSLTSTLRN GGILNKTAGA VGSKVHYFCK PGYRMIGHSN ATCRRNPVGV YQWDSMAPLC QAVSCGIPEA
PGNGSFTGNE FTLDSKVTYE CNEGFKLDAS QEATTVCQED GLWSNRGKPP TCKPVPCPSI EGQLSEHVLW RLVSGSLNEY GAQVLLSCSP GYFLQGQRLL
QCQANGTWST EEDRPRCKVI SCGSLSFPPN GNKIGTLTIY GATAIFTCNT GYTLVGSHVR ECLANGLWSG SETRCLAGHC GSPDPIVNGH ISGDGFSYRD
TVVYQCNPGF RLVGTSVRIC LQDHKWSGQT PVCVPITCGH PGNPAHGLTN GTEFNLNDLV NFTCHTGYRL QGASRAQCRS NGQWSSPLPI CRVVNCSDPG
SVENAVRHGQ QNFPESFEYG TSVMYHCKTG FYLLGSSALT CMASGLWDRS LPKCLAISCG HPGVPANAVL TGELFTYGAT VQYSCKGGQI LTGNSTRVCQ
EDSHWSGSLP HCSGNSPGFC GDPGTPAHGS RLGDEFKTKS LLRFSCEMGH QLRGSAERTC LVNGSWSGVQ PVCEAVSCGN PGTPTNGMIL SSDGILFSSS
VIYACWEGYK TSGLMTRHCT ANGTWTGTAP DCTIISCGDP GTLPNGIQFG TDFTFNKTVS YQCNPGYLME PPTSPTIRCT KDGTWNQSRP LCKAVLCNQP
PPVPNGKVEG SDFRWGASIS YSCVDGYQLS HSAILSCEGR GVWKGEVPQC LPVFCGDPGT PAEGRLSGKS FTFKSEVFIQ CKPPFVLVGS SRRTCQADGI
WSGIQPTCID PAHTACPDPG TPHFGIQNSS KGYEVGSTVF FRCRKGYHIQ GSTTRTCLAN LTWSGIQTEC IPHACRQPET PAHADVRAID LPAFGYTLVY
TCHPGFFLAG GSEHRTCKAD MKWTGKSPVC KSKGVREVNE TVTKTPVPSD VFFINSVWKG YYEYLGKRQP ATLTVDWFNA TSSKVNATFT AASRVQLELT
GVYKKEEAHL LLKAFHIKGP ADIFVSKFEN DNWGLDGYVS SGLERGGFSF QGDIHGKDFG KFKLERQDPS NSDADSSNHY QGTSSGSVAA AILVPFFALI
LSGFAFYLYK HRTRPKVQYN GYAGHENSNG QASFENPMYD TNLKPTEAKA VRFDTTLNTV CTVV

N-glycosylation sites

Potential sites Identified Peptide*1
Position Sequon Position Sequence Unique or Shared
40 NGT
57 NCT
587 NFT
686 NIT
695 NEC
955 NCT
1015 NFT
1034 NIT
1184 NST
1197 NGS
1399 NGT
1454 NLT
1572 NGT 1562 - 1575
EACFDPGNIMNGTR
Shared ( 2 )
1644 NYT
1792 NDT
1805 NFT
1882 NST
2018 NFS
2149 NVT
2154 NGT
2187 NFT
2358 NIT
2394 NHT
2400 NFT
2445 NKT
2470 NAT
2503 NGS
2605 NGT 2599 - 2616
LLQCQANGTWSTEEDRPR
Shared ( 2 )
2750 NGT
2761 NFT
2795 NCS 2793 - 2807
VVNCSDPGSVENAVR
Shared ( 3 )
2894 NST
2963 NGS
3022 NGT
3056 NKT
3086 NQS
3228 NSS
3260 NLT 3256 - 3276
TCLANLTWSGIQTECIPHACR
Shared ( 3 )
3339 NET
3379 NAT 3368 - 3384
RQPATLTVDWFNATSSK
Shared ( 3 )
3386 NAT 3385 - 3394
VNATFTAASR
Shared ( 2 )
Brain
ConA
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
*1:   _:Potential Sequon     :Asn (glycosylated)     :Gln (deaminated:pyroGlu)     :Met (oxidized)     :Cys (carbamidomethylated and deaminated)

External Links

Publications

Kaji H, Shikanai T, Sasaki-Sawa A, et al Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB J Proteome Res 2012;11(9):4553-4566 PMID:22823882
Kaji H, Yamauchi Y, Takahashi N, Isobe T Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site--specific stable isotope tagging Nat Protocols 2007;1(6):3019-3027 PMID:17406563

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05