GlycoProtDB ID - GPDB0003889


Protein Name: Basement membrane-specific heparan sulfate proteoglycan core protein
Protein Accession Number: Q05793
Gene: Hspg2
Length: 3707
Status: Reviewed
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__: Potential Sequon, N: Identified Site)

Each line below holds 100 residues in blocks of 10; the lines start at positions 1, 101, 201, and so on up to 3701.
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL
EDASAKEFRE VSEAVVEKLE PEYRKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS PWGFKFRRLG TVPQFPRVCT
ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEPVPE LSSSTPAVGK VSPLPLWPEA ATTPPPPVTH GPQFLLPSVP GPSACGPQEA SCHSGHCIPR
DYLCDGQEDC RDGSDELGCA SPPPCEPNEF ACENGHCALK LWRCDGDFDC EDRTDEANCS VKQPGEVCGP THFQCVSTNR CIPASFHCDE ESDCPDRSDE
FGCMPPQVVT PPQQSIQASR GQTVTFTCVA TGVPTPIINW RLNWGHIPAH PRVTMTSEGG RGTLIIRDVK EADQGAYTCE AMNSRGMVFG IPDGVLELVP
QRGPCPDGHF YLEDSASCLP CFCFGVTNVC QSSLRFRDQI RLSFDQPNDF KGVNVTMPSQ PGVPPLSSTQ LQIDPALQEF QLVDLSRRFL VHDAFWALPK
QFLGNKVDSY GGFLRYKVRY ELARGMLEPV QKPDVILVGA GYRLHSRGHT PTHPGTLNQR QVQLSEEHWV HESGRPVQRA EMLQALASLE AVLLQTVYNT
KMASVGLSDI VMDTTVTHTT IHGRAHSVEE CRCPIGYSGL SCESCDAHFT RVPGGPYLGT CSGCNCNGHA SSCDPVYGHC LNCQHNTEGP QCDKCKPGFF
GDATKATATA CRPCPCPYID ASRRFSDTCF LDTDGQATCD ACAPGYTGRR CESCAPGYEG NPIQPGGKCR PTTQEIVRCD ERGSLGTSGE TCRCKNNVVG
RLCNECSDGS FHLSKQNPDG CLKCFCMGVS RQCSSSSWSR AQVLGASEQP SQFSLSNAAG THTTSEGVSS PAPGELSFSS FHNLLSEPYF WSLPASFRGD
KVTSYGGELR FTVMQRPRPS SAPLHRQPLV VLQGNNIVLE HHASRDPSPG QPSNFIVPFQ EQAWQRPDGQ PATREHLLMA LAGIDALLIQ ASYTQQPAES
RLSGISMDVA VPENTGQDSA REVEQCTCPP GYRGPSCQDC DTGYTRVPSG LYLGTCERCN CHGHSETCEP ETGACQSCQH HTEGASCEQC QPGYYGDAQR
GTPQDCQPCP CYGAPAAGQA AHTCFLDTDG HPTCDSCSPG HSGRHCERCA PGYYGNPSQG QPCHRDGQVP EVLGCGCDPH GSISSQCDAA GQCQCKAQVE
GRSCSHCRPH HFHLSASNPE GCLPCFCMGV TQQCASSSYS RQLISTHFAP GDFQGFALVN PQRNSQLTGG FTVEPVHDGA RLSFSNFAHL GQESFYWQLP
EIYQGDKVAA YGGKLRYTLS YTAGPQGSPL LDPDIQITGN NIMLVASQPA LQGPERRSYE IIFREEFWRR PDGQPATREH LLMALADLDE LLVRATFSSV
PRAASISAVS LEGAQPGPSS GPRALEVEEC RCPPGYVGLS CQDCAPGYTR TGSGLYLGQC ELCECNGHSD LCHPETGACS RCQHNTAGEF CELCATGYYG
DATAGTPEDC QPCACPLTNP ENMFSRTCES LGAGGYRCTA CEPGYTGQYC EQCAPGYEGD PNVQGGRCQP LTKESLEVQI HPSRSVVPQG GPHSLRCQVS
GSPPHYFYWS REDGRPLPSS AQQRHQGSEL HFPSVQPSDA GVYICTCRNL IHTSNSRAEL LVAEAPSKPI MVTVEEQRSQ SVRPGADVTF ICTAKSKSPA
YTLVWTRLHN GKLPSRAMDF NGILTIRNVQ PSDAGTYVCT GSNMFAMDQG TATLHVQVSG TSTAPVASIH PPQLTVQPGQ QAEFRCSATG NPTPMLEWIG
GPSGQLPAKA QIHNGILRLP AIEPSDQGQY LCRALSSAGQ HVARAMLQVH GGSGPRVQVS PERTQVHEGR TVRLYCRAAG VPSASITWRK EGGSLPFRHQ
AHGSRLRLHH MSVADSGEYV CRANNNIDAQ ETSIMISVSP STNSPPAPAS PAPIRIESSS SRVAEGQTLD LNCVVPGHAH AQVTWHKRGG SLPTHHQTHG
SRLRLYQVSS ADSGEYVCSV LSSSGPLEAS VLVSITPAAA NVHIPGVVPP IRIETSSSRV AEGQTLDLSC VVPGQAHAQV TWHKRGGSLP AGHQVHGHML
RLNRVSPADS GEYSCQVTGS SGTLEASVLV TIEASEPSPI PAPGLAQPVY IESSSSHLTE GQTVDLKCVV PGQAHAQVTW HKRGSSLPAR HQTHGSLLRL
YQLSPADSGE YVCQVAGSSH PEHEASFKLT VPSSQNSSFR LRSPVISIEP PSSTVQQGQD ASFKCLIHEG AMPIKVEWKI RDQELEDNVH ISPNGSIITI
VAPGPATMEP TACVASNVYG MAQSVVNLSV HGPPTVSVLP EGPVHVKMGK DITLECISSG EPRSSPRWTR LGIPVKLEPR MFGLMNSHAM LKIASVKPSD
AGTYVCQAQN ALGTAQKQVE LIVDTGTVAP GTPQVQVEES ELTLEAGHTA TLHCSATGNP PPTIHWSKLR APLPWQHRIE GNTLVIPRVA QQDSGQYICN
ATNSAGHTEA TVVLHVESPP YATIIPEHTS AQPGNLVQLQ CLAHGTPPLT YQWSLVGGVL PEKAVVRNQL LRLEPTVPED SGRYRCQVSN RVGSAEAFAQ
VLVQGSSSNL PDTSIPGGST PTVQVTPQLE TRNIGASVEF HCAVPNERGT HLRWLKEGGQ LPPGHSVQDG VLRIQNLDQN CQGTYVCQAH GPWGQAQATA
QLIVQALPSV LINVRTSVHS VVVGHSVEFE CLALGDPKPQ VTWSKVGGHL RPGIVQSGTI IRIAHVELAD AGQYRCAATN AAGTTQSHVL LLVQALPQIS
TPPEIRVPAG SAAVFPCMAS GYPTPAITWS KVDGDLPPDS RLENNMLMLP SVRPEDAGTY VCTATNRQGK VKAFAYLQVP ERVIPYFTQT PYSFLPLPTI
KDAYRKFEIK ITFRPDSADG MLLYNGQKRS PTNLANRQPD FISFGLVGGR PEFRFDAGSG MATIRHPTPL ALGQFHTVTL LRSLTQGSLI VGNLAPVNGT
SQGKFQGLDL NEELYLGGYP DYGAIPKAGL SSGFVGCVRE LRIQGEEIVF HDVNLTTHGI SHCPTCQDRP CQNGGQCQDS ESSSYTCVCP AGFTAAAVNI
RKPCTATPSL WADATCVNRP DGRGYTCRCH LGRSGVRCEE GVTVTTPSMS GAGSYLALPA LTNTHHELRL DVEFKPLEPN GILLFSGGKS GPVEDFVSLA
MVGGHLEFRY ELGSGLAVLR SHEPLALGRW HRVSAERLNK DGSLRVDGGR PVLRSSPGKS QGLNLHTLLY LGGVEPSVQL SPATNMSAHF HGCVGEVSVN
GKRLDLTYSF LGSQGVGQCY DSSPCERQPC RNGATCMPAG EYEFQCLCQD GFKGDLCEHE ENPCQLHEPC LNGGTCRGAR CLCLPGFSGP RCQQGAGYGV
VESDWHPEGS GGNDAPGQYG AYFYDNGFLG LPGNSFSRSL PEVPETIEFE VRTSTADGLL LWQGVVREAS RSKDFISLGL QDGHLVFSYQ LGSGEARLVS
GDPINDGEWH RITALREGQR GSIQVDGEDL VTGRSPGPNV AVNTKDIIYI GGAPDVATLT RGKFSSGITG CIKNLVLHTA RPGAPPPQPL DLQHRAQAGA
NTRPCPS
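
The potential sites listed for this entry can be approximated by scanning the sequence for sequons. The sketch below is illustrative only: it assumes a sequon is N-X-S/T/C with X not proline (the entry lists N-X-C sequons such as NEC and NCS alongside N-X-S/T), and the function name `find_sequons` is hypothetical, not part of GlycoProtDB.

```python
import re

# Assumed sequon definition: Asn, any residue except Pro, then Ser/Thr/Cys.
# The lookahead lets overlapping sequons be reported as well.
SEQUON = re.compile(r"(?=(N[^P][STC]))")


def find_sequons(sequence, offset=1):
    """Return (position, sequon) pairs; position is that of the Asn.

    `offset` is the protein position of the first residue passed in,
    so a fragment starting at residue 201 uses offset=201.
    """
    residues = "".join(sequence.split())  # drop the spaces between blocks of 10
    return [(m.start() + offset, m.group(1)) for m in SEQUON.finditer(residues)]


# Residues 1-100 of this entry, copied from the sequence above.
first_line = (
    "MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY "
    "SYLSDDEDLL ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL"
)
print(find_sequons(first_line))  # [(89, 'NFT')]
```

The result agrees with the first row of the site table (position 89, sequon NFT). Other rows may need a different sequon definition than the one assumed here.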

N-glycosylation sites

Potential sites   Identified Peptide*1
Position  Sequon  Position     Unique or Shared  Sequence
89        NFT
210       NEC     198 - 218    Shared (4)        VCTETEFACHSYNECVALEYR
                  208 - 218    Shared (4)        SYNECVALEYR
358       NCS     341 - 362    Shared (4)        LWRCDGDFDCEDRTDEANCSVK
                  344 - 362    Shared (4)        CDGDFDCEDRTDEANCSVK
                  349 - 362    Shared (4)        DCEDRTDEANCSVK
379       NRC     363 - 380    Shared (4)        QPGEVCGPTHFQCVSTNR
                  363 - 380    Shared (4)        QPGEVCGPTHFQCVSTNR
528       NVC
554       NVT     552 - 587    Shared (4)        GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR
                  552 - 588    Shared (4)        GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR
                  552 - 587    Shared (4)        GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR
                  552 - 588    Shared (4)        GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR
904       NEC     902 - 915    Shared (4)        LCNECSDGSFHLSK
2336      NSS     2329 - 2340  Shared (3)        LTVPSSQNSSFR
2394      NGS
2427      NLS
2600      NAT
3098      NGT     3083 - 3104  Shared (5)        SLTQGSLIVGNLAPVNGTSQGK
3154      NLT
3385      NMS     3360 - 3402  Shared (5)        SQGLNLHTLLYLGGVEPSVQLSPATNMSAHFHGCVGEVSVNGK
Tissues: Brain, Colon, Heart, Kidney, Kidney(Fut9 KO), Kidney(Fut9 WT), Liver, Liver: b4GalT-I(+/+), Liver: b4GalT-I(-/-), Lung, Skeletal_muscle, Stomach, Testis
Capture: ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 AAL ConA RCA120 AAL(+) AAL(+) RCA120 RCA120 RCA120 AAL ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 RCA120
*1: _: Potential Sequon; marked residues: Asn (glycosylated), Gln (deaminated: pyroGlu), Met (oxidized), Cys (carbamidomethylated and deaminated)
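
Each identified peptide is reported with its start and end positions in the protein, so the location of the glycosylated Asn inside a peptide follows by subtraction. A minimal sketch, assuming 1-based inclusive positions as used in this entry; the helper name `site_index_in_peptide` is hypothetical.

```python
def site_index_in_peptide(site_pos, pep_start, pep_end, peptide):
    """Return the 0-based index of the site residue within the peptide."""
    # The range is inclusive, so a peptide spanning 208-218 has 11 residues.
    if len(peptide) != pep_end - pep_start + 1:
        raise ValueError("peptide length does not match its position range")
    if not pep_start <= site_pos <= pep_end:
        raise ValueError("site lies outside the peptide")
    return site_pos - pep_start


# Site 210 (sequon NEC) in the peptide reported at 208 - 218.
idx = site_index_in_peptide(210, 208, 218, "SYNECVALEYR")
print(idx, "SYNECVALEYR"[idx:idx + 3])  # 2 NEC
```

The length check is a cheap way to catch a peptide paired with the wrong position range when rows are read from a flattened table.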

Publications

Noro E, Togayachi A, Sato T, Tomioka A, Fujita M, Sukegawa M, Suzuki N, Kaji H, Narimatsu H. Large-Scale Identification of N-Glycan Glycoproteins Carrying Lewis x and Site-Specific N-Glycan Alterations in Fut9 Knockout Mice. J Proteome Res 2015;14(9):3823-34. PMID:26244810
Sugahara D, Kaji H, Sugihara K, Asano M, Narimatsu H. Large-scale identification of target proteins of a glycosyltransferase isozyme by Lectin-IGOT-LC/MS, an LC/MS-based glycoproteomic approach. Sci Rep 2012;2(): PMID:23002422
Kaji H, Shikanai T, Sasaki-Sawa A, et al. Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB. J Proteome Res 2012;11(9):4553-4566. PMID:22823882
Kaji H, Yamauchi Y, Takahashi N, Isobe T. Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site-specific stable isotope tagging. Nat Protocols 2007;1(6):3019-3027. PMID:17406563

Entry information

Entry version: 2016-12-05
Protein sequence version: 2016-12-05
Entry status: Latest version
Entry history

create : 2016-12-05