GlycoProtDB ID - GPDB0001896


Protein Name Basement membrane-specific heparan sulfate proteoglycan core protein
Protein Accession Number B1B0C7
Gene Hspg2
Length 4375
Status
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__:Potential Sequon , N:Identified Site)

1
101
201
301
401
501
601
701
801
901
1001
1101
1201
1301
1401
1501
1601
1701
1801
1901
2001
2101
2201
2301
2401
2501
2601
2701
2801
2901
3001
3101
3201
3301
3401
3501
3601
3701
3801
3901
4001
4101
4201
4301
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL
EDASAKEFRE VSEAVVEKLE PEYRKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS PWGFKFRRLG TVPQFPRVCT
ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEPVPE LSSSTPAVGK VSPLPLWPEA ATTPPPPVTH GPQFLLPSVP GPSACGPQEA SCHSGHCIPR
DYLCDGQEDC RDGSDELGCA SPPPCEPNEF ACENGHCALK LWRCDGDFDC EDRTDEANCS VKQPGEVCGP THFQCVSTNR CIPASFHCDE ESDCPDRSDE
FGCMPPQVVT PPQQSIQASR GQTVTFTCVA TGVPTPIINW RLNWGHIPAH PRVTMTSEGG RGTLIIRDVK EADQGAYTCE AMNSRGMVFG IPDGVLELVP
QRGPCPDGHF YLEDSASCLP CFCFGVTNVC QSSLRFRDQI RLSFDQPNDF KGVNVTMPSQ PGVPPLSSTQ LQIDPALQEF QLVDLSRRFL VHDAFWALPK
QFLGNKVDSY GGFLRYKVRY ELARGMLEPV QKPDVILVGA GYRLHSRGHT PTHPGTLNQR QVQLSEEHWV HESGRPVQRA EMLQALASLE AVLLQTVYNT
KMASVGLSDI VMDTTVTHTT IHGRAHSVEE CRCPIGYSGL SCESCDAHFT RVPGGPYLGT CSGCNCNGHA SSCDPVYGHC LNCQHNTEGP QCDKCKPGFF
GDATKATATA CRPCPCPYID ASRRFSDTCF LDTDGQATCD ACAPGYTGRR CESCAPGYEG NPIQPGGKCR PTTQEIVRCD ERGSLGTSGE TCRCKNNVVG
RLCNECSDGS FHLSKQNPDG CLKCFCMGVS RQCSSSSWSR AQVLGASEQP SQFSLSNAAG THTTSEGVSS PAPGELSFSS FHNLLSEPYF WSLPASFRGD
KVTSYGGELR FTVTQRPRPS SAPLHRQPLV VLQGNNIVLE HHASRDPSPG QPSNFIVPFQ EQAWQRPDGQ PATREHLLMA LAGIDALLIQ ASYTQQPAES
RVSGISMDVA VPENTGQDSA REVEQCTCPP GYRGPSCQDC DTGYTRVPSG LYLGTCERCN CHGHSETCEP ETGACQSCQH HTEGASCEQC QPGYYGDAQR
GTPQDCQPCP CYGAPAAGQA AHTCFLDTDG HPTCDSCSPG HSGRHCERCA PGYYGNPSQG QPCHRDGQVP EVLGCGCDPH GSISSQCDAA GQCQCKAQVE
GRTCSHCRPH HFHLSASNPE GCLPCFCMGV TQQCASSSYS RQLISTHFAP GDFQGFALVN PQRNSQLTGG FTVEPVHDGA RLSFSNFAHL GQESFYWQLP
EIYQGDKVAA YGGKLRYTLS YTAGPQGSPL LDPDIQITGN NIMLVASQPA LQGPERRSYE IIFREEFWRR PDGQPATREH LLMALADLDE LLVRATFSSV
PRAASISAVS LEVAQPGPSS GPRALEVEEC RCPPGYVGLS CQDCAPGYTR TGSGLYLGQC ELCECNGHSD LCHPETGACS RCQHNTAGEF CELCATGYYG
DATAGTPEDC QPCACPLTNP ENMFSRTCES LGAGGYRCTA CEPGYTGQYC EQCAPGYEGD PNVQGGRCQP LTKESLEVQI HPSRSVVPQG GPHSLRCQVS
GSPPHYFYWS REDGRPLPSS AQQRHQGSEL HFPSVQPSDA GVYICTCRNL IHTSNSRAEL LVAEAPSKPI TVTVEEQRSQ SVRPGADVTF ICTAKSKSPA
YTLVWTRLHN GKLPSRAMDF NGILTIRNVQ PSDAGTYVCT GSNMFAMDQG TATLHVQVSG TSTAPVASIH PPQLTVQPGQ QAEFRCSATG NPTPMLEWIG
GPSGQLPAKA QIHNGILRLP AIEPSDQGQY LCRALSSAGQ HVARAMLQVH GGSGPRVQVS PERTQVHEGR TVRLYCRAAG VPSASITWRK EGGSLPPQAR
SENTDIPTLL IPAITAADAG FYLCVATSPT GTAQARIQVV VLSVPVRIES SSPSVTEGQT LDLNCAVMGL TYTQVTWYKR GGSLPPHAQV HGSRLRLPQV
SPADSGDYVC RVESDVGPKE ASIVVSVLHS PHSGPSYTPA TSITPPIRIE SSSSHVAEGQ TLDLNCVVPG QAQVTWRKRG GSLPARHQTH GSLLRLHQVS
PADSGEYVCH VVLGSEHTET SVLVTIEPAE SIPAPGPAPP VRIEASSSTV TEGHMLDLNC VVAGQAHAQV TWYKRGGSLP ARHQVRGSRL YILQASPADA
GEYVCRAGNG QEATITVTVT RNHGANLAYP PGSTSPIRIE SSSSHVAEGQ TLDLNCVVQG QAHAQVTWHK RGGSLPARHQ THGSLLRLHQ VSPVDSGEYV
CRVEGGAVPL ESSVLVTIEP AGTAPGVIPP VRIESSSSHV SEGQSLDLNC LVSGQTHPQI SWHKRGGSLP ARHQVHGSRL RLLQVTPTDS GEYVCRVVSG
SGTQEASILV TIQQTLSPSH SQSVVHPVRI ESSSPSLANG HTLDLNCLVA SLTPHTITWY KRGGSLPSRH QIVGSRLRIP QVTPADSGEY VCHVSNGAGS
QETSLIVTIE SRGPSHVPSV SPPMRIETSS PTVTEGQTLD LNCVVVGRPQ ATITWYKRGG SLPFRHQAHG SRLRLHHMSV ADSGEYVCRA NNNIDAQETS
IMISVSPSTN SPPAPASPAP IRIESSSSRV AEGQTLDLNC VVPGHAHAQV TWHKRGGSLP THHQTHGSRL RLYQVSSADS GEYVCSVLSS SGPLEASVLV
SITPAAANVH IPGVVPPIRI ETSSSRVAEG QTLDLSCVVP GQAHAQVTWH KRGGSLPAGH QVHGHMLRLN RVSPADSGEY SCQVTGSSGT LEASVLVTIE
ASEPSPIPAP GLAQPVYIES SSSHLTEGQT VDLKCVVPGQ AHAQVTWHKR GSSLPARHQT HGSLLRLYQL SPADSGEYVC QVAGSSHPEH EASFKLTVPS
SQNSSFRLRS PVISIEPPSS TVQQGQDASF KCLIHEGATP IKVEWKIRDQ ELEDNVHISP NGSIITIVGT RPSNHGAYRC VASNVYGMAQ SVVNLSVHGP
PTVSVLPEGP VHVKMGKDIT LECISSGEPR SSPRWTRLGI PVKLEPRMFG LMNSHAMLKI ASVKPSDAGT YVCQAQNALG TAQKQVELIV DTGTVAPGAP
QVQVEESELT LEAGHTATLH CSATGNPPPT IHWSKLRAPL PWQHRIEGNT LVIPRVAQQD SGQYICNATN SAGHTEATVV LHVESPPYAT IIPEHTSAQP
GNLVQLQCLA HGTPPLTYQW SLVGGVLPEK AVARNQVLRL EPTVPEDSGR YRCQVSNRVG SAEAFAQVLV QGSSSNLPDT SIPGGSTPTV QVTPQLETRN
IGASVEFHCA VPNERGTHLR WLKEGGQLPP GHSVQDGVLR IQNLDQSCQG TYVCQAHGPW GQAQATAQLI VQALPSVLIN VRTSVHSVVV GHSVEFECLA
LGDPKPQVTW SKVGGHLRPG IVQSGSIIRI AHVELADAGQ YRCAATNAAG TTQSHVLLLV QALPQISTPP EIRVPAGSAA VFPCMASGYP TPAITWSKVD
GDLPPDSRLE NNMLMLPSVR PEDAGTYVCT ATNRQGKVKA FAYLQVPERV IPYFTQTPYS FLPLPTIKDA YRKFEIKITF RPDSADGMLL YNGQKRSPTN
LANRQPDFIS FGLVGGRPEF RFDAGSGMAT IRHPTPLALG QFHTVTLLRS LTQGSLIVGN LAPVNGTSQG KFQGLDLNEE LYLGGYPDYG AIPKAGLSSG
FVGCVRELRI QGEEVVFHDV NLTTHGISHC PTCQDRPCQN GGQCQDSESS SYTCVCPAGF TGSRCEHSQA LHCHPEACGP DATCVNRPDG RGYTCRCHLG
RSGVRCEEGV TVTTPSMSGA GSYLALPALT NMHHELRLDV EFKPLEPNGI LLFSGGKSGP VEDFVSLAMV GGHLEFRYEL GSGLAVLRSH EPLTLGRWHR
VSAERLNKDG SLRVDGGRPV LRSSPGKSQG LNLHTLLYLG GVEPSVQLSP ATNMSAHFHG CVGEVSVNGK RLDLTYSFLG SQGVGQCYDS SPCERQPCQN
GATCMPAGEY EFQCLCQDGF KGDLCEHEEN PCQLHEPCLN GGTCRGARCL CLPGFSGPRC QQGAGYGVVE SDWHPEGSGG NDAPGQYGAY FYDNGFLGLP
GNSFSRSLPE VPETIEFEVR TSTADGLLLW QGVVREASRS KDFISLGLQD GHLVFSYQLG SGEARLVSED PINDGEWHRI TALREGQRGS IQVDGEDLVT
GRSPGPNVAV NTKDIIYIGG APDVATLTRG KFSSGITGCI KNLVLHTARP GAPPPQPLDL QHRAQAGANT RPCPS

N-glycosylation sites

Potential sites Identified Peptide*1
Position Sequon Position Sequence Unique or Shared
89 NFT
210 NEC 198 - 218
VCTETEFACHSYNECVALEYR
Shared ( 4 )
208 - 218
SYNECVALEYR
Shared ( 4 )
358 NCS 341 - 362
LWRCDGDFDCEDRTDEANCSVK
Shared ( 4 )
344 - 362
CDGDFDCEDRTDEANCSVK
Shared ( 4 )
349 - 362
DCEDRTDEANCSVK
Shared ( 4 )
379 NRC 363 - 380
QPGEVCGPTHFQCVSTNR
Shared ( 4 )
363 - 380
QPGEVCGPTHFQCVSTNR
Shared ( 4 )
528 NVC
554 NVT 552 - 587
GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR
Shared ( 4 )
552 - 588
GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR
Shared ( 4 )
552 - 587
GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR
Shared ( 4 )
552 - 588
GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR
Shared ( 4 )
904 NEC 902 - 915
LCNECSDGSFHLSK
Shared ( 4 )
3003 NSS 2996 - 3007
LTVPSSQNSSFR
Shared ( 3 )
3061 NGS 3049 - 3079
DQELEDNVHISPNGSIITIVGTRPSNHGAYR
Shared ( 2 )
3047 - 3079
IRDQELEDNVHISPNGSIITIVGTRPSNHGAYR
Shared ( 2 )
3094 NLS
3267 NAT
3765 NGT 3750 - 3771
SLTQGSLIVGNLAPVNGTSQGK
Shared ( 5 )
3821 NLT 3810 - 3823
IQGEEVVFHDVNLT
Shared ( 3 )
3810 - 3838
IQGEEVVFHDVNLTTHGISHCPTCQDRPC
Shared ( 3 )
4053 NMS 4028 - 4070
SQGLNLHTLLYLGGVEPSVQLSPATNMSAHFHGCVGEVSVNGK
Shared ( 5 )
Brain Colon Heart Kidney Kidney(Fut9 KO) Kidney(Fut9 WT) Liver Liver: b4GalT-I(+/+) Liver: b4GalT-I(-/-) Lung Skeletal_muscle Stomach Testis
ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 AAL ConA RCA120 AAL(+) AAL(+) RCA120 RCA120 RCA120 AAL ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 RCA120
                                                   
   
   
 
     
                     
 
                     
                         
   
   
 
     
                 
 
 
             
     
                         
                       
                         
         
                                     
                     
                           
                                                   
                     
                         
                     
                         
                     
                           
                     
                         
   
   
         
                         
 
   
 
 
 
 
   
                     
                         
   
 
         
 
       
       
                                                   
                                                   
 
 
                       
                         
                       
                         
                     
                           
*1:   _:Potential Sequon     :Asn (glycosylated)     :Gln (deaminated:pyroGlu)     :Met (oxidized)     :Cys (carbamidomethylated and deaminated)

External Links

Publications

Noro E, Togayachi A, Sato T, Tomioka A, Fujita M, Sukegawa M, Suzuki N, Kaji H, Narimatsu H Large-Scale Identification of N-Glycan Glycoproteins Carrying Lewis x and Site-Specific N-Glycan Alterations in Fut9 Knockout Mice. J Proteome Res 2015;14(9):3823-34 PMID:26244810
Sugahara D, Kaji H, Sugihara K, Asano M, Narimatsu H Large-scale identification of target proteins of a glycosyltransferase isozyme by Lectin-IGOT-LC/MS, an LC/MS-based glycoproteomic approach Sci Rep 2012;2(): PMID:23002422
Kaji H, Shikanai T, Sasaki-Sawa A, et al Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB J Proteome Res 2012;11(9):4553-4566 PMID:22823882
Kaji H, Yamauchi Y, Takahashi N, Isobe T Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site--specific stable isotope tagging Nat Protocols 2007;1(6):3019-3027 PMID:17406563

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05