GlycoProtDB ID - GPDB0002433


Protein Name Basement membrane-specific heparan sulfate proteoglycan core protein
Protein Accession Number E9PZ16
Gene Hspg2
Length 4383
Status
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__:Potential Sequon , N:Identified Site)

(Each full line below holds 100 residues in blocks of 10; the lines start at positions 1, 101, 201, and so on up to 4301.)
MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY SYLSDDEDLL ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL
EDASAKEFRE VSEAVVEKLE PEYRKIPGDQ IVSVVFIKEL DGWVFVELDV GSEGNADGSQ IQEVLHTVVS SGSIGPYVTS PWGFKFRRLG TVPQFPRVCT
ETEFACHSYN ECVALEYRCD RRPDCRDMSD ELNCEEPVPE LSSSTPAVGK VSPLPLWPEA ATTPPPPVTH GPQFLLPSVP GPSACGPQEA SCHSGHCIPR
DYLCDGQEDC RDGSDELGCA SPPPCEPNEF ACENGHCALK LWRCDGDFDC EDRTDEANCS VKQPGEVCGP THFQCVSTNR CIPASFHCDE ESDCPDRSDE
FGCMPPQVVT PPQQSIQASR GQTVTFTCVA TGVPTPIINW RLNWGHIPAH PRVTMTSEGG RGTLIIRDVK EADQGAYTCE AMNSRGMVFG IPDGVLELVP
QRGPCPDGHF YLEDSASCLP CFCFGVTNVC QSSLRFRDQI RLSFDQPNDF KGVNVTMPSQ PGVPPLSSTQ LQIDPALQEF QLVDLSRRFL VHDAFWALPK
QFLGNKVDSY GGFLRYKVRY ELARGMLEPV QKPDVILVGA GYRLHSRGHT PTHPGTLNQR QVQLSEEHWV HESGRPVQRA EMLQALASLE AVLLQTVYNT
KMASVGLSDI VMDTTVTHTT IHGRAHSVEE CRCPIGYSGL SCESCDAHFT RVPGGPYLGT CSGCNCNGHA SSCDPVYGHC LNCQHNTEGP QCDKCKPGFF
GDATKATATA CRPCPCPYID ASRRFSDTCF LDTDGQATCD ACAPGYTGRR CESCAPGYEG NPIQPGGKCR PTTQEIVRCD ERGSLGTSGE TCRCKNNVVG
RLCNECSDGS FHLSKQNPDG CLKCFCMGVS RQCSSSSWSR AQVLGASEQP SQFSLSNAAG THTTSEGVSS PAPGELSFSS FHNLLSEPYF WSLPASFRGD
KVTSYGGELR FTVTQRPRPS SAPLHRQPLV VLQGNNIVLE HHASRDPSPG QPSNFIVPFQ EQAWQRPDGQ PATREHLLMA LAGIDALLIQ ASYTQQPAES
RVSGISMDVA VPENTGQDSA REVEQCTCPP GYRGPSCQDC DTGYTRVPSG LYLGTCERCN CHGHSETCEP ETGACQSCQH HTEGASCEQC QPGYYGDAQR
GTPQDCQPCP CYGAPAAGQA AHTCFLDTDG HPTCDSCSPG HSGRHCERCA PGYYGNPSQG QPCHRDGQVP EVLGCGCDPH GSISSQCDAA GQCQCKAQVE
GRTCSHCRPH HFHLSASNPE GCLPCFCMGV TQQCASSSYS RQLISTHFAP GDFQGFALVN PQRNSQLTGG FTVEPVHDGA RLSFSNFAHL GQESFYWQLP
EIYQGDKVAA YGGKLRYTLS YTAGPQGSPL LDPDIQITGN NIMLVASQPA LQGPERRSYE IIFREEFWRR PDGQPATREH LLMALADLDE LLVRATFSSV
PRAASISAVS LEVAQPGPSS GPRALEVEEC RCPPGYVGLS CQDCAPGYTR TGSGLYLGQC ELCECNGHSD LCHPETGACS RCQHNTAGEF CELCATGYYG
DATAGTPEDC QPCACPLTNP ENMFSRTCES LGAGGYRCTA CEPGYTGQYC EQCAPGYEGD PNVQGGRCQP LTKESLEVQI HPSRSVVPQG GPHSLRCQVS
GSPPHYFYWS REDGRPLPSS AQQRHQGSEL HFPSVQPSDA GVYICTCRNL IHTSNSRAEL LVAEAPSKPI TVTVEEQRSQ SVRPGADVTF ICTAKSKSPA
YTLVWTRLHN GKLPSRAMDF NGILTIRNVQ PSDAGTYVCT GSNMFAMDQG TATLHVQVSG TSTAPVASIH PPQLTVQPGQ QAEFRCSATG NPTPMLEWIG
GPSGQLPAKA QIHNGILRLP AIEPSDQGQY LCRALSSAGQ HVARAMLQVH GGSGPRVQVS PERTQVHEGR TVRLYCRAAG VPSASITWRK EGGSLPPQAR
SENTDIPTLL IPAITAADAG FYLCVATSPT GTAQARIQVV VLSASGANSV PVRIESSSPS VTEGQTLDLN CAVMGLTYTQ VTWYKRGGSL PPHAQVHGSR
LRLPQVSPAD SGDYVCRVES DVGPKEASIV VSVLHSPHSG PSYTPATSIT PPIRIESSSS HVAEGQTLDL NCVVPGQAQV TWRKRGGSLP ARHQTHGSLL
RLHQVSPADS GEYVCHVVLG SEHTETSVLV TIEPAESIPA PGPAPPVRIE ASSSTVTEGH MLDLNCVVAG QAHAQVTWYK RGGSLPARHQ VRGSRLYILQ
ASPADAGEYV CRAGNGQEAT ITVTVTRNHG ANLAYPPGST SPIRIESSSS HVAEGQTLDL NCVVQGQAHA QVTWHKRGGS LPARHQTHGS LLRLHQVSPV
DSGEYVCRVE GGAVPLESSV LVTIEPAGTA PGVIPPVRIE SSSSHVSEGQ SLDLNCLVSG QTHPQISWHK RGGSLPARHQ VHGSRLRLLQ VTPTDSGEYV
CRVVSGSGTQ EASILVTIQQ TLSPSHSQSV VHPVRIESSS PSLANGHTLD LNCLVASLTP HTITWYKRGG SLPSRHQIVG SRLRIPQVTP ADSGEYVCHV
SNGAGSQETS LIVTIESRGP SHVPSVSPPM RIETSSPTVT EGQTLDLNCV VVGRPQATIT WYKRGGSLPF RHQAHGSRLR LHHMSVADSG EYVCRANNNI
DAQETSIMIS VSPSTNSPPA PASPAPIRIE SSSSRVAEGQ TLDLNCVVPG HAHAQVTWHK RGGSLPTHHQ THGSRLRLYQ VSSADSGEYV CSVLSSSGPL
EASVLVSITP AAANVHIPGE VPFPPIRIET SSSRVAEGQT LDLSCVVPGQ AHAQVTWHKR GGSLPAGHQV HGHMLRLNRV SPADSGEYSC QVTGSSGTLE
ASVLVTIEAS EPSPIPAPGL AQPVYIESSS SHLTEGQTVD LKCVVPGQAH AQVTWHKRGS SLPARHQTHG SLLRLYQLSP ADSGEYVCQV AGSSHPEHEA
SFKLTVPSSQ NSSFRLRSPV ISIEPPSSTV QQGQDASFKC LIHEGATPIK VEWKIRDQEL EDNVHISPNG SIITIVGTRP SNHGAYRCVA SNVYGMAQSV
VNLSVHGPPT VSVLPEGPVH VKMGKDITLE CISSGEPRSS PRWTRLGIPV KLEPRMFGLM NSHAMLKIAS VKPSDAGTYV CQAQNALGTA QKQVELIVDT
GTVAPGAPQV QVEESELTLE AGHTATLHCS ATGNPPPTIH WSKLRAPLPW QHRIEGNTLV IPRVAQQDSG QYICNATNSA GHTEATVVLH VESPPYATII
PEHTSAQPGN LVQLQCLAHG TPPLTYQWSL VGGVLPEKAV ARNQVLRLEP TVPEDSGRYR CQVSNRVGSA EAFAQVLVQG SSSNLPDTSI PGGSTPTVQV
TPQLETRNIG ASVEFHCAVP NERGTHLRWL KEGGQLPPGH SVQDGVLRIQ NLDQSCQGTY VCQAHGPWGQ AQATAQLIVQ ALPSVLINVR TSVHSVVVGH
SVEFECLALG DPKPQVTWSK VGGHLRPGIV QSGSIIRIAH VELADAGQYR CAATNAAGTT QSHVLLLVQA LPQISTPPEI RVPAGSAAVF PCMASGYPTP
AITWSKVDGD LPPDSRLENN MLMLPSVRPE DAGTYVCTAT NRQGKVKAFA YLQVPERVIP YFTQTPYSFL PLPTIKDAYR KFEIKITFRP DSADGMLLYN
GQKRSPTNLA NRQPDFISFG LVGGRPEFRF DAGSGMATIR HPTPLALGQF HTVTLLRSLT QGSLIVGNLA PVNGTSQGKF QGLDLNEELY LGGYPDYGAI
PKAGLSSGFV GCVRELRIQG EEVVFHDVNL TTHGISHCPT CQDRPCQNGG QCQDSESSSY TCVCPAGFTG SRCEHSQALH CHPEACGPDA TCVNRPDGRG
YTCRCHLGRS GVRCEEGVTV TTPSMSGAGS YLALPALTNM HHELRLDVEF KPLEPNGILL FSGGKSGPVE DFVSLAMVGG HLEFRYELGS GLAVLRSHEP
LTLGRWHRVS AERLNKDGSL RVDGGRPVLR SSPGKSQGLN LHTLLYLGGV EPSVQLSPAT NMSAHFHGCV GEVSVNGKRL DLTYSFLGSQ GVGQCYDSSP
CERQPCQNGA TCMPAGEYEF QCLCQDGFKG DLCEHEENPC QLHEPCLNGG TCRGARCLCL PGFSGPRCQQ GAGYGVVESD WHPEGSGGND APGQYGAYFY
DNGFLGLPGN SFSRSLPEVP ETIEFEVRTS TADGLLLWQG VVREASRSKD FISLGLQDGH LVFSYQLGSG EARLVSEDPI NDGEWHRITA LREGQRGSIQ
VDGEDLVTGR SPGPNVAVNT KDIIYIGGAP DVATLTRGKF SSGITGCIKN LVLHTARPGA PPPQPLDLQH RAQAGANTRP CPS

N-glycosylation sites

Potential sites     Identified Peptide*1
Position  Sequon    Position       Sequence                                       Unique or Shared
89        NFT
210       NEC       198 - 218      VCTETEFACHSYNECVALEYR                          Shared ( 4 )
                    208 - 218      SYNECVALEYR                                    Shared ( 4 )
358       NCS       341 - 362      LWRCDGDFDCEDRTDEANCSVK                         Shared ( 4 )
                    344 - 362      CDGDFDCEDRTDEANCSVK                            Shared ( 4 )
                    349 - 362      DCEDRTDEANCSVK                                 Shared ( 4 )
379       NRC       363 - 380      QPGEVCGPTHFQCVSTNR                             Shared ( 4 )
                    363 - 380      QPGEVCGPTHFQCVSTNR                             Shared ( 4 )
528       NVC
554       NVT       552 - 587      GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR           Shared ( 4 )
                    552 - 588      GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR          Shared ( 4 )
                    552 - 587      GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSR           Shared ( 4 )
                    552 - 588      GVNVTMPSQPGVPPLSSTQLQIDPALQEFQLVDLSRR          Shared ( 4 )
904       NEC       902 - 915      LCNECSDGSFHLSK                                 Shared ( 4 )
3011      NSS       3004 - 3015    LTVPSSQNSSFR                                   Shared ( 3 )
3069      NGS       3057 - 3087    DQELEDNVHISPNGSIITIVGTRPSNHGAYR                Shared ( 2 )
                    3055 - 3087    IRDQELEDNVHISPNGSIITIVGTRPSNHGAYR              Shared ( 2 )
3102      NLS
3275      NAT
3773      NGT       3758 - 3779    SLTQGSLIVGNLAPVNGTSQGK                         Shared ( 5 )
3829      NLT       3818 - 3831    IQGEEVVFHDVNLT                                 Shared ( 3 )
                    3818 - 3846    IQGEEVVFHDVNLTTHGISHCPTCQDRPC                  Shared ( 3 )
4061      NMS       4036 - 4078    SQGLNLHTLLYLGGVEPSVQLSPATNMSAHFHGCVGEVSVNGK    Shared ( 5 )
Tissues: Brain, Colon, Heart, Kidney, Kidney(Fut9 KO), Kidney(Fut9 WT), Liver, Liver: b4GalT-I(+/+), Liver: b4GalT-I(-/-), Lung, Skeletal_muscle, Stomach, Testis
Enrichment methods (in column order): ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 AAL ConA RCA120 AAL(+) AAL(+) RCA120 RCA120 RCA120 AAL ConA RCA120 Amide80 ConA RCA120 Amide80 ConA RCA120 RCA120
*1: _: Potential Sequon. Modified residues are marked for Asn (glycosylated), Gln (deaminated: pyroGlu), Met (oxidized), and Cys (carbamidomethylated and deaminated).
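
The potential sites listed above can be located by scanning the sequence for three-residue sequons. The sketch below is only illustrative: the pattern N-X-[S/T/C] with X not proline is an assumption inferred from the sequons in this entry (NFT, NCS, NEC, and so on), not a documented GlycoProtDB rule, and `find_sequons` is a hypothetical helper name.

```python
import re

def find_sequons(seq):
    """Return (1-based position, sequon) for each assumed N-X-[S/T/C] motif."""
    seq = seq.replace(" ", "")  # drop the 10-residue block spacing
    # A zero-width lookahead lets overlapping motifs be reported as well.
    # Assumption: X may be any residue except proline.
    return [(m.start() + 1, seq[m.start():m.start() + 3])
            for m in re.finditer(r"(?=N[^P][STC])", seq)]

# First 100 residues of this entry, copied from the Sequence section.
first_100 = ("MGQRAVGSLL LGLLLHARLL AVTHGLRAYD GLSLPEDTET VTASRYGWTY "
             "SYLSDDEDLL ADDASGDGLG SGDVGSGDFQ MVYFRALVNF TRSIEYSPQL")

print(find_sequons(first_100))  # [(89, 'NFT')]
```

On this fragment the scan reports position 89 (NFT), which agrees with the first row of the table.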

Publications

Noro E, Togayachi A, Sato T, Tomioka A, Fujita M, Sukegawa M, Suzuki N, Kaji H, Narimatsu H. Large-Scale Identification of N-Glycan Glycoproteins Carrying Lewis x and Site-Specific N-Glycan Alterations in Fut9 Knockout Mice. J Proteome Res 2015;14(9):3823-34. PMID:26244810
Sugahara D, Kaji H, Sugihara K, Asano M, Narimatsu H. Large-scale identification of target proteins of a glycosyltransferase isozyme by Lectin-IGOT-LC/MS, an LC/MS-based glycoproteomic approach. Sci Rep 2012;2(): PMID:23002422
Kaji H, Shikanai T, Sasaki-Sawa A, et al. Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB. J Proteome Res 2012;11(9):4553-4566. PMID:22823882
Kaji H, Yamauchi Y, Takahashi N, Isobe T. Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site-specific stable isotope tagging. Nat Protocols 2007;1(6):3019-3027. PMID:17406563

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05