GlycoProtDB ID - GPDB0003086


Protein Name Cadherin EGF LAG seven-pass G-type receptor 1
Protein Accession Number O35161
Gene Celsr1
Length 3034
Status
Reviewed
release : 2016-12-05

Glycosylation Sites

Schema of N-glycosylation site(s) of the protein

Sequence

(__:Potential Sequon , N:Identified Site)

1
101
201
301
401
501
601
701
801
901
1001
1101
1201
1301
1401
1501
1601
1701
1801
1901
2001
2101
2201
2301
2401
2501
2601
2701
2801
2901
3001
MAPSSPRVLP ALVLLAAAAL PALELGAAAW ELRVPGGARA FALGPGWSYR LDTTRTPREL LDVSREGPAA GRRLGLGAGT LGCARLAGRL LPLQVRLVAR
GAPTAPSLVL RARAYGARCG VRLLRRSARG AELRSPAVRS VPGLGDALCF PAAGGGAASL TSVLEAITNF PACSCPPVAG TGCRRGPICL RPGGSAELRL
VCALGRAAGA VWVELVIEAT SGTPSESPSV SPSLLNLSQP RAGVVRRSRR GTGSSTSPQF PLPSYQVSVP ENEPAGTAVI ELRAHDPDEG DAGRLSYQME
ALFDERSNGY FLIDAATGAV TTARSLDRET KDTHVLKVSA VDHGSPRRSA ATYLTVTVSD TNDHSPVFEQ SEYRERIREN LEVGYEVLTI RATDGDAPSN
ANMRYRLLEG AGGVFEIDAR SGVVRTRAVV DREEAAEYQL LVEANDQGRN PGPLSASATV HIVVEDENDN YPQFSEKRYV VQVPEDVAVN TAVLRVQATD
RDQGQNAAIH YSIVSGNLKG QFYLHSLSGS LDVINPLDFE AIREYTLRIK AQDGGRPPLI NSSGLVSVQV LDVNDNAPIF VSSPFQAAVL ENVPLGHSVL
HIQAVDADAG ENARLQYRLV DTASTIVGGS SVDSENPASA PDFPFQIHNS SGWITVCAEL DREEVEHYSF GVEAVDHGSP AMSSSASVSI TVLDVNDNDP
MFTQPVYELR LNEDAAVGSS VLTLRARDRD ANSVITYQLT GGNTRNRFAL SSQSGGGLIT LALPLDYKQE RQYVLAVTAS DGTRSHTAQV FINVTDANTH
RPVFQSSHYT VSVSEDRPVG TSIATISATD EDTGENARIT YVLEDPVPQF RIDPDTGTIY TMTELDYEDQ AAYTLAITAQ DNGIPQKSDT TSLEILILDA
NDNAPRFLRD FYQGSVFEDA PPSTSVLQVS ATDRDSGPNG RLLYTFQGGD DGDGDFYIEP TSGVIRTQRR LDRENVAVYN LWALAVDRGS PNPLSASVGI
QVSVLDINDN PPVFEKDELE LFVEENSPVG SVVARIRAND PDEGPNAQIM YQIVEGNVPE VFQLDLLSGD LRALVELDFE VRRDYMLVVQ ATSAPLVSRA
TVHIRLLDQN DNPPELPDFQ ILFNNYVTNK SNSFPSGVIG RIPAHDPDLS DSLNYTFLQG NELSLLLLDP ATGELQLSRD LDNNRPLEAL MEVSVSDGIH
SVTALCTLRV TIITDDMLTN SITVRLENMS QEKFLSPLLS LFVEGVATVL STTKDDIFVF NIQNDTDVSS NILNVTFSAL LPGGTRGRFF PSEDLQEQIY
LNRTLLTTIS AQRVLPFDDN ICLREPCENY MKCVSVLRFD SSAPFISSTT VLFRPIHPIT GLRCRCPPGF TGDYCETEID LCYSNPCGAN GRCRSREGGY
TCECFEDFTG EHCQVNVRSG RCASGVCKNG GTCVNLLIGG FHCVCPPGEY EHPYCEVSTR SFPPQSFVTF RGLRQRFHFT VSLAFATQDR NALLLYNGRF
NEKHDFIALE IVEEQLQLTF SAGETTTTVT PQVPGGVSDG RWHSVLVQYY NKPNIGHLGL PHGPSGEKVA VVTVDDCDAA VAVHFGSYVG NYSCAAQGTQ
SGSKKSLDLT GPLLLGGVPN LPEDFPVHSR QFVGCMRNLS IDGRIVDMAA FIANNGTRAG CASQRNFCDG TSCQNGGTCV NRWNTYLCEC PLRFGGKNCE
QAMPHPQRFT GESVVLWSDL DITISVPWYL GLMFRTRKED GVLMEATAGT SSRLHLQILN SYIRFEVSYG PSDVASMQLS KSRITDGGWH HLLIELRSAK
EGKDIKYLAV MTLDYGMDQS TVQIGNQLPG LKMRTIVIGG VTEDKVSVRH GFRGCMQGVR MGETSTNIAT LNMNDALKVR VKDGCDVEDP CASSPCPPHS
HCRDTWDSYS CICDRGYFGK KCVDACLLNP CKHVAACVRS PNTPRGYSCE CGPGHYGQYC ENKVDLPCPK GWWGNPVCGP CHCAVSQGFD PDCNKTNGQC
QCKENYYKPP AQDACLPCDC FPHGSHSRAC DMDTGQCACK PGVIGRQCNR CDNPFAEVTS LGCEVIYNGC PRAFEAGIWW PQTKFGQPAA VPCPKGSVGN
AVRHCSGEKG WLPPELFNCT SGSFVDLKAL NEKLNRNETR MDGNRSLRLA KALRNATQGN STLFGNDVRT AYQLLARILQ HESRQQGFDL AATREANFHE
DVVHTGSALL APATEASWEQ IQRSEAGAAQ LLRHFEAYFS NVARNVKRTY LRPFVIVTAN MILAVDIFDK LNFTGAQVPR FEDIQEELPR ELESSVSFPA
DTFKPPEKKE GPVVRLTNRR TTPLTAQPEP RAERETSSSR RRRHPDEPGQ FAVALVVIYR TLGQLLPEHY DPDHRSLRLP NRPVINTPVV SAMVYSEGTP
LPSSLQRPIL VEFSLLETEE RSKPVCVFWN HSLDTGGTGG WSAKGCELLS RNRTHVTCQC SHSASCAVLM DISRREHGEV LPLKIITYAA LSLSLVALLV
AFVLLSLVRT LRSNLHSIHK NLITALFFSQ LIFMVGINQT ENPFLCTVVA ILLHYVSMGT FAWTLVENLH VYRMLTEVRN IDTGPMRFYH VVGWGIPAIV
TGLAVGLDPQ GYGNPDFCWL SLQDTLIWSF AGPVGTVIII NTVIFVLSAK VSCQRKHHYY ERKGVVSMLR TAFLLLLLVT ATWLLGLLAV NSDTLSFHYL
FAAFSCLQGI FVLLFHCVAH REVRKHLRAV LAGKKLQLDD SATTRATLLT RSLNCNNTYS EGPDMLRTAL GESTASLDST TRDEGVQKLS VSSGPARGNH
GEPDASFIPR NSKKAHGPDS DSDSELSLDE HSSSYASSHT SDSEDDGGEA EDKWNPAGGP AHSTPKADAL ANHVPAGWPD ESLAGSDSEE LDTEPHLKVE
TKVSVELHRQ AQGNHCGDRP SDPESGVLAK PVAVLSSQPQ EQRKGILKNK VTYPPPLPEQ PLKSRLREKL ADCEQSPTSS RTSSLGSGDG VHATDCVITI
KTPRREPGRE HLNGVAMNVR TGSAQANGSD SEKP

N-glycosylation sites

Potential sites Identified Peptide*1
Position Sequon Position Sequence Unique or Shared
236 NLS
561 NSS
649 NSS
793 NVT
1129 NKS
1154 NYT
1228 NMS
1264 NDT
1274 NVT
1302 NRT 1287 - 1303
GRFFPSEDLQEQIYLNR
Unique
1320 NIC
1591 NYS
1638 NLS
1655 NGT 1645 - 1658
IVDMAAFIANNGTR
Shared ( 2 )
1666 NFC
1994 NKT
2049 NRC
2068 NGC
2118 NCT
2137 NET
2144 NRS
2155 NAT 2152 - 2169
ALRNATQGNSTLFGNDVR
Shared ( 2 )
2155 - 2169
NATQGNSTLFGNDVR
Shared ( 2 )
2160 NST 2152 - 2169
ALRNATQGNSTLFGNDVR
Shared ( 2 )
2155 - 2169
NATQGNSTLFGNDVR
Shared ( 2 )
2272 NFT
2430 NHS
2452 NRT
2538 NQT
2756 NNT
2914 NHC
3027 NGS
Colon Kidney Stomach
RCA120 ConA ConA
     
     
     
     
     
     
     
     
     
 
 
     
     
     
 
 
     
     
     
     
     
     
     
 
   
 
   
     
     
     
     
     
     
     
*1:   _:Potential Sequon     :Asn (glycosylated)     :Gln (deaminated:pyroGlu)     :Met (oxidized)     :Cys (carbamidomethylated and deaminated)

External Links

Publications

Kaji H, Shikanai T, Sasaki-Sawa A, et al Large-scale Identification of N-Glycosylated Proteins of Mouse Tissues and Construction of a Glycoprotein Database, GlycoProtDB J Proteome Res 2012;11(9):4553-4566 PMID:22823882

Entry information

Entry version 2016-12-05
protein sequence version 2016-12-05
Entry status Latest version
Entry history

create : 2016-12-05