Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
Resources Mucin Biology Group, University of Gothenburg http://www.medkem.gu.se/mucinbiology/ Mucins, Methods and Protocols Methods in Molecular Biology v. 842 (Springer Protocols) http://www.springerprotocols.com/booktoc/doi/10.1007/978-1-61779-513-8
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
The human mucin family (2014)
Mucins Diversity and classes of mucin proteins Secreted gel forming non-gel forming Membrane-associated
Corifeld, AP (2014) Mucins: A biologically relevant glycan barrier in mucosal protection Biochim Biophys Acta in press
Structures of gel-forming mucins Figure 1 Cartoon of the major structural domains within polymeric airway mucins. (a) Generic representation of a polymeric mucin polypeptide. The N- and C-terminal von Willebrand factor (vwf)-like domains (D domains, the B domain, the C domain, and the CK domain) and the central region containing two mucin domains (MD) and one cys domain are highlighted. (b) The organization of the central domains of MUC5B, MUC5AC, and MUC2. Repetitive (striped blue) and nonrepetitive (solid blue) sequences within the mucin domains are shown. The central domains of MUC5B and MUC5AC show little length variation between individuals, but the fourth repetitive domain in MUC5AC shows a slight variation in length. In contrast, the second repetitive domain of MUC2 is variable in length, and the two extremes, with either 40 or 185 repeats, are depicted, illustrating the significant potential size difference for this mucin.
Taylor & Drickamer Introduction to Glycobiology, Third Edition. Oxford University Press 2011
Structures of membrane-associated mucins Figure 1 Structure of the major respiratory transmembrane mucins. Key domains include the N-terminal signal sequence (red); sperm protein, enterokinase, and agrin (SEA) modules; transmembrane (TM) domains; cytoplasmic tail (CT); nidogen homology sequence (NIDO); adhesion-associated domain in MUC4 and other proteins (AMOP); von Willebrand factor D sequence (VWD); and epidermal growth factor (EGF)-like regions. The inset in panel c illustrates the MUC16 repeat region containing N-glycosylation sites and cysteine-cysteine disulfide bonds.
Structures of membrane-associated mucins Taylor & Drickamer Introduction to Glycobiology, Third Edition. Oxford University Press 2011
Mucin protein domains Protein domains: http://www.medkem.gu.se/mucinbiology/ SS (Yellow background) Signal sequence. PTS (Light orange) Mucin domain. Rich of Pro, Ser, and Thr. PTS (Red) Mucin domain repeats part. PTS (Red, bold) Mucin domain 1 typical repeat unit. VWD (Lime) von Willebrand factor type D domain. CysD/CysD (Pink) Cys-rich domain inserted in or around PTS domains. SEA (Blue) Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. CK (Plum) Cys-knot domain. Comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation. TM (Turquoise background) Transmembrane domain. NIDO (Pale blue) Nidogen-like domain, an extracellular domain found in nidogen and hypothetical proteins of unknown function. AMOP (Lavender) This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges. VWC (Pink background) von Willebrand factor type C domain. Gray (-50%) The part in splice variant which is different from main sequence.
Muc1 domain structure (~1250 aa) Muc16 domain structure (>22,000 aa) http://www.medkem.gu.se/mucinbiology/
MLKPSGLPGSSSPTRSLMTGSRSTKATPEMDSGLTGATLSPKTSTGAIVVTEHTLPFTSPDKTLASPTSSVVGRTTQSLGVMSSALPESTSRGMTHSEQRTS PSLSPQVNGTPSRNYPATSMVSGLSSPRTRTSSTEGNFTKEASTYTLTVETTSGPVTEKYTVPTETSTTEGDSTETPWDTRYIPVKITSPMKTFADSTASKENA PVSMTPAETTVTDSHTPGRTNPSFGTLYSSFLDLSPKGTPNSRGETSLELILSTTGYPFSSPEPGSAGHSRISTSAPLSSSASVLDNKISETSIFSGQSLTSPLSPG VPEARASTMPNSAIPFSMTLSNAETSAERVRSTISSLGTPSISTKQTAETILTFHAFAETMDIPSTHIAKTLASEWLGSPGTLGGTSTSALTTTSPSTTLVSEETN THHSTSGKETEGTLNTSMTPLETSAPGEESEMTATLVPTLGFTTLDSKIRSPSQVSSSHPTRELRTTGSTSGRQSSSTAAHGSSDILRATTSSTSKASSWTSEST AQQFSEPQHTQWVETSPSMKTERPPASTSVAAPITTSVPSVVSGFTTLKTSSTKGIWLEETSADTLIGESTAGPTTHQFAVPTGISMTGGSSTRGSQGTTHL LTRATASSETSADLTLATNGVPVSVSPAVSKTAAGSSPPGGTKPSYTMVSSVIPETSSLQSSAFREGTSLGLTPLNTRHPFSSPEPDSAGHTKISTSIPLLSSASVL EDKVSATSTFSHHKATSSITTGTPEISTKTKPSSAVLSSMTLSNAATSPERVRNATSPLTHPSPSGEETAGSVLTLSTSAETTDSPNIHPTGTLTSESSESPSTLSLP SVSGVKTTFSSSTPSTHLFTSGEETEETSNPSVSQPETSVSRVRTTLASTSVPTPVFPTMDTWPTRSAQFSSSHLVSELRATSSTSVTNSTGSALPKISHLTGTA TMSQTNRDTFNDSAAPQSTTWPETSPRFKTGLPSATTTVSTSATSLSATVMVSKFTSPATSSMEATSIREPSTTILTTETTNGPGSMAVASTNIPIGKGYITEG RLDTSHLPIGTTASSETSMDFTMAKESVSMSVSPSQSMDAAGSSTPGRTSQFVDTFSDDVYHLTSREITIPRDGTSSALTPQMTATHPPSPDPGSARSTWL GILSSSPSSPTPKVTMSSTFSTQRVTTSMIMDTVETSRWNMPNLPSTTSLTPSNIPTSGAIGKSTLVPLDTPSPATSLEASEGGLPTLSTYPESTNTPSIHLGAH ASSESPSTIKLTMASVVKPGSYTPLTFPSIETHIHVSTARMAYSSGSSPEMTAPGETNTGSTWDPTTYITTTDPKDTSSAQVSTPHSVRTLRTTENHPKTESAT PAAYSGSPKISSSPNLTSPATKAWTITDTTEHSTQLHYTKLAEKSSGFETQSAPGPVSVVIPTSPTIGSSTLELTSDVPGEPLVLAPSEQTTITLPMATWLSTSLTE EMASTDLDISSPSSPMSTFAIFPPMSTPSHELSKSEADTSAIRNTDSTTLDQHLGIRSLGRTGDLTTVPITPLTTTWTSVIEHSTQAQDTLSATMSPTHVTQSL KDQTSIPASASPSHLTEVYPELGTQGRSSSEATTFWKPSTDTLSREIETGPTNIQSTPPMDNTTTGSSSSGVTLGIAHLPIGTSSPAETSTNMALERRSSTATVS MAGTMGLLVTSAPGRSISQSLGRVSSVLSESTTEGVTDSSKGSSPRLNTQGNTALSSSLEPSYAEGSQMSTSIPLTSSPTTPDVEFIGGSTFWTKEVTTVMTS DISKSSARTESSSATLMSTALGSTENTGKEKLRTASMDLPSPTPSMEVTPWISLTLSNAPNTTDSLDLSHGVHTSSAGTLATDRSLNTGVTRASRLENGSDTS SKSLSMGNSTHTSMTDTEKSEVSSSIHPRPETSAPGAETTLTSTPGNRAISLTLPFSSIPVEEVISTGITSGPDINSAPMTHSPITPPTIVWTSTGTIEQSTQPLH AVSSEKVSVQTQSTPYVNSVAVSASPTHENSVSSGSSTSSPYSSASLESLDSTISRRNAITSWLWDLTTSLPTTTWPSTSLSEALSSGHSGVSNPSSTTTEFPLF SAASTSAAKQRNPETETHGPQNTAASTLNTDASSVTGLSETPVGASISSEVPLPMAITSRSDVSGLTSESTANPSLGTASSAGTKLTRTISLPTSESLVSFRMN KDPWTVSIPLGSHPTTNTETSIPVNSAGPPGLSTVASDVIDTPSDGAESIPTVSFSPSPDTEVTTISHFPEKTTHSFRTISSLTHELTSRVTPIPGDWMSSAMST KPTGASPSITLGERRTITSAAPTTSPIVLTASFTETSTVSLDNETTVKTSDILDARKTNELPSDSSSSSDLINTSIASSTMDVTKTASISPTSISGMTASSSPSLFSSD RPQVPTSTTETNTATSPSVSSNTYSLDGGSNVGGTPSTLPPFTITHPVETSSALLAWSRPVRTFSTMVSTDTASGENPTSSNSVVTSVPAPGTWASVGSTTD LPAMGFLKTSPAGEAHSLLASTIEPATAFTPHLSAAVVTGSSATSEASLLTTSESKAIHSSPQTPTTPTSGANWETSATPESLLVVTETSDTTLTSKILVTDTILFST VSTPPSKFPSTGTLSGASFPTLLPDTPAIPLTATEPTSSLATSFDSTPLVTIASDSLGTVPETTLTMSETSNGDALVLKTVSNPDRSIPGITIQGVTESPLHPSSTSP SKIVAPRNTTYEGSITVALSTLPAGTTGSLVFSQSSENSETTALVDSSAGLERASVMPLTTGSQGMASSGGIRSGSTHSTGTKTFSSLPLTMNPGEVTAMSEIT TNRLTATQSTAPKGIPVKPTSAESGLLTPVSASSSPSKAFASLTTAPPSTWGIPQSTLTFEFSEVPSLDTKSASLPTPGQSLNTIPDSDASTASSSLSKSPEKNPRA RMMTSTKAISASSFQSTGFTETPEGSASPSMAGHEPRVPTSGTGDPRYASESMSYPDPSKASSAMTSTSLASKLTTLFSTGQAARSGSSSSPISLSTEKETSF LSPTASTSRKTSLFLGPSMARQPNILVHLQTSALTLSPTSTLNMSQEEPPELTSSQTIAEEEGTTAETQTLTFTPSETPTSLLPVSSPTEPTARRKSSPETWASSIS VPAKTSLVETTDGTLVTTIKMSSQAAQGNSTWPAPAEETGTSPAGTSPGSPEVSTTLKIMSSKEPSISPEIRSTVRNSPWKTPETTVPMETTVEPVTLQSTAL GSGSTSISHLPTGTTSPTKSPTENMLATERVSLSPSPPEAWTNLYSGTPGGTRQSLATMSSVSLESPTARSITGTGQQSSPELVSKTTGMEFSMWHGSTGGT TGDTHVSLSTSSNILEDPVTSPNSVSSLTDKSKHKTETWVSTTAIPSTVLNNKIMAAEQQTSRSVDEAYSSTSSWSDQTSGSDITLGASPDVTNTLYITSTAQT TSLVSLPSGDQGITSLTNPSGGKTSSASSVTSPSIGLETLRANVSAVKSDIAPTAGHLSQTSSPAEVSILDVTTAPTPGISTTITTMGTNSISTTTPNPEVGMST MDSTPATERRTTSTEHPSTWSSTAASDSWTVTDMTSNLKVARSPGTISTMHTTSFLASSTELDSMSTPHGRITVIGTSLVTPSSDASAVKTETSTSERTLSPS DTTASTPISTFSRVQRMSISVPDILSTSWTPSSTEAEDVPVSMVSTDHASTKTDPNTPLSTFLFDSLSTLDWDTGRSLSSATATTSAPQGATTPQELTLETMIS Muc16-1
PATSQLPFSIGHITSAVTPAAMARSSGVTFSRPDPTSKKAEQTSTQLPTTTSAHPGQVPRSAATTLDVIPHTAKTPDATFQRQGQTALTTEARATSDSWNEK EKSTPSAPWITEMMNSVSEDTIKEVTSSSSVLKDPEYAGHKLGIWDDFIPKFGKAAHMRELPLLSPPQDKEAIHPSTNTVETTGWVTSSEHASHSTIPAHSA SSKLTSPVVTTSTREQAIVSMSTTTWPESTRARTEPNSFLTIELRDVSPYMDTSSTTQTSIISSPGSTAITKGPRTEITSSKRISSSFLAQSMRSSDSPSEAITRLS NFPAMTESGGMILAMQTSPPGATSLSAPTLDTSATASWTGTPLATTQRFTYSEKTTLFSKGPEDTSQPSPPSVEETSSSSSLVPIHATTSPSNILLTSQGHSPS STPPVTSVFLSETSGLGKTTDMSRISLEPGTSLPPNLSSTAGEALSTYEASRDTKAIHHSADTAVTNMEATSSEYSPIPGHTKPSKATSPLVTSHIMGDITSSTS VFGSSETTEIETVSSVNQGLQERSTSQVASSATETSTVITHVSSGDATTHVTKTQATFSSGTSISSPHQFITSTNTFTDVSTNPSTSLIMTESSGVTITTQTGPTG AATQGPYLLDTSTMPYLTETPLAVTPDFMQSEKTTLISKGPKDVTWTSPPSVAETSYPSSLTPFLVTTIPPATSTLQGQHTSSPVSATSVLTSGLVKTTDMLNT SMEPVTNSPQNLNNPSNEILATLAATTDIETIHPSINKAVTNMGTASSAHVLHSTLPVSSEPSTATSPMVPASSMGDALASISIPGSETTDIEGEPTSSLTAGR KENSTLQEMNSTTESNIILSNVSVGAITEATKMEVPSFDATFIPTPAQSTKFPDIFSVASSRLSNSPPMTISTHMTTTQTGSSGATSKIPLALDTSTLETSAGTP SVVTEGFAHSKITTAMNNDVKDVSQTNPPFQDEASSPSSQAPVLVTTLPSSVAFTPQWHSTSSPVSMSSVLTSSLVKTAGKVDTSLETVTSSPQSMSNTLD DISVTSAATTDIETTHPSINTVVTNVGTTGSAFESHSTVSAYPEPSKVTSPNVTTSTMEDTTISRSIPKSSKTTRTETETTSSLTPKLRETSISQEITSSTETSTVPY KELTGATTEVSRTDVTSSSSTSFPGPDQSTVSLDISTETNTRLSTSPIMTESAEITITTQTGPHGATSQDTFTMDPSNTTPQAGIHSAMTHGFSQLDVTTLMS RIPQDVSWTSPPSVDKTSSPSSFLSSPAMTTPSLISSTLPEDKLSSPMTSLLTSGLVKITDILRTRLEPVTSSLPNFSSTSDKILATSKDSKDTKEIFPSINTEETNVK ANNSGHESHSPALADSETPKATTQMVITTTVGDPAPSTSMPVHGSSETTNIKREPTYFLTPRLRETSTSQESSFPTDTSFLLSKVPTGTITEVSSTGVNSSSKIS TPDHDKSTVPPDTFTGEIPRVFTSSIKTKSAEMTITTQASPPESASHSTLPLDTSTTLSQGGTHSTVTQGFPYSEVTTLMGMGPGNVSWMTTPPVEETSSV SSLMSSPAMTSPSPVSSTSPQSIPSSPLPVTALPTSVLVTTTDVLGTTSPESVTSSPPNLSSITHERPATYKDTAHTEAAMHHSTNTAVTNVGTSGSGHKSQSS VLADSETSKATPLMSTTSTLGDTSVSTSTPNISQTNQIQTEPTASLSPRLRESSTSEKTSSTTETNTAFSYVPTGAITQASRTEISSSRTSISDLDRPTIAPDISTG MITRLFTSPIMTKSAEMTVTTQTTTPGATSQGILPWDTSTTLFQGGTHSTVSQGFPHSEITTLRSRTPGDVSWMTTPPVEETSSGFSLMSPSMTSPSPVSS TSPESIPSSPLPVTALLTSVLVTTTNVLGTTSPETVTSSPPNLSSPTQERLTTYKDTAHTEAMHASMHTNTAVANVGTSISGHESQSSVPADSHTSKATSPMGI TFAMGDTSVSTSTPAFFETRIQTESTSSLIPGLRDTRTSEEINTVTETSTVLSEVPTTTTTEVSRTEVITSSRTTISGPDHSKMSPYISTETITRLSTFPFVTGSTEM AITNQTGPIGTISQATLTLDTSSTASWEGTHSPVTQRFPHSEETTTMSRSTKGVSWQSPPSVEETSSPSSPVPLPAITSHSSLYSAVSGSSPTSALPVTSLLTSG RRKTIDMLDTHSELVTSSLPSASSFSGEILTSEASTNTETIHFSENTAETNMGTTNSMHKLHSSVSIHSQPSGHTPPKVTGSMMEDAIVSTSTPGSPETKNV DRDSTSPLTPELKEDSTALVMNSTTESNTVFSSVSLDAATEVSRAEVTYYDPTFMPASAQSTKSPDISPEASSSHSNSPPLTISTHKTIATQTGPSGVTSLGQLT LDTSTIATSAGTPSARTQDFVDSETTSVMNNDLNDVLKTSPFSAEEANSLSSQAPLLVTTSPSPVTSTLQEHSTSSLVSVTSVPTPTLAKITDMDTNLEPVTRS PQNLRNTLATSEATTDTHTMHPSINTAMANVGTTSSPNEFYFTVSPDSDPYKATSAVVITSTSGDSIVSTSMPRSSAMKKIESETTFSLIFRLRETSTSQKIGS SSDTSTVFDKAFTAATTEVSRTELTSSSRTSIQGTEKPTMSPDTSTRSVTMLSTFAGLTKSEERTIATQTGPHRATSQGTLTWDTSITTSQAGTHSAMTHGFS QLDLSTLTSRVPEYISGTSPPSVEKTSSSSSLLSLPAITSPSPVPTTLPESRPSSPVHLTSLPTSGLVKTTDMLASVASLPPNLGSTSHKIPTTSEDIKDTEKMYPST NIAVTNVGTTTSEKESYSSVPAYSEPPKVTSPMVTSFNIRDTIVSTSMPGSSEITRIEMESTFSVAHGLKGTSTSQDPIVSTEKSAVLHKLTTGATETSRTEVASS RRTSIPGPDHSTESPDISTEVIPSLPISLGITESSNMTIITRTGPPLGSTSQGTFTLDTPTTSSRAGTHSMATQEFPHSEMTTVMNKDPEILSWTIPPSIEKTSFS SSLMPSPAMTSPPVSSTLPKTIHTTPSPMTSLLTPSLVMTTDTLGTSPEPTTSSPPNLSSTSHVILTTDEDTTAIEAMHPSTSTAATNVETTCSGHGSQSSVLT DSEKTKATAPMDTTSTMGHTTVSTSMSVSSETTKIKRESTYSLTPGLRETSISQNASFSTDTSIVLSEVPTGTTAEVSRTEVTSSGRTSIPGPSQSTVLPEISTRT MTRLFASPTMTESAEMTIPTQTGPSGSTSQDTLTLDTSTTKSQAKTHSTLTQRFPHSEMTTLMSRGPGDMSWQSSPSLENPSSLPSLLSLPATTSPPPISSTL PVTISSSPLPVTSLLTSSPVTTTDMLHTSPELVTSSPPKLSHTSDERLTTGKDTTNTEAVHPSTNTAASNVEIPSFGHESPSSALADSETSKATSPMFITSTQEDT TVAISTPHFLETSRIQKESISSLSPKLRETGSSVETSSAIETSAVLSEVSIGATTEISRTEVTSSSRTSISGSAESTMLPEISTTRKIIKFPTSPILAESSEMTIKTQTSPP GSTSESTFTLDTSTTPSLVITHSTMTQRLPHSEITTLVSRGAGDVPRPSSLPVEETSPPSSQLSLSAMISPSPVSSTLPASSHSSSASVTSPLTPGQVKTTEVLDA SAEPETSSPPSLSSTSVEILATSEVTTDTEKIHPFPNTAVTKVGTSSSGHESPSSVLPDSETTKATSAMGTISIMGDTSVSTLTPALSNTRKIQSEPASSLTTRLRE Muc16-2
PATSQLPFSIGHITSAVTPAAMARSSGVTFSRPDPTSKKAEQTSTQLPTTTSAHPGQVPRSAATTLDVIPHTAKTPDATFQRQGQTALTTEARATSDSWNEK EKSTPSAPWITEMMNSVSEDTIKEVTSSSSVLKDPEYAGHKLGIWDDFIPKFGKAAHMRELPLLSPPQDKEAIHPSTNTVETTGWVTSSEHASHSTIPAHSA SSKLTSPVVTTSTREQAIVSMSTTTWPESTRARTEPNSFLTIELRDVSPYMDTSSTTQTSIISSPGSTAITKGPRTEITSSKRISSSFLAQSMRSSDSPSEAITRLS NFPAMTESGGMILAMQTSPPGATSLSAPTLDTSATASWTGTPLATTQRFTYSEKTTLFSKGPEDTSQPSPPSVEETSSSSSLVPIHATTSPSNILLTSQGHSPS STPPVTSVFLSETSGLGKTTDMSRISLEPGTSLPPNLSSTAGEALSTYEASRDTKAIHHSADTAVTNMEATSSEYSPIPGHTKPSKATSPLVTSHIMGDITSSTS VFGSSETTEIETVSSVNQGLQERSTSQVASSATETSTVITHVSSGDATTHVTKTQATFSSGTSISSPHQFITSTNTFTDVSTNPSTSLIMTESSGVTITTQTGPTG AATQGPYLLDTSTMPYLTETPLAVTPDFMQSEKTTLISKGPKDVTWTSPPSVAETSYPSSLTPFLVTTIPPATSTLQGQHTSSPVSATSVLTSGLVKTTDMLNT SMEPVTNSPQNLNNPSNEILATLAATTDIETIHPSINKAVTNMGTASSAHVLHSTLPVSSEPSTATSPMVPASSMGDALASISIPGSETTDIEGEPTSSLTAGR KENSTLQEMNSTTESNIILSNVSVGAITEATKMEVPSFDATFIPTPAQSTKFPDIFSVASSRLSNSPPMTISTHMTTTQTGSSGATSKIPLALDTSTLETSAGTP SVVTEGFAHSKITTAMNNDVKDVSQTNPPFQDEASSPSSQAPVLVTTLPSSVAFTPQWHSTSSPVSMSSVLTSSLVKTAGKVDTSLETVTSSPQSMSNTLD DISVTSAATTDIETTHPSINTVVTNVGTTGSAFESHSTVSAYPEPSKVTSPNVTTSTMEDTTISRSIPKSSKTTRTETETTSSLTPKLRETSISQEITSSTETSTVPY KELTGATTEVSRTDVTSSSSTSFPGPDQSTVSLDISTETNTRLSTSPIMTESAEITITTQTGPHGATSQDTFTMDPSNTTPQAGIHSAMTHGFSQLDVTTLMS RIPQDVSWTSPPSVDKTSSPSSFLSSPAMTTPSLISSTLPEDKLSSPMTSLLTSGLVKITDILRTRLEPVTSSLPNFSSTSDKILATSKDSKDTKEIFPSINTEETNVK ANNSGHESHSPALADSETPKATTQMVITTTVGDPAPSTSMPVHGSSETTNIKREPTYFLTPRLRETSTSQESSFPTDTSFLLSKVPTGTITEVSSTGVNSSSKIS TPDHDKSTVPPDTFTGEIPRVFTSSIKTKSAEMTITTQASPPESASHSTLPLDTSTTLSQGGTHSTVTQGFPYSEVTTLMGMGPGNVSWMTTPPVEETSSV SSLMSSPAMTSPSPVSSTSPQSIPSSPLPVTALPTSVLVTTTDVLGTTSPESVTSSPPNLSSITHERPATYKDTAHTEAAMHHSTNTAVTNVGTSGSGHKSQSS VLADSETSKATPLMSTTSTLGDTSVSTSTPNISQTNQIQTEPTASLSPRLRESSTSEKTSSTTETNTAFSYVPTGAITQASRTEISSSRTSISDLDRPTIAPDISTG MITRLFTSPIMTKSAEMTVTTQTTTPGATSQGILPWDTSTTLFQGGTHSTVSQGFPHSEITTLRSRTPGDVSWMTTPPVEETSSGFSLMSPSMTSPSPVSS TSPESIPSSPLPVTALLTSVLVTTTNVLGTTSPETVTSSPPNLSSPTQERLTTYKDTAHTEAMHASMHTNTAVANVGTSISGHESQSSVPADSHTSKATSPMGI TFAMGDTSVSTSTPAFFETRIQTESTSSLIPGLRDTRTSEEINTVTETSTVLSEVPTTTTTEVSRTEVITSSRTTISGPDHSKMSPYISTETITRLSTFPFVTGSTEM AITNQTGPIGTISQATLTLDTSSTASWEGTHSPVTQRFPHSEETTTMSRSTKGVSWQSPPSVEETSSPSSPVPLPAITSHSSLYSAVSGSSPTSALPVTSLLTSG RRKTIDMLDTHSELVTSSLPSASSFSGEILTSEASTNTETIHFSENTAETNMGTTNSMHKLHSSVSIHSQPSGHTPPKVTGSMMEDAIVSTSTPGSPETKNV DRDSTSPLTPELKEDSTALVMNSTTESNTVFSSVSLDAATEVSRAEVTYYDPTFMPASAQSTKSPDISPEASSSHSNSPPLTISTHKTIATQTGPSGVTSLGQLT LDTSTIATSAGTPSARTQDFVDSETTSVMNNDLNDVLKTSPFSAEEANSLSSQAPLLVTTSPSPVTSTLQEHSTSSLVSVTSVPTPTLAKITDMDTNLEPVTRS PQNLRNTLATSEATTDTHTMHPSINTAMANVGTTSSPNEFYFTVSPDSDPYKATSAVVITSTSGDSIVSTSMPRSSAMKKIESETTFSLIFRLRETSTSQKIGS SSDTSTVFDKAFTAATTEVSRTELTSSSRTSIQGTEKPTMSPDTSTRSVTMLSTFAGLTKSEERTIATQTGPHRATSQGTLTWDTSITTSQAGTHSAMTHGFS QLDLSTLTSRVPEYISGTSPPSVEKTSSSSSLLSLPAITSPSPVPTTLPESRPSSPVHLTSLPTSGLVKTTDMLASVASLPPNLGSTSHKIPTTSEDIKDTEKMYPST NIAVTNVGTTTSEKESYSSVPAYSEPPKVTSPMVTSFNIRDTIVSTSMPGSSEITRIEMESTFSVAHGLKGTSTSQDPIVSTEKSAVLHKLTTGATETSRTEVASS RRTSIPGPDHSTESPDISTEVIPSLPISLGITESSNMTIITRTGPPLGSTSQGTFTLDTPTTSSRAGTHSMATQEFPHSEMTTVMNKDPEILSWTIPPSIEKTSFS SSLMPSPAMTSPPVSSTLPKTIHTTPSPMTSLLTPSLVMTTDTLGTSPEPTTSSPPNLSSTSHVILTTDEDTTAIEAMHPSTSTAATNVETTCSGHGSQSSVLT DSEKTKATAPMDTTSTMGHTTVSTSMSVSSETTKIKRESTYSLTPGLRETSISQNASFSTDTSIVLSEVPTGTTAEVSRTEVTSSGRTSIPGPSQSTVLPEISTRT MTRLFASPTMTESAEMTIPTQTGPSGSTSQDTLTLDTSTTKSQAKTHSTLTQRFPHSEMTTLMSRGPGDMSWQSSPSLENPSSLPSLLSLPATTSPPPISSTL PVTISSSPLPVTSLLTSSPVTTTDMLHTSPELVTSSPPKLSHTSDERLTTGKDTTNTEAVHPSTNTAASNVEIPSFGHESPSSALADSETSKATSPMFITSTQEDT TVAISTPHFLETSRIQKESISSLSPKLRETGSSVETSSAIETSAVLSEVSIGATTEISRTEVTSSSRTSISGSAESTMLPEISTTRKIIKFPTSPILAESSEMTIKTQTSPP GSTSESTFTLDTSTTPSLVITHSTMTQRLPHSEITTLVSRGAGDVPRPSSLPVEETSPPSSQLSLSAMISPSPVSSTLPASSHSSSASVTSPLTPGQVKTTEVLDA SAEPETSSPPSLSSTSVEILATSEVTTDTEKIHPFPNTAVTKVGTSSSGHESPSSVLPDSETTKATSAMGTISIMGDTSVSTLTPALSNTRKIQSEPASSLTTRLRE Muc16-2
TSTSEETSLATEANTVLSKVSTGATTEVSRTEAISFSRTSMSGPEQSTMSQDISIGTIPRISASSVLTESAKMTITTQTGPSESTLESTLNLNTATTPSWVETHSIV IQGFPHPEMTTSMGRGPGGVSWPSPPFVKETSPPSSPLSLPAVTSPHPVSTTFLAHIPPSPLPVTSLLTSGPATTTDILGTSTEPGTSSSSSLSTTSHERLTTYK DTAHTEAVHPSTNTGGTNVATTSSGYKSQSSVLADSSPMCTTSTMGDTSVLTSTPAFLETRRIQTELASSLTPGLRESSGSEGTSSGTKMSTVLSKVPTGATT EISKEDVTSIPGPAQSTISPDISTRTVSWFSTSPVMTESAEITMNTHTSPLGATTQGTSTLATSSTTSLTMTHSTISQGFSHSQMSTLMRRGPEDVSWMSPP LLEKTRPSFSLMSSPATTSPSPVSSTLPESISSSPLPVTSLLTSGLAKTTDMLHKSSEPVTNSPANLSSTSVEILATSEVTTDTEKTHPSSNRTVTDVGTSSSGHES TSFVLADSQTSKVTSPMVITSTMEDTSVSTSTPGFFETSRIQTEPTSSLTLGLRKTSSSEGTSLATEMSTVLSGVPTGATAEVSRTEVTSSSRTSISGFAQLTVSP ETSTETITRLPTSSIMTESAEMMIKTQTDPPGSTPESTHTVDISTTPNWVETHSTVTQRFSHSEMTTLVSRSPGDMLWPSQSSVEETSSASSLLSLPATTSPS PVSSTLVEDFPSASLPVTSLLTPGLVITTDRMGISREPGTSSTSNLSSTSHERLTTLEDTVDTEDMQPSTHTAVTNVRTSISGHESQSSVLSDSETPKATSPMGT TYTMGETSVSISTSDFFETSRIQIEPTSSLTSGLRETSSSERISSATEGSTVLSEVPSGATTEVSRTEVISSRGTSMSGPDQFTISPDISTEAITRLSTSPIMTESAES AITIETGSPGATSEGTLTLDTSTTTFWSGTHSTASPGFSHSEMTTLMSRTPGDVPWPSLPSVEEASSVSSSLSSPAMTSTSFFSALPESISSSPHPVTALLTLGP VKTTDMLRTSSEPETSSPPNLSSTSAEILATSEVTKDREKIHPSSNTPVVNVGTVIYKHLSPSSVLADLVTTKPTSPMATTSTLGNTSVSTSTPAFPETMMTQP TSSLTSGLREISTSQETSSATERSASLSGMPTGATTKVSRTEALSLGRTSTPGPAQSTISPEISTETITRISTPLTTTGSAEMTITPKTGHSGASSQGTFTLDTSSR ASWPGTHSAATHRSPHSGMTTPMSRGPEDVSWPSRPSVEKTSPPSSLVSLSAVTSPSPLYSTPSESSHSSPLRVTSLFTPVMMKTTDMLDTSLEPVTTSPP SMNITSDESLATSKATMETEAIQLSENTAVTQMGTISARQEFYSSYPGLPEPSKVTSPVVTSSTIKDIVSTTIPASSEITRIEMESTSTLTPTPRETSTSQEIHSAT KPSTVPYKALTSATIEDSMTQVMSSSRGPSPDQSTMSQDISSEVITRLSTSPIKAESTEMTITTQTGSPGATSRGTLTLDTSTTFMSGTHSTASQGFSHSQMT ALMSRTPGDVPWLSHPSVEEASSASFSLSSPVMTSSSPVSSTLPDSIHSSSLPVTSLLTSGLVKTTELLGTSSEPETSSPPNLSSTSAEILATTEVTTDTEKLEMT NVVTSGYTHESPSSVLADSVTTKATSSMGITYPTGDTNVLTSTPAFSDTSRIQTKSKLSLTPGLMETSISEETSSATEKSTVLSSVPTGATTEVSRTEAISSSRTSI PGPAQSTMSSDTSMETITRISTPLTRKESTDMAITPKTGPSGATSQGTFTLDSSSTASWPGTHSATTQRFPQSVVTTPMSRGPEDVSWPSPLSVEKNSPPS SLVSSSSVTSPSPLYSTPSGSSHSSPVPVTSLFTSIMMKATDMLDASLEPETTSAPNMNITSDESLATSKATTETEAIHVFENTAASHVETTSATEELYSSSPGFS EPTKVISPVVTSSSIRDNMVSTTMPGSSGITRIEIESMSSLTPGLRETRTSQDITSSTETSTVLYKMSSGATPEVSRTEVMPSSRTSIPGPAQSTMSLDISDEVV TRLSTSPIMTESAEITITTQTGYSLATSQVTLPLGTSMTFLSGTHSTMSQGLSHSEMTNLMSRGPESLSWTSPRFVETTRSSSSLTSLPLTTSLSPVSSTLLDSSP SSPLPVTSLILPGLVKTTEVLDTSSEPKTSSSPNLSSTSVEIPATSEIMTDTEKIHPSSNTAVAKVRTSSSVHESHSSVLADSETTITIPSMGITSAVDDTTVFTSNP AFSETRRIPTEPTFSLTPGFRETSTSEETTSITETSAVLYGVPTSATTEVSMTEIMSSNRTHIPDSDQSTMSPDIITEVITRLSSSSMMSESTQMTITTQKSSPGA TAQSTLTLATTTAPLARTHSTVPPRFLHSEMTTLMSRSPENPSWKSSPFVEKTSSSSSLLSLPVTTSPSVSSTLPQSIPSSSFSVTSLLTPGMVKTTDTSTEPGTS LSPNLSGTSVEILAASEVTTDTEKIHPSSSMAVTNVGTTSSGHELYSSVSIHSEPSKATYPVGTPSSMAETSISTSMPANFETTGFEAEPFSHLTSGFRKTNMS LDTSSVTPTNTPSSPGSTHLLQSSKTDFTSSAKTSSPDWPPASQYTEIPVDIITPFNASPSITESTGITSFPESRFTMSVTESTHHLSTDLLPSAETISTGTVMPSL SEAMTSFATTGVPRAISGSGSPFSRTESGPGDATLSTIAESLPSSTPVPFSSSTFTTTDSSTIPALHEITSSSATPYRVDTSLGTESSTTEGRLVMVSTLDTSSQPG RTSSTPILDTRMTESVELGTVTSAYQVPSLSTRLTRTDGIMEHITKIPNEAAHRGTIRPVKGPQTSTSPASPKGLHTGGTKRMETTTTALKTTTTALKTTSRATL TTSVYTPTLGTLTPLNASRQMASTILTEMMITTPYVFPDVPETTSSLATSLGAETSTALPRTTPSVLNRESETTASLVSRSGAERSPVIQTLDVSSSEPDTTASW VIHPAETIPTVSKTTPNFFHSELDTVSSTATSHGADVSSAIPTNISPSELDALTPLVTISGTDTSTTFPTLTKSPHETETRTTWLTHPAETSSTIPRTIPNFSHHESD ATPSIATSPGAETSSAIPIMTVSPGAEDLVTSQVTSSGTDRNMTIPTLTLSPGEPKTIASLVTHPEAQTSSAIPTSTISPAVSRLVTSMVTSLAAKTSTTNRALTN SPGEPATTVSLVTHPAQTSPTVPWTTSIFFHSKSDTTPSMTTSHGAESSSAVPTPTVSTEVPGVVTPLVTSSRAVISTTIPILTLSPGEPETTPSMATSHGEEAS SAIPTPTVSPGVPGVVTSLVTSSRAVTSTTIPILTFSLGEPETTPSMATSHGTEAGSAVPTVLPEVPGMVTSLVASSRAVTSTTLPTLTLSPGEPETTPSMATSH GAEASSTVPTVSPEVPGVVTSLVTSSSGVNSTSIPTLILSPGELETTPSMATSHGAEASSAVPTPTVSPGVSGVVTPLVTSSRAVTSTTIPILTLSSSEPETTPSM ATSHGVEASSAVLTVSPEVPGMVTSLVTSSRAVTSTTIPTLTISSDEPETTTSLVTHSEAKMISAIPTLAVSPTVQGLVTSLVTSSGSETSAFSNLTVASSQPETID SWVAHPGTEASSVVPTLTVSTGEPFTNISLVTHPAESSSTLPRTTSRFSHSELDTMPSTVTSPEAESSSAISTTISPGIPGVLTSLVTSSGRDISATFPTVPESPHE Muc16-3
SEATASWVTHPAVTSTTVPRTTPNYSHSEPDTTPSIATSPGAEATSDFPTITVSPDVPDMVTSQVTSSGTDTSITIPTLTLSSGEPETTTSFITYSETHTSSAIPTL PVSPGASKMLTSLVISSGTDSTTTFPTLTETPYEPETTAIQLIHPAETNTMVPKTTPKFSHSKSDTTLPVAITSPGPEASSAVSTTTISPDMSDLVTSLVPSSGTD TSTTFPTLSETPYEPETTVTWLTHPAETSTTVSGTIPNFSHRGSDTAPSMVTSPGVDTRSGVPTTTIPPSIPGVVTSQVTSSATDTSTAIPTLTPSPGEPETTASS ATHPGTQTGFTVPIRTVPSSEPDTMASWVTHPPQTSTPVSRTTSSFSHSSPDATPVMATSPRTEASSAVLTTISPGAPEMVTSQITSSGAATSTTVPTLTHSP GMPETTALLSTHPRTGTSKTFPASTVFPQVSETTASLTIRPGAETSTALPTQTTSSLFTLLVTGTSRVDLSPTASPGVSAKTAPLSTHPGTETSTMIPTSTLSLGLL ETTGLLATSSSAETSTSTLTLTVSPAVSGLSSASITTDKPQTVTSWNTETSPSVTSVGPPEFSRTVTGTTMTLIPSEMPTPPKTSHGEGVSPTTILRTTMVEATN LATTGSSPTVAKTTTTFNTLAGSLFTPLTTPGMSTLASESVTSRTSYNHRSWISTTSSYNRRYWTPATSTPVTSTFSPGISTSSIPSSTAATVPFMVPFTLNFTIT NLQYEEDMRHPGSRKFNATERELQGLLKPLFRNSSLEYLYSGCRLASLRPEKDSSAMAVDAICTHRPDPEDLGLDRERLYWELSNLTNGIQELGPYTLDRNS LYVNGFTHRSSMPTTSTPGTSTVDVGTSGTPSSSPSPTAAGPLLMPFTLNFTITNLQYEEDMRRTGSRKFNTMESVLQGLLKPLFKNTSVGPLYSGCRLTLLR PEKDGAATGVDAICTHRLDPKSPGLNREQLYWELSKLTNDIEELGPYTLDRNSLYVNGFTHQSSVSTTSTPGTSTVDLRTSGTPSSLSSPTIMAAGPLLVPFTL NFTITNLQYGEDMGHPGSRKFNTTERVLQGLLGPIFKNTSVGPLYSGCRLTSLRSEKDGAATGVDAICIHHLDPKSPGLNRERLYWELSQLTNGIKELGPYTL DRNSLYVNGFTHRTSVPTTSTPGTSTVDLGTSGTPFSLPSPATAGPLLVLFTLNFTITNLKYEEDMHRPGSRKFNTTERVLQTLLGPMFKNTSVGLLYSGCRLT LLRSEKDGAATGVDAICTHRLDPKSPGLDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHWIPVPTSSTPGTSTVDLGSGTPSSLPSPTAAGPLLVPFTL NFTITNLQYEEDMHHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRSEKDGAATGVDAICTHRLDPKSPGVDREQLYWELSQLTNGIKELGPYT LDRNSLYVNGFTHQTSAPNTSTPGTSTVDLGTSGTPSSLPSPTSAGPLLVPFTLNFTITNLQYEEDMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRL TLLRSEKDGAATGVDAICTHRLDPKSPGVDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHQTSAPNTSTPGTSTVDLGTSGTPSSLPSPTSAGPLLVP FTLNFTITNLQYEEDMHHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGAATGMDAICSHRLDPKSPGLNREQLYWELSQLTHGIKEL GPYTLDRNSLYVNGFTHRSSVAPTSTPGTSTVDLGTSGTPSSLPSPTTAVPLLVPFTLNFTITNLQYGEDMRHPGSRKFNTTERVLQGLLGPLFKNSSVGPLYS GCRLISLRSEKDGAATGVDAICTHHLNPQSPGLDREQLYWQLSQMTNGIKELGPYTLDRNSLYVNGFTHRSSGLTTSTPWTSTVDLGTSGTPSPVPSPTTA GPLLVPFTLNFTITNLQYEEDMHRPGSRKFNTTERVLQGLLSPIFKNSSVGPLYSGCRLTSLRPEKDGAATGMDAVCLYHPNPKRPGLDREQLYWELSQLTH NITELGPYSLDRDSLYVNGFTHQNSVPTTSTPGTSTVYWATTGTPSSFPGHTEPGPLLIPFTFNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKPLFKNTS VGPLYSGCRLTSLRPEKDGAATGMDAVCLYHPNPKRPGLDREQLYWELSQLTHNITELGPYSLDRDSLYVNGFTHQNSVPTTSTPGTSTVYWATTGTPSSFP GHTEPGPLLIPFTFNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKPLFKNTSVGPLYSGCRLTLLRPEKHEAATGVDTICTHRVDPIGPGLDRERLYWELS QLTNSITELGPYTLDRDSLYVNGFNPRSSVPTTSTPGTSTVHLATSGTPSSLPGHTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKPLFK NTSVGPLYSGCRLTLLRPEKHEAATGVDTICTHRVDPIGPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXP XXTSAGPLLVPFTLNFTITNLQYEEDMHHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGAATGMDAICSHRLDPKSPGLDREQLYWE LSQLTHGIKELGPYTLDRNSLYVNGFTHRSSVAPTSTPGTSTVDLGTSGTPSSLPSPTTAVPLLVPFTLNFTITNLQYGEDMRHPGSRKFNTTERVLQGLLGPL FKNSSVGPLYSGCRLISLRSEKDGAATGVDAICTHHLNPQSPGLDREQLYWQLSQMTNGIKELGPYTLDRNSLYVNGFTHRSSGLTTSTPWTSTVDLGTSGT PSPVPSPTTAGPLLVPFTLNFTITNLQYEEDMHRPGSRKFNATERVLQGLLSPIFKNSSVGPLYSGCRLTSLRPEKDGAATGMDAVCLYHPNPKRPGLDREQL YWELSQLTHNITELGPYSLDRDSLYVNGFTHQSSMTTTRTPDTSTMHLATSRTPASLSGPTTASPLLVLFTINCTITNLQYEEDMRRTGSRKFNTMESVLQGL LKPLFKNTSVGPLYSGCRLTLLRPKKDGAATGVDAICTHRLDPKSPGLNREQLYWELSKLTNDIEELGPYTLDRNSLYVNGFTHQSSVSTTSTPGTSTVDLRTS GTPSSLSSPTIMXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVLQGLLRPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAACTYRPDPKSPGLDR EQLYWELSQLTHSITELGPYTLDRVSLYVNGFNPRSSVPTTSTPGTSTVHLATSGTPSSLPGHTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVLQ GLLKPLFRNSSLEYLYSGCRLASLRPEKDSSAMAVDAICTHRPDPEDLGLDRERLYWELSNLTNGIQELGPYTLDRNSLYVNGFTHRSSGLTTSTPWTSTVDL GTSGTPSPVPSPTTAGPLLVPFTLNFTITNLQYEEDMHRPGSRRFNTTERVLQGLLTPLFKNTSVGPLYSGCRLTLLRPEKQEAATGVDTICTHRVDPIGPGLD RERLYWELSQLTNSITELGPYTLDRDSLYVNGFNPWSSVPTTSTPGTSTVHLATSGTPSSLPGHTAPVPLLIPFTLNFTITDLHYEENMQHPGSRKFNTTERVL Muc16-4
QGLLKPLFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNSVTELGPYTLDRDSLYVNGFTHRSSVPTTSIPGTSAVHLE TSGTPASLPGHTAPGPLLVPFTLNFTITNLQYEEDMRHPGSRKFSTTERVLQGLLKPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTHRPDPKSPGLD RERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTMHLATSRTPASLSGPTTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERV LQGLLRPVFKNTSVGPLYSGCRLTTLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHSITELGPYTQDRDSLYVNGFTHRSSVPTTSIPGTSAVH LETSGTPASLPGHTAPGPLLVPFTLNFTITNLQYEEDMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKRGAATGVDTICTHRLDPLNPGL DREQLYWELSKLTRGIIELGPYLLDRGSLYVNGFTHRTSVPTTSTPGTSTVDLGTSGTPFSLPSPAXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVL QTLLGPMFKNTSVGLLYSGCRLTLLRSEKDGAATGVDAICTHRLDPKSPGVDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHWIPVPTSSTPGTSTV DLGSGTPSSLPSPTTAGPLLVPFTLNFTITNLKYEEDMHCPGSRKFNTTERVLQSLLGPMFKNTSVGPLYSGCRLTLLRSEKDGAATGVDAICTHRLDPKSPG VDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHQTSAPNTSTPGTSTVDLGTSGTPSSLPSPTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTE XVLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHWIPVPTSSTPGTSTV DLGSGTPSSLPSPTTAGPLLVPFTLNFTITNLKYEEDMHCPGSRKFNTTERVLQSLLGPMFKNTSVGPLYSGCRLTSLRSEKDGAATGVDAICTHRVDPKSPG VDREQLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHQTSAPNTSTPGTSTVXXGTSGTPSSXPXXTSAGPLLVPFTLNFTITNLQYEEDMHHPGSRKFNTT ERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGATTGMDAICTHRLDPKSPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTS TVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVLQGLLKPLFRNSSLEYLYSGCRLASLRPEKDSSAMAVDAICTHRPDPE DLGLDRERLYWELSNLTNGIQELGPYTLDRNSLYVNGFTHRSSMPTTSTPGTSTVDVGTSGTPSSSPSPTTAGPLLIPFTLNFTITNLQYGEDMGHPGSRKF NTTERVLQGLLGPIFKNTSVGPLYSGCRLTSLRSEKDGAATGVDAICIHHLDPKSPGLNRERLYWELSQLTNGIKELGPYTLDRNSLYVNGFTHRTSVPTTSTP GTSTVDLGTSGTPFSLPSPATAGPLLVLFTLNFTITNLKYEEDMHRPGSRKFNTTERVLQTLLGPMFKNTSVGLLYSGCRLTLLRSEKDGAATGVDAICTHRLD PKSPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFN TTERVLQGLLRPVFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHSITELGPYTQDRDSLYVNGFTHRSSVPTTSIPG TSAVHLETTGTPSSFPGHTEPGPLLIPFTFNFTITNLRYEENMQHPGSRKFNTTERVLQGLLTPLFKNTSVGPLYSGCRLTLLRPEKQEAATGVDTICTHRVDPI GPGLDRERLYWELSQLTNSITELGPYTLDRDSLYVDGFNPWSSVPTTSTPGTSTVHLATSGTPSPLPGHTAPVPLLIPFTLNFTITDLHYEENMQHPGSRKFN TTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNSITELGPYTLDRDSLYVNGFNPWSSVPTTSTPG TSTVHLATSGTPSSLPGHTTAGPLLVPFTLNFTITNLKYEEDMHCPGSRKFNTTERVLQSLHGPMFKNTSVGPLYSGCRLTLLRSEKDGAATGVDAICTHRLD PKSPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFN TTEXVLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTNSITELGPYTLDRDSLYVNGFTHRSSMPTTSIPGT SAVHLETSGTPASLPGHTAPGPLLVPFTLNFTITNLQYEEDMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKRGAATGVDTICTHRLDPL NPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTT EXVLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFHPRSSVPTTSTPGTST VHLATSGTPSSLPGHTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGAATGMDAICSHRLDPK SPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTT EXVLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHQNSVPTTSTPGTS TVYWATTGTPSSFPGHTEPGPLLIPFTFNFTITNLHYEENMQHPGSRKFNTTERVLQGLLTPLFKNTSVGPLYSGCRLTLLRPEKQEAATGVDTICTHRVDPI GPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTT EXVLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHRSSVPTTSSPGTST VHLATSGTPSSLPGHTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGP GLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTE Muc16-5
VLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHRTSVPTTSTPGTSTVH LATSGTPSSLPGHTAPVPLLIPFTLNFTITNLQYEEDMHRPGSRKFNTTERVLQGLLSPIFKNSSVGPLYSGCRLTSLRPEKDGAATGMDAVCLYHPNPKRPGL DREQLYCELSQLTHNITELGPYSLDRDSLYVNGFTHQNSVPTTSTPGTSTVYWATTGTPSSFPGHTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTEX VLQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHWSSGLTTSTPWTSTV DLGTSGTPSPVPSPTTAGPLLVPFTLNFTITNLQYEEDMHRPGSRKFNATERVLQGLLSPIFKNTSVGPLYSGCRLTLLRPEKQEAATGVDTICTHRVDPIGPG LXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTEXV LQGLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHRSFGLTTSTPWTSTVDL GTSGTPSPVPSPTTAGPLLVPFTLNFTITNLQYEEDMHRPGSRKFNTTERVLQGLLTPLFRNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTHRPDPKSPGLX XEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTEXVLQ GLLXPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHWIPVPTSSTPGTSTVDLGS GTPSSLPSPTTAGPLLVPFTLNFTITNLQYGEDMGHPGSRKFNTTERVLQGLLGPIFKNTSVGPLYSGCRLTSLRSEKDGAATGVDAICIHHLDPKSPGLXXEX LYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTEXVLQGLL XPXFKNXSVGXLYSGCRLTXLRXEKXGAATGXDAICXHXXXPKXPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHQTFAPNTSTPGTSTVDLGTSG TPSSLPSPTSAGPLLVPFTLNFTITNLQYEEDMHHPGSRKFNTTERVLQGLLGPMFKNTSVGLLYSGCRLTLLRPEKNGAATRVDAVCTHRPDPKSPGLXXEX LYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTVXXGTSGTPSSXPXXTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLL KPLFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNSVTELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTSAVHLETSGT PASLPGHTAPGPLLVPFTLNFTITNLQYEVDMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKRGAATGVDTICTHRLDPLNPGLDREQLY WELSKLTRGIIELGPYLLDRGSLYVNGFTHRNFVPITSTPGTSTVHLGTSETPSSLPRPIVPGPLLVPFTLNFTITNLQYEEAMRHPGSRKFNTTERVLQGLLRP LFKNTSIGPLYSSCRLTLLRPEKDKAATRVDAICTHHPDPQSPGLNREQLYWELSQLTHGITELGPYTLDRDSLYVDGFTHWSPIPTTSTPGTSIVNLGTSGIPP SLPETTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKDGVATRVDAICTHRPDPKIPGLDRQQLYWE LSQLTHSITELGPYTLDRDSLYVNGFTQRSSVPTTSTPGTFTVQPETSETPSSLPGPTATGPVLLPFTLNFTITNLQYEEDMHRPGSRKFNTTERVLQGLLMPL FKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTHRPDPKSPGLDRERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTMHLATSRT PASLSGPTTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRPVFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLY WELSQLTHSITELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLFTLNFTITNLRYEENMQHPGSRKFNTTERVLQGLLR SLFKSTSVGPLYSGCRLTLLRPEKDGTATGVDAICTHHPDPKSPRLDREQLYWELSQLTHNITELGHYALDNDSLFVNGFTHRSSVSTTSTPGTPTVYLGASKT PASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNTTERVLQGLLRPLFKNTSVGPLYSGSRLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQLYL ELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGVVSEEPFTLNFTINNLRYMADMGQPGSLKFNITDNVMKHLLSPLFQRSSLGARYTGCRVIALRS VKNGAETRVDLLCTYLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPGLDEPPTTPKPATTFLPPLSEATTAMGYHLKTLTLNFTISNLQ YSPDMGKGSATFNSTEGVLQHLLRPLFQKSSMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQLTHGVTQLGFYVLDRDSLFI NGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLLRDIQDKVTTLYKGSQLHDTFRFCLVTNLTMDSVLVTVKALFSSNLDPSLVEQVFLDKTLNASFH WLGSTYQLVDIHVTEMESSVYQPTSSSSTQHFYLNFTITNLPYSQDKAQPGTTNYQRNKRNIEDALNQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLC NFSPLARRVDRVAIYEEFLRMTRNGTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIGLAGLLGLITCLICGVLVTTRRRKKEGEYNVQQQCPGY YQSHLDLEDLQ Muc16-6
MUC16 randomly selected sequence: 41% PTS PATSQLPFSIGHITSAVTPAAMARSSGVTFSRPDPTSKKAEQTS TQLPTTTSAHPGQVPRSAATTLDVIPHTAKTPDATFQRQGQT ALTTEARATSDSWNEKEKSTPSAPWITEMMNSVSEDTIKEVTS SSSVLKDPEYAGHKLGIWDDFIPKFGKAAHMRELPLLSPPQDK EAIHPSTNTVETTGWVTSSEHASHSTIPAHSASSKLTSPVVTTS TREQAIVSMSTTTWPESTRARTEPNSFLTIELRDVSPYMDTSST TQTSIISSPGSTAITKGPRTEITSSKRISSSFLAQSMRSSDSPSEAI TRLSNFPAMTESGGMILAMQTSPPGATSLSAPTLDTSATASW TGTPLATTQRFTYSEKTTLFSKGPEDTSQPSPPSVEETSSSSSLV PIHATTSPSNILLTSQGHSPSSTPPVTSVFLSETSGLGKTTDMSR ISLEPGTSLPPNLSSTAGEALSTYEASRDTKAIHHSADTAVTNM EATSSEYSPIPGHTKPSKATSPLVTSHIMGDITSSTSVFGS
P T S 21
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
The four major mucin O-linked glycan cores [grey boxes] All start with GalNAc α-linked to Ser/Thr Common Cores β3-gal addition Core 1 β3-glcnac addition Core 3 β6-glcnac addition to Core 1 Core 2 β6-glcnac addition to Core 3 Core 4 Less common cores (note shown): α3-galnac addition Core 5 β6-glcnac addition (alone) Core 6 α6-glcnac addition (alone) Core 7 α3-gal addition (alone) Core 8 Essentials of Glycobiology Second Edition
Trivial nomenclature and key functional terminal structures Corifeld, AP (2014) Mucins: A biologically relevant glycan barrier in mucosal protection Biochim Biophys Acta in press
Trivial nomenclature and key functional terminal structures Corifeld, AP (2014) Mucins: A biologically relevant glycan barrier in mucosal protection Biochim Biophys Acta in press
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
Corifeld, AP (2014) Mucins: A biologically relevant glycan barrier in mucosal protection Biochim Biophys Acta in press McGuckin et al. (2011) Nature Reviews Microbiology 9, 265
The ppgalnact family Crystal structure of ppgalnac T2 with bound peptide substrate showing the tethered catalytic and lectin domains Mucin type O-glycosylation is initiated by a large family of polypeptide GalNAc transferases (ppgalnact s) that add α-galnac to the Ser and Thr residues of peptides. Of the 20 human isoforms, all but one are composed of two globular domains linked by a short flexible linker: a catalytic domain and a ricin-like lectin carbohydrate binding domain. Gerken T A et al. J. Biol. Chem. 2013;288:19900-19914
Mucin glycan elaboration is step-by-step by individual glycosyltransferases Resource: CFG Glycosyltransferase Pages http://functionalglycomics.org/glycomics/molecule/jsp/glycoenzyme/gemolecule.jsp
A more complex and thorough representation Ju et al (2013) Proteomics Clin. Appl. 7:618
The T-synthase (galactosyltransferase) requires a devoted chaperone, Cosmc, for function Aryal et al (2012) J Biol Chem 287:15317 Ju et al (2013) Proteomics Clin. Appl. 7:618 Mutations in Cosmc cause Tn syndrome in humans. Patients appear healthy and do not require treatment, but laboratory tests may reveal moderate hemolytic anemia and reduced numbers of thrombocytes and leukocytes.
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
Depending on the tissue, mucins are expressed by goblet cells and/or mucus gland cells Figure 3 Mucins on the respiratory epithelium. Ciliated cells and goblet cells in surface epithelium and mucus gland are shown in a simplified representation of the airway epithelium. Tethered mucins (MUC1, MUC4, and MUC16) are found both in the normal, cellassociated form as well as a secreted form in the overlying mucin raft along with MUC5AC and MUC5B. The mucin raft is composed mainly (90%) of MUC5AC (from goblet cells) and MUC5B (from mucus glands). The tethered mucins (MUC1, MUC4, and MUC16) make up approximately 10% of the mucus raft and may result from shedding and alternative splicing.
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions
Mucin functions Hydration Physical barrier Particulate transport Cellular signaling
Model of mucin clearance of particulate matter in the lung Dickey (2012) Science 337, 924 (A) Mobile mucus is continually swept out of the lungs and swallowed. (B) The mucus layer moves over an stiff periciliary layer. (C) Secretory cells release mucin polymers that travel upwards to be incorporated into the mobile gel layer. Membrane-tethered mucins at the periciliary layer are at greater density than polymeric mucins in the gel layer. (D) Densely packed sugar side chains cause membrane-tethered mucins to assume a partially extended configuration, whereas mucins in the gel layer are random entangled coils. This has been called the Gel-on-brush model.
Model of mucin clearance of particulate matter in the lung A Periciliary Brush Promotes the Lung Health by Separating the Mucus Layer from Airway Epithelia Button et al (2012) Science 337, 937 (A) Gel-on-brush model of the PCL. Schematic representation of the gel-on-brush hypothesis of the PCL: Tethered macromolecules, such as membrane-bound mucins, form a brushlike structure of the PCL. (B and C) Rapid freeze imaging of Human Bronchial Epithelial cultures exhibiting extensive meshlike structure with mesh [depicted by the arrow in (C)] on the order of ~20 to 40 nm in the PCL. Double-headed arrow in (C) = 30 nm. White box in (B) = area of magnification depicted in (C). (D) MUC1 (red) is located at the bottom of the PCL (E) MUC4 (green) spans the whole PCL. Scale bars in (B), (D), and (E), 7 μm; in (C), 100 nm.
Schematic proposal of how mucus is organized in the gut. Johansson et al. (2011) PNAS 108,4659 Hansson (2012) Curr Opin Microbiol 15,57
Bacterial infused and sterile layers of mucin on the mouse distal colon Muc2-positive goblet cells and overlaying mucus layers in the mouse distal colon detected by anti-muc2 (green) counterstained with DAPI. An inner stratified mucus layer (s) is linked via Muc2- stained threads (white arrow) to the goblet cells of the surface epithelia. Muc2 immunostaining (green) and bacterial probe (red) of distal colon. Muc2-positive goblet cells and an inner stratified (s) mucus layer is devoid of bacteria, which are detected only in the outer mucus layer. The inner mucus is a spatial separation between the cells and the microflora. (bar: 20 μm.)
Mucins Diversity and classes of mucin proteins Diversity and classes of mucin glycans Mucin biosynthesis Mucin distribution in cells and tissues Mucin functions