3 research outputs found

    Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

    Get PDF
    Despite modern sequencing efforts, the difficulty in assembly of highly repetitive sequences has prevented resolution of human genome gaps, including some in the coding regions of genes with important biological functions. One such gene, MUC5AC, encodes a large, secreted mucin, which is one of the two major secreted mucins in human airways. The MUC5AC region contains a gap in the human genome reference (hg19) across the large, highly repetitive, and complex central exon. This exon is predicted to contain imperfect tandem repeat sequences and multiple conserved cysteine-rich (CysD) domains. To resolve the MUC5AC genomic gap, we used high-fidelity long PCR followed by single molecule real-time (SMRT) sequencing. This technology yielded long sequence reads and robust coverage that allowed for de novo sequence assembly spanning the entire repetitive region. Furthermore, we used SMRT sequencing of PCR amplicons covering the central exon to identify genetic variation in four individuals. The results demonstrated the presence of segmental duplications of CysD domains, insertions/deletions (indels) of tandem repeats, and single nucleotide variants. Additional studies demonstrated that one of the identified tandem repeat insertions is tagged by nonexonic single nucleotide polymorphisms. Taken together, these data illustrate the successful utility of SMRT sequencing long reads for de novo assembly of large repetitive sequences to fill the gaps in the human genome. Characterization of the MUC5AC gene and the sequence variation in the central exon will facilitate genetic and functional studies for this critical airway mucin

    Assembly and organization of the N-terminal region of mucin MUC5AC:Indications for structural and functional distinction from MUC5B

    No full text
    Elevated levels of MUC5AC, one of the major gel-forming mucins in the lungs, are closely associated with chronic obstructive lung diseases such as chronic bronchitis and asthma. It is not known, however, how the structure and/or gel-making properties of MUC5AC contribute to innate lung defense in health and drive the formation of stagnant mucus in disease. To understand this, here we studied the biophysical properties and macromolecular assembly of MUC5AC compared to MUC5B. To study each native mucin, we used Calu3 monomucin cultures that produced MUC5AC or MUC5B. To understand the macromolecular assembly of MUC5AC through N-terminal oligomerization, we expressed a recombinant whole N-terminal domain (5ACNT). Scanning electron microscopy and atomic force microscopy imaging indicated that the two mucins formed distinct networks on epithelial and experimental surfaces; MUC5B formed linear, infrequently branched multimers, whereas MUC5AC formed tightly organized networks with a high degree of branching. Quartz crystal microbalance-dissipation monitoring experiments indicated that MUC5AC bound significantly more to hydrophobic surfaces and was stiffer and more viscoelastic as compared to MUC5B. Light scattering analysis determined that 5ACNT primarily forms disulfide-linked covalent dimers and higher-order oligomers (i.e., trimers and tetramers). Selective proteolytic digestion of the central glycosylated region of the full-length molecule confirmed that MUC5AC forms dimers and higher-order oligomers through its N terminus. Collectively, the distinct N-terminal organization of MUC5AC may explain the more adhesive and unique viscoelastic properties of branched, highly networked MUC5AC gels. These properties may generate insight into why/how MUC5AC forms a static, “tethered” mucus layer in chronic muco-obstructive lung diseases

    Genome Reference and Sequence Variation in the Large Repetitive Central Exon of Human MUC5AC

    No full text
    Despite modern sequencing efforts, the difficulty in assembly of highly repetitive sequences has prevented resolution of human genome gaps, including some in the coding regions of genes with important biological functions. One such gene, MUC5AC, encodes a large, secreted mucin, which is one of the two major secreted mucins in human airways. The MUC5AC region contains a gap in the human genome reference (hg19) across the large, highly repetitive, and complex central exon. This exon is predicted to contain imperfect tandem repeat sequences and multiple conserved cysteine-rich (CysD) domains. To resolve the MUC5AC genomic gap, we used high-fidelity long PCR followed by single molecule real-time (SMRT) sequencing. This technology yielded long sequence reads and robust coverage that allowed for de novo sequence assembly spanning the entire repetitive region. Furthermore, we used SMRT sequencing of PCR amplicons covering the central exon to identify genetic variation in four individuals. The results demonstrated the presence of segmental duplications of CysD domains, insertions/deletions (indels) of tandem repeats, and single nucleotide variants. Additional studies demonstrated that one of the identified tandem repeat insertions is tagged by nonexonic single nucleotide polymorphisms. Taken together, these data illustrate the successful utility of SMRT sequencing long reads for de novo assembly of large repetitive sequences to fill the gaps in the human genome. Characterization of the MUC5AC gene and the sequence variation in the central exon will facilitate genetic and functional studies for this critical airway mucin
    corecore