188 research outputs found

    Mixture of Soft Prompts for Controllable Data Generation

    Large language models (LLMs) effectively generate fluent text when the target output follows natural language patterns. However, structured prediction tasks confine the output format to a limited ontology, causing even very large models to struggle since they were never trained with such restrictions in mind. The difficulty of using LLMs for direct prediction is exacerbated in few-shot learning scenarios, which commonly arise due to domain shift and resource limitations. We flip the problem on its head by leveraging the LLM as a tool for data augmentation rather than direct prediction. Our proposed Mixture of Soft Prompts (MSP) serves as a parameter-efficient procedure for generating data in a controlled manner. Denoising mechanisms are further applied to improve the quality of synthesized data. Automatic metrics show our method is capable of producing diverse and natural text, while preserving label semantics. Moreover, MSP achieves state-of-the-art results on three benchmarks when compared against strong baselines. Our method offers an alternate data-centric approach for applying LLMs to complex prediction tasks. Comment: 19 pages, 13 Tables, 2 Figures. Accepted at EMNLP 202

    Population-genomic analysis identifies a low rate of global adaptive fixation in the proteins of the cyclical parthenogen Daphnia magna

    Daphnia are well-established ecological and evolutionary models, and the interaction between D. magna and its microparasites is widely considered a paragon of the host-parasite coevolutionary process. Like other well-studied arthropods such as Drosophila melanogaster and Anopheles gambiae, D. magna is a small, widespread, and abundant species that is therefore expected to display a large long-term population size and high rates of adaptive protein evolution. However, unlike these other species, D. magna is cyclically asexual and lives in a highly structured environment (ponds and lakes) with moderate levels of dispersal, both of which are predicted to impact upon long-term effective population size and adaptive protein evolution. To investigate patterns of adaptive protein fixation, we produced the complete coding genomes of 36 D. magna clones sampled from across the European range (Western Palaearctic), along with draft sequences for the close relatives D. similis and D. lumholtzi, used as outgroups. We analyzed genome-wide patterns of adaptive fixation, with a particular focus on genes that have an a priori expectation of high rates, such as those likely to mediate immune responses, RNA interference against viruses and transposable elements, and those with a strongly male-biased expression pattern. We find that, as expected, D. magna displays high levels of diversity and that this is highly structured among populations. However, compared to Drosophila, we find that D. magna proteins appear to have a high proportion of weakly deleterious variants and do not show evidence of pervasive adaptive fixation across its entire range. This is true of the genome as a whole, and also of putative ‘arms race’ genes that often show elevated levels of adaptive substitution in other species. 
    In addition to the likely impact of extensive, and previously documented, local adaptation, we speculate that these findings may reflect reduced efficacy of selection associated with cyclical asexual reproduction.

    Protein Tyrosine Phosphatase-PEST and β8 Integrin Regulate Spatiotemporal Patterns of RhoGDI1 Activation in Migrating Cells

    Directional cell motility is essential for normal development and physiology, although how motile cells spatiotemporally activate signaling events remains largely unknown. Here, we have characterized an adhesion and signaling unit composed of protein tyrosine phosphatase (PTP)-PEST and the extracellular matrix (ECM) adhesion receptor β8 integrin that plays essential roles in directional cell motility. β8 integrin and PTP-PEST form protein complexes at the leading edge of migrating cells and balance patterns of Rac1 and Cdc42 signaling by controlling the subcellular localization and phosphorylation status of Rho GDP dissociation inhibitor 1 (RhoGDI1). Translocation of Src-phosphorylated RhoGDI1 to the cell's leading edge promotes local activation of Rac1 and Cdc42, whereas dephosphorylation of RhoGDI1 by integrin-bound PTP-PEST promotes RhoGDI1 release from the membrane and sequestration of inactive Rac1/Cdc42 in the cytoplasm. Collectively, these data reveal a finely tuned regulatory mechanism for controlling signaling events at the leading edge of directionally migrating cells.

    Mutations in GRHL2 result in an autosomal-recessive ectodermal dysplasia syndrome

    Grainyhead-like 2, encoded by GRHL2, is a member of a highly conserved family of transcription factors that play essential roles during epithelial development. Haploinsufficiency for GRHL2 has been implicated in autosomal-dominant deafness, but mutations have not yet been associated with any skin pathology. We investigated two unrelated Kuwaiti families in which a total of six individuals have had lifelong ectodermal defects. The clinical features comprised nail dystrophy or nail loss, marginal palmoplantar keratoderma, hypodontia, enamel hypoplasia, oral hyperpigmentation, and dysphagia. In addition, three individuals had sensorineural deafness, and three had bronchial asthma. Taken together, the features were consistent with an unusual autosomal-recessive ectodermal dysplasia syndrome. Because of consanguinity in both families, we used whole-exome sequencing to search for novel homozygous DNA variants and found GRHL2 mutations common to both families: affected subjects in one family were homozygous for c.1192T>C (p.Tyr398His) in exon 9, and subjects in the other family were homozygous for c.1445T>A (p.Ile482Lys) in exon 11. Immortalized keratinocytes (p.Ile482Lys) showed altered cell morphology, impaired tight junctions, adhesion defects, and cytoplasmic translocation of GRHL2. Whole-skin transcriptomic analysis (p.Ile482Lys) disclosed changes in genes implicated in networks of cell-cell and cell-matrix adhesion. Our clinical findings of an autosomal-recessive ectodermal dysplasia syndrome provide insight into the role of GRHL2 in skin development, homeostasis, and human disease.

    A Computational Method for Prediction of Excretory Proteins and Application to Identification of Gastric Cancer Markers in Urine

    A novel computational method for prediction of proteins excreted into urine is presented. The method is based on the identification of a list of distinguishing features between proteins found in the urine of healthy people and proteins deemed not to be urine excretory. These features are used to train a classifier to distinguish the two classes of proteins. When used in conjunction with information on which proteins are differentially expressed in diseased tissues of a specific type versus control tissues, this method can be used to predict potential urine markers for the disease. Here we report the detailed algorithm of this method and an application to identification of urine markers for gastric cancer. The performance of the trained classifier on 163 proteins was experimentally validated using antibody arrays, achieving >80% true positive rate. By applying the classifier to differentially expressed genes in gastric cancer vs normal gastric tissues, it was found that endothelial lipase (EL) was substantially suppressed in the urine samples of 21 gastric cancer patients versus 21 healthy individuals. Overall, we have demonstrated that our predictor for urine excretory proteins is highly effective and could potentially serve as a powerful tool in searches for disease biomarkers in urine in general.

    Increased prevalence of methicillin-resistant Staphylococcus aureus nasal colonization in household contacts of children with community acquired disease

    Background: To measure methicillin-resistant Staphylococcus aureus (MRSA) nasal colonization prevalence in household contacts of children with current community-associated (CA)-MRSA infections (study group) in comparison with a group of household contacts of children without suspected Staphylococcus aureus infection (control group). Methods: This is a cross-sectional study. Cultures of the anterior nares were taken. Relatedness of isolated strains was tested using pulsed-field gel electrophoresis (PFGE). Results: The prevalence of MRSA colonization in the study group was significantly higher than in the control group (18/77 (23%) vs 3/77 (3.9%); p ≤ 0.001). The prevalence of S. aureus (SA) colonization was 28/77 (36%) in the study group and 16/77 (21%) in the control group (p = 0.032). The prevalence of SA nasal colonization among patients was 6/24 (25%); one with methicillin-susceptible S. aureus (MSSA) and 5 with MRSA. In the study (patient) group, 14/24 (58%) families had at least one household member who was colonized with MRSA compared to 2/29 (6.9%) in the control group (p = 0.001). Of 69 total isolates tested by PFGE, 40 (58%) were related to USA300. Panton-Valentine leukocidin (PVL) genes were detected in 30/52 (58%) tested isolates. Among the families with ≥1 contact colonized with MRSA, similar PFGE profiles were found between the index patient and a contact in 10/14 families. Conclusions: Prevalence of asymptomatic nasal carriage of MRSA is higher among household contacts of patients with CA-MRSA disease than in the control group. Decolonizing such carriers may help prevent recurrent CA-MRSA infections.