63 research outputs found
What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams
Open domain question answering (OpenQA) tasks have been recently attracting
more and more attention from the natural language processing (NLP) community.
In this work, we present the first free-form multiple-choice OpenQA dataset for
solving medical problems, MedQA, collected from the professional medical board
exams. It covers three languages: English, simplified Chinese, and traditional
Chinese, and contains 12,723, 34,251, and 14,123 questions for the three
languages, respectively. We implement both rule-based and popular neural
methods by sequentially combining a document retriever and a machine
comprehension model. Through experiments, we find that even the current best
method can only achieve 36.7\%, 42.0\%, and 70.1\% of test accuracy on the
English, traditional Chinese, and simplified Chinese questions, respectively.
We expect MedQA to present great challenges to existing OpenQA systems and hope
that it can serve as a platform to promote much stronger OpenQA models from the
NLP community in the future.Comment: Submitted to AAAI 202
Towards Creating Synthetic Data Testbeds for Research
Insurance datasets are generally private in order to protect user information, making it difficult for the ML research community to access and experiment with this data. To increase accessibility and innovation on private insurance data, we compile and share publicly available insurance datasets, analyze challenges inherent in these datasets, and propose, motivate, and evaluate a Synthetic Data sharing framework called Synthetic Insurance Data (SID) Testbed that can be used to improve ML performance on tabular datasets by allowing collaborators to generate Synthetic Data for Data Augmentation. In addition to this framework, we recognize that tabular data augmentation is not a well understood phenomenon, and we run controlled experiments to better understand how and when data augmentation improves machine learning performance in the setting of tabular data.S.M
Identification et étude de l'expression de deux gènes codant une ATPase pompe à protons de la membrane plasmique chez Nicotiana plumbaginifolia
Doctorat en sciences biologiques -- UCL, 199
Hydrophobic residues within the predicted N-terminal amphiphilic alpha-helix of a plant mitochondrial targeting presequence play a major role in in vivo import.
A deletion and mutagenesis study was performed on the mitochondrial presequence of the beta-subunit of the F(1)-ATP synthase from Nicotiana plumbaginifolia linked to the green fluorescent protein (GFP). The various constructs were tested in vivo by transient expression in tobacco protoplasts. GFP distribution in transformed cells was analysed in situ by confocal microscopy, and in vitro in subcellular fractions by Western blotting. Despite its being highly conserved in different species, deletion of the C-terminal region (residues 48-54) of the presequence did not affect mitochondrial import. Deletion of the conserved residues 40-47 and the less conserved intermediate region (residues 18-39) resulted in 60% reduction in GFP import, whereas mutation of conserved residues within these regions had little effect. Further shortening of the presequence progressively reduced import, with the construct retaining the predicted N-terminal amphiphilic alpha-helix (residues 1-12) being unable to mediate mitochondrial import. However, point mutation showed that this last region plays an important role through its basic residues and amphiphilicity, but also through its hydrophobic residues. Replacing Arg4 and Arg5 by alanine residues and shifting the Arg5 and Leu6 (in order to disturb amphiphilicity) resulted in reduction of the presequence import efficiency. The most dramatic effects were seen with single or double mutations of the four Leu residues (positions 5, 6, 10 and 11), which resulted in marked reduction or abolition of GFP import, respectively. We conclude that the N-terminal helical structure of the presequence is necessary but not sufficient for efficient mitochondrial import, and that its hydrophobic residues play an essential role in in vivo mitochondrial targeting
Identification and expression of three new Nicotiana plumbaginifolia genes which encode isoforms of a plasma-membrane H(+)-ATPase, and one of which is induced by mechanical stress.
To analyze in detail the multigene family encoding the plasma-membrane H(+)-ATPase (pma) in Nicotiana plumbaginifolia Viv., five new pma genes (pma 5-9) were isolated. Three of these (pma 6, 8, 9) were fully characterized and classified into new and independent subfamilies. Their cell-type expression was followed by the beta-glucuronidase (gusA) reporter-gene method. While the pma8-gusA transgene was not expressed in transgenic tobacco, expression of the two other transgenes (pma6- and pma9-gusA) was found to be restricted to particular cell types. In the vegetative tissues, pma6-gusA expression was limited to the head cells of the leaf short trichomes, involved in secretion, and to the cortical parenchyma of the young nodes where the developing leaves and axillary flowering stalks join the stem. In the latter tissues, gene expression was enhanced by mechanical stress, suggesting that H(+)-ATPase might be involved in the strength of the tissues and their resistance to mechanical trauma. The pma9-gusA transgene was mainly expressed in the apical meristem of adventitious roots and axillary buds as well as in the phloem tissues of the stem, in which expression depended on the developmental stage. In flowers, pma9-gusA expression was limited to the mature pollen grains and the young fertilized ovules, while that of pma6-gusA was identified in most of the organs. Reverse transcription-polymerase chain reaction of leaf and stem RNA confirmed the expression of pma 6 and 9, while pma8 was found to be expressed in both organs at a lower level. In conclusion, although pma 6 and 9 had a more restricted expression pattern than the previously characterized pma genes, they were nevertheless expressed in cell types in which H(+)-ATPase had not been previously detected
Mapping of tonoplast intrinsic proteins in maturing and germinating Arabidopsis seeds reveals dual localization of embryonic TIPs to the tonoplast and plasma membrane
We have mapped the expression of the tonoplast intrinsic protein (TIP) gene family members in Arabidopsis seeds by fluorescent protein tagging of their genomic sequences and confocal microscopy. Three isoforms (TIP1;1, TIP2;1, and TIP2;2) have distinct patterns of expression in maternal tissues (outer integument and placento-chalazal region). Two isoforms, TIP3;1 and the previously uncharacterized TIP3;2, are the only detectable TIPs in embryos during seed maturation and the early stages of seed germination. Throughout these developmental stages, both isoforms co-locate to the tonoplast of the protein storage vacuoles, but also appear to label the plasma membrane. Plasma membrane labeling is specific to TIP3;1 and TIP3;2, is independent of the position of the fluorescent protein tag, and appears to be specific to early seed maturation and early germination stages. We discuss these results in the context of the predicted distribution of aquaporins in Arabidopsis seeds
The plasma membrane proton pump ATPase: the significance of gene subfamilies.
The plasma membrane proton pump ATPase (H(+)-ATPase) plays a central role in transport across the plasma membrane. As a primary transporter, it mediates ATP-dependent H(+) extrusion to the extracellular space, thus creating pH and potential differences across the plasma membrane that activate a large set of secondary transporters. In several species, the H(+)-ATPase is encoded by a family of approximately 10 genes, classified into 5 gene subfamilies and we might ask what can this tell us about the concept, and the evolution, of gene families in plants. All the highly expressed H(+)-ATPase genes are classified into only two gene subfamilies, which diverged before the emergence of present plant species, raising the questions of the significance of the existence of these two well-conserved subfamilies and whether this is related to different kinetic or regulatory properties. Finally, what can we learn from experimental approaches that silence specific genes? In this review, we would like to discuss these questions in the light of recent data
Tonoplast intrinsic proteins and vacuolar identity
TIPs (tonoplast intrinsic proteins) have been traditionally used as markers for vacuolar identity in a variety of plant species and tissues. In the present article, we review recent attempts to compile a detailed map of TIP expression in Arabidopsis, in order to understand vacuolar identity and distribution in this model species. We discuss the general applicability of these findings. We also review the issue of the intracellular targeting of TIPs and propose key emerging questions relative to the cell biology of this protein family
Differential activation of H+-ATPase genes by an arbuscular mycorrhizal fungus in root cells of transgenic tobacco
In arbuscular mycorrhizas, H+-ATPase is active in the plant membrane around arbuscules but absent from plant mutants defective in arbuscule development (Gianinazzi-Pearson ct ai. 1995, Can J Bot 73: S526-S532), The proton-pumping H+-ATPase is encoded by a family of genes in plants. Immunocytochemical studies and promoter-gusA fusion assays were performed in transgenic tobacco (Nicotiana tabacum L.) to determine whether the periarbuscular enzyme activity results from de-novo activation of plant genes by an arbuscular mycorrhizal fungus. The H+-ATPase protein was localized in the plant membrane around arbuscule hyphae. The enzyme was absent from non-colonized cortical cells. Regulation of seven H+-ATPase genes (pma) was compared in non-mycorrhizal and mycorrhizal roots by histochemical detection of beta -glucuronidase (GUS) activity. Two genes (pma2, pma4) were induced in arbuscule-containing cells of mycorrhizal roots but not in non-mycorrhizal cortical tissues or senescent mycorrhiza. It is concluded that de-novo H+-ATPase activity in the periarbuscular membrane results from selective induction of two H+-ATPase genes, which can have diverse roles in plant-fungal interactions at the symbiotic interface
- …