24 research outputs found

    A Self-Supervised Automatic Post-Editing Data Generation Tool

    Data building for automatic post-editing (APE) requires extensive and expert-level human effort, as it contains an elaborate process that involves identifying errors in sentences and providing suitable revisions. Hence, we develop a self-supervised data generation tool, deployable as a web application, that minimizes human supervision and constructs personalized APE data from a parallel corpus for several language pairs with English as the target language. Data-centric APE research can be conducted using this tool, involving many language pairs that have not been studied thus far owing to the lack of suitable data. Comment: Accepted for DataPerf workshop at ICML 202

    QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation

    With the recent advance in neural machine translation demonstrating its importance, research on quality estimation (QE) has been steadily progressing. QE aims to automatically predict the quality of machine translation (MT) output without reference sentences. Despite its high utility in the real world, there remain several limitations concerning manual QE data creation: inevitably incurred non-trivial costs due to the need for translation experts, and issues with data scaling and language expansion. To tackle these limitations, we present QUAK, a Korean-English synthetic QE dataset generated in a fully automatic manner. This consists of three sub-QUAK datasets, QUAK-M, QUAK-P, and QUAK-H, produced through three strategies that are relatively free from language constraints. Since each strategy requires no human effort, which facilitates scalability, we scale our data up to 1.58M for QUAK-P and QUAK-H and 6.58M for QUAK-M. As an experiment, we quantitatively analyze word-level QE results in various ways while performing statistical analysis. Moreover, we show that datasets scaled in an efficient way also contribute to performance improvements by observing meaningful performance gains in QUAK-M and QUAK-P when adding data up to 1.58M.

    Biallelic variants in COX4I1 associated with a novel phenotype resembling Leigh syndrome with developmental regression, intellectual disability, and seizures

    Autosomal recessive COX4I1 deficiency has been previously reported in a single individual with a homozygous pathogenic variant in COX4I1, who presented with short stature, poor weight gain, dysmorphic features, and features of Fanconi anemia. COX4I1 encodes subunit 4, isoform 1 of cytochrome c oxidase. Cytochrome c oxidase is a respiratory chain enzyme that plays an important role in mitochondrial electron transport and reduces molecular oxygen to water leading to the formation of ATP. Defective production of cytochrome c oxidase leads to a variable phenotypic spectrum ranging from isolated myopathy to Leigh syndrome. Here, we describe two siblings, born to consanguineous parents, who presented with encephalopathy, developmental regression, hypotonia, pathognomonic brain imaging findings resembling Leigh syndrome, and a novel homozygous variant in COX4I1, expanding the known clinical phenotype associated with pathogenic variants in COX4I1.

    GARS-related disease in infantile spinal muscular atrophy: Implications for diagnosis and treatment

    The majority of patients with spinal muscular atrophy (SMA) identified to date harbor a biallelic exonic deletion of SMN1. However, there have been reports of SMA-like disorders that are independent of SMN1, including those due to pathogenic variants in the glycyl-tRNA synthetase gene (GARS1). We report three unrelated patients with de novo variants in GARS1 that are associated with infantile-onset SMA (iSMA). Patients were ascertained during inpatient hospital evaluations for complications of neuropathy. Evaluations were completed as indicated for clinical care and management and informed consent for publication was obtained. One newly identified, disease-associated GARS1 variant, identified in two out of three patients, was analyzed by functional studies in yeast complementation assays. Genomic analyses by exome and/or gene panel and SMN1 copy number analysis of three patients identified two previously undescribed de novo missense variants in GARS1 and excluded SMN1 as the causative gene. Functional studies in yeast revealed that one of the de novo GARS1 variants results in a loss-of-function effect, consistent with other pathogenic GARS1 alleles. In sum, the patients’ clinical presentation, assessments of previously identified GARS1 variants and functional assays in yeast suggest that the GARS1 variants described here cause iSMA. GARS1 variants have been previously associated with Charcot-Marie-Tooth disease (CMT2D) and distal SMA type V (dSMAV). Our findings expand the allelic heterogeneity of GARS-associated disease and support that severe early-onset SMA can be caused by variants in this gene. Distinguishing the SMA phenotype caused by SMN1 variants from that due to pathogenic variants in other genes such as GARS1 significantly alters approaches to treatment. Peer Reviewed. https://deepblue.lib.umich.edu/bitstream/2027.42/154914/1/ajmga61544_am.pdf https://deepblue.lib.umich.edu/bitstream/2027.42/154914/2/ajmga61544.pd

    Flying Cross-Border To Entrepreneurs: Business Angels In Croatia And Slovenia

    The significant expansion of business formation is playing a key role in the transformation of transitional economies. As a result of this, and of the development of a more entrepreneurial business culture, the role of endogenous venture capital, equity market provision, and the potential for business angel involvement is also growing. Despite these entrepreneurially driven developments, and the encouragement of individuals to establish new businesses, start-up companies in Croatia and Slovenia are facing the immediate issue of raising capital. This paper undertakes a comparative analysis of business angels in Croatia and Slovenia, as they represent a key part of the response and solution to this problem. Their primary motivation is capital growth, and they seek to fill an equity gap and compensate for failures in the venture capital market wherever they appear. The study documents the current state of business angel activity and networking within the private equity market in Croatia and Slovenia, based on interviews and case studies. Therefore, it informs the analysis of key functions that business angels can play in addressing problems faced by new small businesses in an emergent economic and investment environment.

    25th annual computational neuroscience meeting: CNS-2016

    The same neuron may play different functional roles in the neural circuits to which it belongs. For example, neurons in the Tritonia pedal ganglia may participate in variable phases of the swim motor rhythms [1]. While such neuronal functional variability is likely to play a major role in the delivery of the functionality of neural systems, it is difficult to study in most nervous systems. We work on the pyloric rhythm network of the crustacean stomatogastric ganglion (STG) [2]. Typically, network models of the STG treat neurons of the same functional type as a single model neuron (e.g. PD neurons), assuming the same conductance parameters for these neurons and implying their synchronous firing [3, 4]. However, simultaneous recording of PD neurons shows differences between the timings of spikes of these neurons. This may indicate functional variability of these neurons. Here we modelled separately the two PD neurons of the STG in a multi-neuron model of the pyloric network. Our neuron models comply with known correlations between conductance parameters of ionic currents. Our results reproduce the experimental finding of increasing spike time distance between spikes originating from the two model PD neurons during their synchronised burst phase. The PD neuron with the larger calcium conductance generates its spikes before the other PD neuron. Larger potassium conductance values in the follower neuron imply longer delays between spikes, see Fig. 17. Neuromodulators change the conductance parameters of neurons and maintain the ratios of these parameters [5]. Our results show that such changes may shift the individual contribution of two PD neurons to the PD-phase of the pyloric rhythm, altering their functionality within this rhythm. Our work paves the way towards an accessible experimental and computational framework for the analysis of the mechanisms and impact of functional variability of neurons within the neural circuits to which they belong.

    Comparative Analysis of Current Approaches to Quality Estimation for Neural Machine Translation

    Quality estimation (QE) has recently gained increasing interest as it can predict the quality of machine translation results without a reference translation. QE is an annual shared task at the Conference on Machine Translation (WMT), and most recent studies have applied the multilingual pretrained language model (mPLM) to address this task. Recent studies have focused on the performance improvement of this task using data augmentation with finetuning based on a large-scale mPLM. In this study, we eliminate the effects of data augmentation and conduct a pure performance comparison between various mPLMs. Separate from the recent performance-driven QE research involved in competitions addressing a shared task, we utilize the comparison for sub-tasks from WMT20 and identify an optimal mPLM. Moreover, we demonstrate QE using the multilingual BART model, which has not yet been utilized, and conduct comparative experiments and analyses with cross-lingual language models (XLMs), multilingual BERT, and XLM-RoBERTa.