67 research outputs found
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. JASS focuses on masking and reordering Japanese linguistic units known as bunsetsu, whereas ENSS is proposed based on phrase structure masking and reordering tasks. Experiments on ASPEC JapaneseāEnglish & JapaneseāChinese, Wikipedia JapaneseāChinese, News EnglishāKorean corpora demonstrate that JASS and ENSS outperform MASS and other existing language-agnostic pre-training methods by up to +2.9 BLEU points for the JapaneseāEnglish tasks, up to +7.0 BLEU points for the JapaneseāChinese tasks and up to +1.3 BLEU points for EnglishāKorean tasks. Empirical analysis, which focuses on the relationship between individual parts in JASS and ENSS, reveals the complementary nature of the subtasks of JASS and ENSS. Adequacy evaluation using LASER, human evaluation, and case studies reveals that our proposed methods significantly outperform pre-training methods without injected linguistic knowledge and they have a larger positive impact on the adequacy as compared to the fluency
Lightweight Cross-Lingual Sentence Representation Learning
Large-scale models for learning fixed-dimensional cross-lingual sentence
representations like LASER (Artetxe and Schwenk, 2019b) lead to significant
improvement in performance on downstream tasks. However, further increases and
modifications based on such large-scale models are usually impractical due to
memory limitations. In this work, we introduce a lightweight dual-transformer
architecture with just 2 layers for generating memory-efficient cross-lingual
sentence representations. We explore different training tasks and observe that
current cross-lingual training tasks leave a lot to be desired for this shallow
architecture. To ameliorate this, we propose a novel cross-lingual language
model, which combines the existing single-word masked language model with the
newly proposed cross-lingual token-level reconstruction task. We further
augment the training task by the introduction of two computationally-lite
sentence-level contrastive learning tasks to enhance the alignment of
cross-lingual sentence representation space, which compensates for the learning
bottleneck of the lightweight transformer for generative tasks. Our comparisons
with competing models on cross-lingual sentence retrieval and multilingual
document classification confirm the effectiveness of the newly proposed
training tasks for a shallow model.Comment: ACL 202
Recommended from our members
Reversible Interlayer Sliding and Conductivity Changes in Adaptive Tetrathiafulvalene-Based Covalent Organic Frameworks.
Ordered interlayer stacking is intrinsic in two-dimensional covalent organic frameworks (2D COFs) and has strong implications on COF's optoelectronic properties. Reversible interlayer sliding, corresponding to shearing of 2D layers along their basal plane, is an appealing dynamic control of both structures and properties, yet it remains unexplored in the 2D COF field. Herein, we demonstrate that the reversible interlayer sliding can be realized in an imine-linked tetrathiafulvalene (TTF)-based COF TTF-DMTA. The solvent treatment induces crystalline phase changes between the proposed staircase-like sql net structure and a slightly slipped eclipsed sql net structure. The solvation-induced crystallinity changes correlate well with reversible spectroscopic and electrical conductivity changes as demonstrated in oriented COF thin films. In contrast, no reversible switching is observed in a related TTF-TA COF, which differs from TTF-DMTA in terms of the absence of methoxy groups on the phenylene linkers. This work represents the first 2D COF example of which eclipsed and staircase-like aggregated states are interchangeably accessed via interlayer sliding, an uncharted structural feature that may enable applications such as chemiresistive sensors
Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today
Recent investigations show that large language models (LLMs), specifically
GPT-4, not only have remarkable capabilities in common Natural Language
Processing (NLP) tasks but also exhibit human-level performance on various
professional and academic benchmarks. However, whether GPT-4 can be directly
used in practical applications and replace traditional artificial intelligence
(AI) tools in specialized domains requires further experimental validation. In
this paper, we explore the potential of LLMs such as GPT-4 to outperform
traditional AI tools in dementia diagnosis. Comprehensive comparisons between
GPT-4 and traditional AI tools are conducted to examine their diagnostic
accuracy in a clinical setting. Experimental results on two real clinical
datasets show that, although LLMs like GPT-4 demonstrate potential for future
advancements in dementia diagnosis, they currently do not surpass the
performance of traditional AI tools. The interpretability and faithfulness of
GPT-4 are also evaluated by comparison with real doctors. We discuss the
limitations of GPT-4 in its current state and propose future research
directions to enhance GPT-4 in dementia diagnosis.Comment: 16 pages, 6 figure
Association of APOE Īµ4/Īµ4 with fluid biomarkers in patients from the PUMCH dementia cohort
BackgroundApolipoprotein-E (APOE) Īµ4 is a major genetic risk factor for Alzheimerās disease (AD). Current studies, which were mainly based on the clinical diagnosis rather than biomarkers, come to inconsistent conclusions regarding the associations of APOE Īµ4 homozygotes (APOE Īµ4/Īµ4) and cerebrospinal fluid (CSF) biomarkers of AD. In addition, few studies have explored the associations of APOE Īµ4/Īµ4 with plasma biomarkers. Therefore, we aimed to investigate the associations of APOE Īµ4/Īµ4 with fluid biomarkers in dementia and biomarker-diagnosed AD.MethodsA total of 297 patients were enrolled. They were classified into Alzheimerās continuum, AD, and non-AD, according to CSF biomarkers and/or Ī² amyloid PET results. AD was a subgroup of the AD continuum. Plasma Amyloid Ī² (AĪ²) 40, AĪ²42, glial fibrillary acidic protein (GFAP), neurofilament light chain (NFL), and phosphorylated tau (P-tau)181 were quantified in 144 of the total population using an ultra-sensitive Simoa technology. We analyzed the associations of APOE Īµ4/Īµ4 on CSF and plasma biomarkers in dementia and biomarker diagnosed AD.ResultsBased on the biomarker diagnostic criteria, 169 participants were diagnosed with Alzheimerās continuum and 128 individuals with non-AD, and among the former, 120 patients with AD. The APOE Īµ4/Īµ4 frequencies were 11.8% (20/169), 14.2% (17/120), and 0.8% (1/128) in Alzheimerās continuum, AD and non-AD, respectively. Only CSF AĪ²42 was shown to be decreased in APOE Īµ4/Īµ4 carriers than in non-carriers for patients with AD (pā=ā0.024). Furthermore, we did not find any associations of APOE Īµ4 with plasma biomarkers of AD and non-AD. Interestingly, we found that in non-AD patients, APOE Īµ4 carriers had lower CSF AĪ²42 (pā=ā0.018) and higher T-tau/AĪ²42 ratios (pā<ā0.001) and P-tau181/AĪ²42 ratios (pā=ā0.002) than non-carriers.ConclusionOur data confirmed that of the three groups (AD continuum, AD, and non-AD), those with AD had the highest frequency of APOE É4/É4 genotypes. The APOE É4/É4 was associated with CSF levels of AĪ²42 but not tau for AD and non-AD, suggesting that APOE É4/É4 affected the AĪ² metabolism of both. No associations between APOE Īµ4/É4 and plasma biomarkers of AD and non-AD were found
CRAFTS for Fast Radio Bursts : extending the dispersion-fluence relation with new FRBs detected by FAST
We report three new FRBs discovered by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), namely FRB 181017.J0036+11, FRB 181118, and FRB 181130, through the Commensal Radio Astronomy FAST Survey (CRAFTS). Together with FRB 181123, which was reported earlier, all four FAST-discovered FRBs share the same characteristics of low fluence (1000 pc cm(-3)), consistent with the anticorrelation between DM and fluence of the entire FRB population. FRB 181118 and FRB 181130 exhibit band-limited features. FRB 181130 is prominently scattered (tau(s) 8 ms) at 1.25 GHz. FRB 181017.J0036+11 has full-bandwidth emission with a fluence of 0.042 Jy ms, which is one of the faintest FRB sources detected so far. CRAFTS has started to build a new sample of FRBs that fills the region for more distant and fainter FRBs in the fluence-DME diagram, previously out of reach of other surveys. The implied all-sky event rate of FRBs is 1.24(-0.90)(+1.94) x 5 sky(-1) day(-1) at the 95% confidence interval above 0.0146 Jy ms. We also demonstrate here that the probability density function of CRAFTS FRB detections is sensitive to the assumed intrinsic FRB luminosity function and cosmological evolution, which may be further constrained with more discoveries
Atypical radio pulsations from magnetar SGR 1935+2154
Magnetars are neutron stars with extremely strong magnetic fields, frequently
powering high-energy activity in X-rays. Pulsed radio emission following some
X-ray outbursts have been detected, albeit its physical origin is unclear. It
has long been speculated that the origin of magnetars' radio signals is
different from those from canonical pulsars, although convincing evidence is
still lacking. Five months after magnetar SGR 1935+2154's X-ray outburst and
its associated Fast Radio Burst (FRB) 20200428, a radio pulsar phase was
discovered. Here we report the discovery of X-ray spectral hardening associated
with the emergence of periodic radio pulsations from SGR 1935+2154 and a
detailed analysis of the properties of the radio pulses. The complex radio
pulse morphology, which contains both narrow-band emission and frequency
drifts, has not been seen before in other magnetars, but is similar to those of
repeating FRBs - even though the luminosities are many orders of magnitude
different. The observations suggest that radio emission originates from the
outer magnetosphere of the magnetar, and the surface heating due to the
bombardment of inward-going particles from the radio emission region is
responsible for the observed X-ray spectral hardening.Comment: 47 pages, 11 figure
Next-Generation Sequencing of Cerebrospinal Fluid for the Diagnosis of Neurocysticercosis
Background: Neurocysticercosis (NCC) is the most common helminthic infection of the central nervous system (CNS). The diagnosis of NCC is sometimes challenging due to its heterogenous clinical manifestations and the variable sensitivity and specificity of neuroimaging and serological tests.Methods: Next-generation sequencing (NGS) of cerebrospinal fluid (CSF) was used to detect pathogens in patients with clinically suspected CNS infections. A series of patients diagnosed with NCC is reviewed here.Results: Using NGS of CSF, four patients were diagnosed with NCC. The reads corresponding to Taenia solium ranged from 478 to 117,362, with genomic coverage of 0.0564ā11.15%. Reads corresponding to T. solium were not found in non-template controls and far exceeded those of the background microorganisms in patients with NCC, facilitating the interpretation of the NGS results.Conclusions: This case series demonstrates that NGS of CSF is promising in the diagnosis of NCC in difficult to diagnose cases. Larger studies are needed in the future
Rheumatoid meningitis: a rare neurological complication of rheumatoid arthritis
ObjectiveTo describe the clinical and neuroimaging characteristics of rheumatoid meningitis (RM) in Chinese patients. MethodsThe patients admitted to our hospital with the diagnosis of RM in the past 8 years were retrospectively analyzed. ResultsSix patients with RM were identified among 933 patients admitted with rheumatoid arthritis (RA). The symptoms of meningitis occurred after onset of arthritis in five patients and before onset in one. Headache (n=6), hyperacute focal neurological deficits (n=4) and seizures (n=3) were the most prevalent symptoms. The nadir modified Rankin Scale score was ā„3 in five patients. Rheumatoid factor was elevated in all patients, and interleukin-6 levels in cerebrospinal fluid were dramatically elevated in three of four tested patients. Magnetic resonance imaging of the brain revealed that the meninges were affected in all patients and the cerebral parenchyma was affected in one patient. The lesions were generally located in the frontoparietal region and showed restricted diffusion along the adjacent subarachnoid space. RM occurred during disease-modifying therapy in four patients. In the acute episode, three patients improved on tocilizumab and the other three improved on pulse corticosteroids. For maintenance therapy, two patients received combined therapy of tocilizumab and other immunosuppressive agents, one received adalimumab and methotrexate, and two received low-dose oral corticosteroids with an immunosuppressive agent. Five patients had a good outcome, and one died of Pneumocystis jirovecii pneumonia after stabilization of his neurologic conditions. No relapse of RM occurred on immunotherapy during follow-up. ConclusionsChinese patients with RM share some remarkable clinical and neuroimaging features and respond well to appropriate immunotherapy. Tocilizumab could be a treatment option for this severe complication of RA
- ā¦