Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis
In recent years, pretrained language models have revolutionized the NLP world, achieving state-of-the-art performance in various downstream tasks. However, in many cases, these models do not perform well when labeled data is scarce and the model is expected to perform in the zero- or few-shot setting. Recently, several works have shown that continual pretraining, or performing a second phase of pretraining (inter-training) that is better aligned with the downstream task, can lead to improved results, especially in the scarce-data setting. Here, we propose to leverage sentiment-carrying discourse markers to generate large-scale weakly labeled data, which in turn can be used to adapt language models for sentiment analysis. Extensive experimental results show the value of our approach on various benchmark datasets, including ones from the finance domain. Code, models and data are available at https://github.com/ibm/tslm-discourse-markers
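The weak-labeling idea in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the marker lists, the function names `weak_label` and `build_weak_dataset`, and the choice to strip the marker from the text are hypothetical and are not taken from the authors' released code. The sketch assigns a sentiment label to a sentence based only on the discourse marker that opens it.

```python
import re

# Hypothetical marker lists for illustration only; not the authors' lexicon.
POSITIVE_MARKERS = {"fortunately", "luckily", "happily"}
NEGATIVE_MARKERS = {"unfortunately", "sadly", "regrettably"}

# Matches a sentence that opens with a single word followed by a comma.
_MARKER_RE = re.compile(r"^\s*([A-Za-z]+)\s*,\s*(.+)$", re.DOTALL)


def weak_label(sentence):
    """Return (text_without_marker, label), or None if no known marker opens the sentence."""
    match = _MARKER_RE.match(sentence)
    if match is None:
        return None
    marker = match.group(1).lower()
    rest = match.group(2).strip()
    if marker in POSITIVE_MARKERS:
        return rest, "positive"
    if marker in NEGATIVE_MARKERS:
        return rest, "negative"
    return None


def build_weak_dataset(sentences):
    """Keep only the sentences that open with a known sentiment-carrying marker."""
    labeled = []
    for sentence in sentences:
        result = weak_label(sentence)
        if result is not None:
            labeled.append(result)
    return labeled
```

Applied to a large unlabeled corpus, a filter of this kind would yield noisy (text, label) pairs that could serve as data for a second pretraining phase. Removing the marker from the text, so that a model cannot simply read the label off the first word, is a design choice assumed here and not stated in the abstract.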
Nonlinear Optical Imaging and Raman Microspectrometry of the Cell Nucleus throughout the Cell Cycle
A fundamental understanding of cellular processes at the molecular level is of considerable importance in cell biology, as well as in biomedical disciplines for early diagnosis of infection and cancer and for developing new molecular medicine-based therapies. Modern biophotonics offers exclusive capabilities to obtain information on molecular composition, organization, and dynamics in a cell by combining optical spectroscopy and optical imaging. We introduce here a combination of Raman microspectrometry with coherent anti-Stokes Raman scattering (CARS) and two-photon excited fluorescence (TPEF) nonlinear optical microscopy to study the macromolecular organization of the nucleus throughout the cell cycle. Site-specific concentrations of proteins, DNA, RNA, and lipids were determined in nucleoli, nucleoplasmic transcription sites, nuclear speckles, constitutive heterochromatin domains, mitotic chromosomes, and extrachromosomal regions of mitotic cells by quantitative confocal Raman microspectrometry. A surprising finding of our study is that the local concentration of proteins does not increase during DNA compaction. We also demonstrate that postmitotic DNA decondensation is a gradual process, continuing for several hours. The quantitative Raman spectroscopic analysis was corroborated with CARS/TPEF multimodal imaging to visualize the distribution of protein, DNA, RNA, and lipid macromolecules throughout the cell cycle.
Ultrafast isomerization initiated by X-ray core ionization
Rapid proton migration is a key process in hydrocarbon photochemistry. Charge migration and subsequent proton motion can mitigate radiation damage when heavier atoms absorb X-rays. If rapid enough, this can improve the fidelity of diffract-before-destroy measurements of biomolecular structure at X-ray free-electron lasers. Here we study X-ray-initiated isomerization of acetylene, a model for proton dynamics in hydrocarbons. Our time-resolved measurements capture the transient motion of protons following X-ray ionization of carbon K-shell electrons. We Coulomb-explode the molecule with a second precisely delayed X-ray pulse and then record all the fragment momenta. These snapshots at different delays are combined into a ‘molecular movie’ of the evolving molecule, which shows substantial proton redistribution within the first 12 fs. We conclude that significant proton motion occurs on a timescale comparable to the Auger relaxation that refills the K-shell vacancy.