181 research outputs found
Exploring the Impact of Opinion Polarization on Short Video Consumption
Investigating the increasingly popular domain of short video consumption,
this study focuses on the impact of Opinion Polarization (OP), a significant
factor in the digital landscape influencing public opinions and social
interactions. We analyze OP's effect on viewers' perceptions and behaviors,
finding that traditional feedback metrics like likes and watch time fail to
fully capture and measure OP. Addressing this gap, our research utilizes
Electroencephalogram (EEG) signals to introduce a novel, non-invasive approach
for evaluating neural responses to OP, affecting perception and cognition.
Empirical analysis reveals OP's considerable impact on viewers' emotions,
evidenced by changes in brain activity. Our findings also highlight the
potential of EEG data in predicting exposure to polarized short video content,
offering a new perspective on the dynamics of short video consumption and a
unique method for quantifying OP's effects.Comment: 9 pages, 8 figure
Core Point Pixel-Level Localization by Fingerprint Features in Spatial Domain
Singular point detection is a primary step in fingerprint recognition, especially for fingerprint alignment and classification. But in present there are still some problems and challenges such as more false-positive singular points or inaccurate reference point localization. This paper proposes an accurate core point localization method based on spatial domain features of fingerprint images from a completely different viewpoint to improve the fingerprint core point displacement problem of singular point detection. The method first defines new fingerprint features, called furcation and confluence, to represent specific ridge/valley distribution in a core point area, and uses them to extract the innermost Curve of ridges. The summit of this Curve is regarded as the localization result. Furthermore, an approach for removing false Furcation and Confluence based on their correlations is developed to enhance the method robustness. Experimental results show that the proposed method achieves satisfactory core localization accuracy in a large number of samples
Galaxy Morphology Classification Using Multi-Scale Convolution Capsule Network
The classification of galaxy morphology is a hot issue in astronomical
research. Although significant progress has been made in the last decade in
classifying galaxy morphology using deep learning technology, there are still
some deficiencies in spatial feature representation and classification
accuracy. In this study, we present a multi-scale convolutional capsule network
(MSCCN) model for the classification of galaxy morphology. First, this model
improves the convolutional layers through using a multi-branch structure to
extract multi-scale hidden features of galaxy images. In order to further
explore the hidden information in the features, the multi-scale features are
encapsulated and fed into the capsule layer. Second, we use a sigmoid function
to replace the softmax function in dynamic routing, which can enhance the
robustness of MSCCN. Finally, the classification model achieving 97% accuracy,
96% precision, 98% recall, and 97% F1-score under macroscopic averaging. In
addition, a more comprehensive model evaluation were accomplished in this
study. We visualized the morphological features for the part of sample set,
which using the t-distributed stochastic neighbor embedding (t-SNE) algorithm.
The results shows that the model has the better generalization ability and
robustness, it can be effectively used in the galaxy morphological
classification
Can Large Language Models Infer Causation from Correlation?
Causal inference is one of the hallmarks of human intelligence. While the
field of CausalNLP has attracted much interest in the recent years, existing
causal inference datasets in NLP primarily rely on discovering causality from
empirical knowledge (e.g., commonsense knowledge). In this work, we propose the
first benchmark dataset to test the pure causal inference skills of large
language models (LLMs). Specifically, we formulate a novel task Corr2Cause,
which takes a set of correlational statements and determines the causal
relationship between the variables. We curate a large-scale dataset of more
than 400K samples, on which we evaluate seventeen existing LLMs. Through our
experiments, we identify a key shortcoming of LLMs in terms of their causal
inference skills, and show that these models achieve almost close to random
performance on the task. This shortcoming is somewhat mitigated when we try to
re-purpose LLMs for this skill via finetuning, but we find that these models
still fail to generalize -- they can only perform causal inference in
in-distribution settings when variable names and textual expressions used in
the queries are similar to those in the training set, but fail in
out-of-distribution settings generated by perturbing these queries. Corr2Cause
is a challenging task for LLMs, and would be helpful in guiding future research
on improving LLMs' pure reasoning skills and generalizability. Our data is at
https://huggingface.co/datasets/causalnlp/corr2cause. Our code is at
https://github.com/causalNLP/corr2cause
Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions
Legal case retrieval aims to help legal workers find relevant cases related
to their cases at hand, which is important for the guarantee of fairness and
justice in legal judgments. While recent advances in neural retrieval methods
have significantly improved the performance of open-domain retrieval tasks
(e.g., Web search), their advantages have not been observed in legal case
retrieval due to their thirst for annotated data. As annotating large-scale
training data in legal domains is prohibitive due to the need for domain
expertise, traditional search techniques based on lexical matching such as
TF-IDF, BM25, and Query Likelihood are still prevalent in legal case retrieval
systems. While previous studies have designed several pre-training methods for
IR models in open-domain tasks, these methods are usually suboptimal in legal
case retrieval because they cannot understand and capture the key knowledge and
data structures in the legal corpus. To this end, we propose a novel
pre-training framework named Caseformer that enables the pre-trained models to
learn legal knowledge and domain-specific relevance information in legal case
retrieval without any human-labeled data. Through three unsupervised learning
tasks, Caseformer is able to capture the special language, document structure,
and relevance patterns of legal case documents, making it a strong backbone for
downstream legal case retrieval tasks. Experimental results show that our model
has achieved state-of-the-art performance in both zero-shot and full-data
fine-tuning settings. Also, experiments on both Chinese and English legal
datasets demonstrate that the effectiveness of Caseformer is
language-independent in legal case retrieval
The expression patterns and correlations of claudin-6, methy-CpG binding protein 2, DNA methyltransferase 1, histone deacetylase 1, acetyl-histone H3 and acetyl-histone H4 and their clinicopathological significance in breast invasive ductal carcinomas
<p>Abstract</p> <p>Background</p> <p>Claudin-6 is a candidate tumor suppressor gene in breast cancer, and has been shown to be regulated by DNA methylation and histone modification in breast cancer lines. However, the expression of claudin-6 in breast invasive ductal carcinomas and correlation with clinical behavior or expression of other markers is unclear. We considered that the expression pattern of claudin-6 might be related to the expression of DNA methylation associated proteins (methyl-CpG binding protein 2 (MeCP2) and DNA methyltransferase 1 (DNMT1)) and histone modification associated proteins (histone deacetylase 1 (HDAC1), acetyl-histone H3 (H3Ac) and acetyl- histone H4 (H4Ac)).</p> <p>Methods</p> <p>We have investigated the expression of claudin-6, MeCP2, HDAC1, H3Ac and H4Ac in 100 breast invasive ductal carcinoma tissues and 22 mammary gland fibroadenoma tissues using immunohistochemistry.</p> <p>Results</p> <p>Claudin-6 protein expression was reduced in breast invasive ductal carcinomas (<it>P </it>< 0.001). In contrast, expression of MeCP2 (<it>P </it>< 0.001), DNMT1 (<it>P </it>= 0.001), HDAC1 (<it>P </it>< 0.001) and H3Ac (<it>P </it>= 0.004) expressions was increased. Claudin-6 expression was inversely correlated with lymph node metastasis (<it>P </it>= 0.021). Increased expression of HDAC1 was correlated with histological grade (<it>P </it>< 0.001), age (<it>P </it>= 0.004), clinical stage (<it>P </it>= 0.007) and lymph node metastasis (<it>P </it>= 0.001). H3Ac expression was associated with tumor size (<it>P </it>= 0.044) and clinical stage of cancers (<it>P </it>= 0.034). MeCP2, DNMT1 and H4Ac expression levels did not correlate with any of the tested clinicopathological parameters (<it>P </it>> 0.05). We identified a positive correlation between MeCP2 protein expression and H3Ac and H4Ac protein expression.</p> <p>Conclusions</p> <p>Our results show that claudin-6 protein is significantly down-regulated in breast invasive ductal carcinomas and is an important correlate with lymphatic metastasis, but claudin-6 down-regulation was not correlated with upregulation of the methylation associated proteins (MeCP2, DNMT1) or histone modification associated proteins (HDAC1, H3Ac, H4Ac). Interestingly, the expression of MeCP2 was positively correlated with the expression of H3Ac and H3Ac protein expression was positively correlated with the expression of H4Ac in breast invasive ductal carcinoma</p> <p>Virtual slides</p> <p>The virtual slide(s) for this article can be found here: <url>http://www.diagnosticpathology.diagnomx.eu/vs/4549669866581452</url></p
Identification and Function Prediction of Novel MicroRNAs in Laoshan Dairy Goats
MicroRNAs are a class of endogenous small RNAs that play important roles in post-transcriptional gene regulation by directing degradation of mRNAs or facilitating repression of target gene translation. In this study, three small RNA cDNA libraries from the mammary gland tissues of Laoshan dairy goats (Capra hircus) were constructed and sequenced, individually. Through Solexa high-throughput sequencing and bioinformatics analysis, we obtained 50 presumptive novel miRNAs candidates, and 55,448 putative target genes were predicted. GO annotations and KEGG pathway analyses showed the majority of target genes were involved in various biological processes and metabolic pathways. Our results discovered more information about the regulation network between miRNAs and mRNAs and paved a foundation for the molecular genetics of mammary gland development in goats
Enhancing Localization of Mobile Robots in Distributed Sensor Environments for Reliable Proximity Service Applications
Mobile robots can effectively coordinate information among sensor nodes in a distributed physical proximity. Accurately locating the mobile robots in such a distributed scenario is an essential requirement, such that the mobile robots can be instructed to coordinate with the appropriate sensor nodes. Packet loss is one of the prevailing issues on such wireless sensor network-based mobile robot localization applications. The packet loss might result from node failure, data transmission delay, and communication channel instability, which could significantly affect the transmission quality of the wireless signals. Such issues affect the localization accuracy of the mobile robot applications to an overwhelming margin, causing localization failures. To this end, this paper proposes an improved Unscented Kalman Filter-based localization algorithm to reduce the impacts of packet loss in the localization process. Rather than ignoring the missing measurements caused by packet loss, the proposed algorithm exploits the calculated measurement errors to estimate and compensate for the missing measurements. Some simulation experiments are conducted by subjecting the proposed algorithm with various packet loss rates, to evaluate its localization accuracy. The simulations demonstrate that the average localization error of the robot is 0.39 m when the packet loss rate is less than 90%, and the average running time of each iteration is 0.295 ms. The achieved results show that the proposed algorithm exhibits significant tolerance to packet loss while locating mobile robots in real-time, to achieve reliable localization accuracy and outperforms the existing UKF algorithm
- …