26,985 research outputs found
Mining frequent biological sequences based on bitmap without candidate sequence generation
Biological sequences carry a lot of important genetic information of organisms. Furthermore, there is an inheritance law related to protein function and structure which is useful for applications such as disease prediction. Frequent sequence mining is a core technique for association rule discovery, but existing algorithms suffer from low efficiency or poor error rate because biological sequences differ from general sequences with more characteristics. In this paper, an algorithm for mining Frequent Biological Sequence based on Bitmap, FBSB, is proposed. FBSB uses bitmaps as the simple data structure and transforms each row into a quicksort list QS-list for sequence growth. For the continuity and accuracy requirement of biological sequence mining, tested sequences used during the mining process of FBSB are real ones instead of generated candidates, and all the frequent sequences can be mined without any errors. Comparing with other algorithms, the experimental results show that FBSB can achieve a better performance on both run time and scalability
The Impact of Board Internationalization on Real Earnings Management: Evidence From China
In this article, we examine the impact of board internationalization on real earnings management. Using the annual data of 2,899 Chinese listed non-financial firms with 16,638 firm-year observations over the period from 2008 to 2017 for empirical analysis, we find robust evidence that higher proportion of foreign directors on corporate boards reduces real earnings management. Results support the hypothesis that foreign directors increase boards’ effectiveness in monitoring the management and, consequently, lead to less earnings management by corporate executives. Our results are robust to alternative measures of board internationalization, instrumental variable analysis, and adding additional control variables. We further observe that foreign directors are more effective in reducing earnings management in firms with local directors with foreign experience and in Chinese provinces with developed institutional environment. Moreover, Chinese firms complement accrual and real activities–based earnings management, and board internationalization is effective in reducing both types of earnings management. Overall, our findings imply board internationalization improves the quality of reported earnings to outside shareholders
Fast all-optical nuclear spin echo technique based on EIT
We demonstrate an all-optical Raman spin echo technique, using
Electromagnetically Induced Transparency (EIT) to create the different pulses
of the spin echo sequence: initialization, pi-rotation, and readout. The first
pulse of the sequence induces coherence directly from a mixed state, and the
technique is used to measure the nuclear spin coherence of an inhomogeneously
broadened ensemble of rare-earth ions (Pr). In contrast to previous
experiments it does not require any preparatory hole burning pulse sequences,
which greatly shortens the total duration of the sequence. The effect of the
different pulses is characterized by quantum state tomography and is compared
with simulations. We demonstrate two applications of the technique by using the
spin echo sequence to accurately compensate a magnetic field across our sample,
and to measure the coherence time at high temperatures up to 11 K, where
standard preparation techniques are difficult to implement. We explore the
potential of the technique and possible applications.Comment: 8 pages, 6 figure
Stability of the Period-Doubled Core of the 90-degree Partial in Silicon
In a recent Letter [N. Lehto and S. Oberg, Phys. Rev. Lett. 80, 5568 (1998)],
Lehto and Oberg investigated the effects of strain fields on the core structure
of the 90-degree partial dislocation in silicon, especially the influence of
the choice of supercell periodic boundary conditions in theoretical
simulations. We show that their results for the relative stability between the
two structures are in disagreement with cell-size converged tight-binding total
energy (TBTE) calculations, which suggest the DP core to be more stable,
regardless of the choice of boundary condition. Moreover, we argue that this
disagreement is due to their use of a Keating potential.Comment: 1 page. Submitted to Comments section of PRL. Also available at
http://www.physics.rutgers.edu/~dhv/preprints/rn_dcom/index.htm
Detecting Sockpuppets in Deceptive Opinion Spam
This paper explores the problem of sockpuppet detection in deceptive opinion
spam using authorship attribution and verification approaches. Two methods are
explored. The first is a feature subsampling scheme that uses the KL-Divergence
on stylistic language models of an author to find discriminative features. The
second is a transduction scheme, spy induction that leverages the diversity of
authors in the unlabeled test set by sending a set of spies (positive samples)
from the training set to retrieve hidden samples in the unlabeled test set
using nearest and farthest neighbors. Experiments using ground truth sockpuppet
data show the effectiveness of the proposed schemes.Comment: 18 pages, Accepted at CICLing 2017, 18th International Conference on
Intelligent Text Processing and Computational Linguistic
Axisymmetric Self-Similar Equilibria of Self-Gravitating Isothermal Systems
All axisymmetric self-similar equilibria of self-gravitating, rotating,
isothermal systems are identified by solving the nonlinear Poisson equation
analytically. There are two families of equilibria: (1) Cylindrically symmetric
solutions in which the density varies with cylindrical radius as R^(-alpha),
with 0 <= alpha <= 2. (2) Axially symmetric solutions in which the density
varies as f(theta)/r^2, where `r' is the spherical radius and `theta' is the
co-latitude. The singular isothermal sphere is a special case of the latter
class with f(theta)=constant. The axially symmetric equilibrium configurations
form a two-parameter family of solutions and include equilibria which are
surprisingly asymmetric with respect to the equatorial plane. The asymmetric
equilibria are, however, not force-free at the singular points r=0, infinity,
and their relevance to real systems is unclear. For each hydrodynamic
equilibrium, we determine the phase-space distribution of the collisionless
analog.Comment: 13 pages, 7 figures, uses emulateapj.sty. Submitted to Ap
Deep Bilevel Learning
We present a novel regularization approach to train neural networks that
enjoys better generalization and test error than standard stochastic gradient
descent. Our approach is based on the principles of cross-validation, where a
validation set is used to limit the model overfitting. We formulate such
principles as a bilevel optimization problem. This formulation allows us to
define the optimization of a cost on the validation set subject to another
optimization on the training set. The overfitting is controlled by introducing
weights on each mini-batch in the training set and by choosing their values so
that they minimize the error on the validation set. In practice, these weights
define mini-batch learning rates in a gradient descent update equation that
favor gradients with better generalization capabilities. Because of its
simplicity, this approach can be integrated with other regularization methods
and training schemes. We evaluate extensively our proposed algorithm on several
neural network architectures and datasets, and find that it consistently
improves the generalization of the model, especially when labels are noisy.Comment: ECCV 201
- …