175 research outputs found
Approximate Multiplication of Sparse Matrices with Limited Space
Approximate matrix multiplication with limited space has received
ever-increasing attention due to the emergence of large-scale applications.
Recently, based on a popular matrix sketching algorithm---frequent directions,
previous work has introduced co-occuring directions (COD) to reduce the
approximation error for this problem. Although it enjoys the space complexity
of for two input matrices and
where is the sketch size, its time
complexity is , which is still very high for
large input matrices. In this paper, we propose to reduce the time complexity
by exploiting the sparsity of the input matrices. The key idea is to employ an
approximate singular value decomposition (SVD) method which can utilize the
sparsity, to reduce the number of QR decompositions required by COD. In this
way, we develop sparse co-occuring directions, which reduces the time
complexity to \widetilde{O}\left((\nnz(X)+\nnz(Y))\ell+n\ell^2\right) in
expectation while keeps the same space complexity as , where
\nnz(X) denotes the number of non-zero entries in . Theoretical analysis
reveals that the approximation error of our algorithm is almost the same as
that of COD. Furthermore, we empirically verify the efficiency and
effectiveness of our algorithm
Efficient Algorithms for Generalized Linear Bandits with Heavy-tailed Rewards
This paper investigates the problem of generalized linear bandits with
heavy-tailed rewards, whose -th moment is bounded for some
. Although there exist methods for generalized linear
bandits, most of them focus on bounded or sub-Gaussian rewards and are not
well-suited for many real-world scenarios, such as financial markets and
web-advertising. To address this issue, we propose two novel algorithms based
on truncation and mean of medians. These algorithms achieve an almost optimal
regret bound of , where is the
dimension of contextual information and is the time horizon. Our
truncation-based algorithm supports online learning, distinguishing it from
existing truncation-based approaches. Additionally, our mean-of-medians-based
algorithm requires only rewards and one estimator per epoch, making
it more practical. Moreover, our algorithms improve the regret bounds by a
logarithmic factor compared to existing algorithms when . Numerical
experimental results confirm the merits of our algorithms
The isolation and characterization of twelve novel microsatellite loci from Haliotis ovina
Twelve (12) microsatellite loci were developed from Haliotis ovina by magnetic bead hybridization method. Genetic variability was assessed using 30 individuals from three wild populations. The number of alleles per locus was from 2 to 5 and polymorphism information content was from 0.1228 to 0.6542. The observed and expected heterozygosities ranged from 0.0000 to 0.7778 and 0.1288 to 0.6310, respectively. These loci should provide useful information for genetic studies such as genetic diversity, pedigree analysis, construction of genetic linkage maps and marker-assisted selection breeding in H. ovina.Key words: Genetic markers, Haliotis ovina, microsatellites
Dealing with Imbalanced Classes in Bot-IoT Dataset
With the rapidly spreading usage of Internet of Things (IoT) devices, a
network intrusion detection system (NIDS) plays an important role in detecting
and protecting various types of attacks in the IoT network. To evaluate the
robustness of the NIDS in the IoT network, the existing work proposed a
realistic botnet dataset in the IoT network (Bot-IoT dataset) and applied it to
machine learning-based anomaly detection. This dataset contains imbalanced
normal and attack packets because the number of normal packets is much smaller
than that of attack ones. The nature of imbalanced data may make it difficult
to identify the minority class correctly. In this thesis, to address the class
imbalance problem in the Bot-IoT dataset, we propose a binary classification
method with synthetic minority over-sampling techniques (SMOTE). The proposed
classifier aims to detect attack packets and overcome the class imbalance
problem using the SMOTE algorithm. Through numerical results, we demonstrate
the proposed classifier's fundamental characteristics and the impact of
imbalanced data on its performance
- …