175 research outputs found

    Approximate Multiplication of Sparse Matrices with Limited Space

    Full text link
    Approximate matrix multiplication with limited space has received ever-increasing attention due to the emergence of large-scale applications. Recently, based on a popular matrix sketching algorithm---frequent directions, previous work has introduced co-occuring directions (COD) to reduce the approximation error for this problem. Although it enjoys the space complexity of O((mx+my))O((m_x+m_y)\ell) for two input matrices XRmx×nX\in\mathbb{R}^{m_x\times n} and YRmy×nY\in\mathbb{R}^{m_y\times n} where \ell is the sketch size, its time complexity is O(n(mx+my+))O\left(n(m_x+m_y+\ell)\ell\right), which is still very high for large input matrices. In this paper, we propose to reduce the time complexity by exploiting the sparsity of the input matrices. The key idea is to employ an approximate singular value decomposition (SVD) method which can utilize the sparsity, to reduce the number of QR decompositions required by COD. In this way, we develop sparse co-occuring directions, which reduces the time complexity to \widetilde{O}\left((\nnz(X)+\nnz(Y))\ell+n\ell^2\right) in expectation while keeps the same space complexity as O((mx+my))O((m_x+m_y)\ell), where \nnz(X) denotes the number of non-zero entries in XX. Theoretical analysis reveals that the approximation error of our algorithm is almost the same as that of COD. Furthermore, we empirically verify the efficiency and effectiveness of our algorithm

    Efficient Algorithms for Generalized Linear Bandits with Heavy-tailed Rewards

    Full text link
    This paper investigates the problem of generalized linear bandits with heavy-tailed rewards, whose (1+ϵ)(1+\epsilon)-th moment is bounded for some ϵ(0,1]\epsilon\in (0,1]. Although there exist methods for generalized linear bandits, most of them focus on bounded or sub-Gaussian rewards and are not well-suited for many real-world scenarios, such as financial markets and web-advertising. To address this issue, we propose two novel algorithms based on truncation and mean of medians. These algorithms achieve an almost optimal regret bound of O~(dT11+ϵ)\widetilde{O}(dT^{\frac{1}{1+\epsilon}}), where dd is the dimension of contextual information and TT is the time horizon. Our truncation-based algorithm supports online learning, distinguishing it from existing truncation-based approaches. Additionally, our mean-of-medians-based algorithm requires only O(logT)O(\log T) rewards and one estimator per epoch, making it more practical. Moreover, our algorithms improve the regret bounds by a logarithmic factor compared to existing algorithms when ϵ=1\epsilon=1. Numerical experimental results confirm the merits of our algorithms

    The isolation and characterization of twelve novel microsatellite loci from Haliotis ovina

    Get PDF
    Twelve (12) microsatellite loci were developed from Haliotis ovina by magnetic bead hybridization method. Genetic variability was assessed using 30 individuals from three wild populations. The number of alleles per locus was from 2 to 5 and polymorphism information content was from 0.1228 to 0.6542. The observed and expected heterozygosities ranged from 0.0000 to 0.7778 and 0.1288 to 0.6310, respectively. These loci should provide useful information for genetic studies such as genetic diversity, pedigree analysis, construction of genetic linkage maps and marker-assisted selection breeding in H. ovina.Key words: Genetic markers, Haliotis ovina, microsatellites

    Dealing with Imbalanced Classes in Bot-IoT Dataset

    Full text link
    With the rapidly spreading usage of Internet of Things (IoT) devices, a network intrusion detection system (NIDS) plays an important role in detecting and protecting various types of attacks in the IoT network. To evaluate the robustness of the NIDS in the IoT network, the existing work proposed a realistic botnet dataset in the IoT network (Bot-IoT dataset) and applied it to machine learning-based anomaly detection. This dataset contains imbalanced normal and attack packets because the number of normal packets is much smaller than that of attack ones. The nature of imbalanced data may make it difficult to identify the minority class correctly. In this thesis, to address the class imbalance problem in the Bot-IoT dataset, we propose a binary classification method with synthetic minority over-sampling techniques (SMOTE). The proposed classifier aims to detect attack packets and overcome the class imbalance problem using the SMOTE algorithm. Through numerical results, we demonstrate the proposed classifier's fundamental characteristics and the impact of imbalanced data on its performance
    corecore