7,361 research outputs found

    Asymptotic Generalization Bound of Fisher's Linear Discriminant Analysis

    Full text link
    Fisher's linear discriminant analysis (FLDA) is an important dimension reduction method in statistical pattern recognition. It has been shown that FLDA is asymptotically Bayes optimal under the homoscedastic Gaussian assumption. However, this classical result has the following two major limitations: 1) it holds only for a fixed dimensionality DD, and thus does not apply when DD and the training sample size NN are proportionally large; 2) it does not provide a quantitative description on how the generalization ability of FLDA is affected by DD and NN. In this paper, we present an asymptotic generalization analysis of FLDA based on random matrix theory, in a setting where both DD and NN increase and D/N⟶γ∈[0,1)D/N\longrightarrow\gamma\in[0,1). The obtained lower bound of the generalization discrimination power overcomes both limitations of the classical result, i.e., it is applicable when DD and NN are proportionally large and provides a quantitative description of the generalization ability of FLDA in terms of the ratio γ=D/N\gamma=D/N and the population discrimination power. Besides, the discrimination power bound also leads to an upper bound on the generalization error of binary-classification with FLDA

    Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction

    Full text link
    Most existing event extraction (EE) methods merely extract event arguments within the sentence scope. However, such sentence-level EE methods struggle to handle soaring amounts of documents from emerging applications, such as finance, legislation, health, etc., where event arguments always scatter across different sentences, and even multiple such event mentions frequently co-exist in the same document. To address these challenges, we propose a novel end-to-end model, Doc2EDAG, which can generate an entity-based directed acyclic graph to fulfill the document-level EE (DEE) effectively. Moreover, we reformalize a DEE task with the no-trigger-words design to ease the document-level event labeling. To demonstrate the effectiveness of Doc2EDAG, we build a large-scale real-world dataset consisting of Chinese financial announcements with the challenges mentioned above. Extensive experiments with comprehensive analyses illustrate the superiority of Doc2EDAG over state-of-the-art methods. Data and codes can be found at https://github.com/dolphin-zs/Doc2EDAG.Comment: Accepted by EMNLP 201

    Future prospects of mass-degenerate Higgs bosons in the CPCP-conserving two-Higgs-doublet model

    Full text link
    The scenario of two mass-degenerate Higgs bosons within the general two-Higgs-doublet model (2HDM) is revisited. We focus on the global picture when two CPCP-even Higgs bosons of hh and HH are nearly mass-degenerate. A global fit to the signal strength of the 125 GeV Higgs measured at the LHC is performed. Based on the best-fit result of the 2HDM mixing angles (α,β)(\alpha,\beta), theoretical constraints, charged and CPCP-odd Higgs boson direct search constraints and the electroweak precision constraints are imposed to the 2HDM parameter space. We present the signal predictions of the (4b ,2b 2γ)(4b\,, 2b\,2\gamma) channels for the benchmark models at the LHC 14 TeV runs. We also study the direct Higgs boson pair productions at the LHC, and the Z-associated Higgs boson pair production search at the ILC 500 GeV runs, as well as the indirect probes at the CEPC 250 GeV run. We find that the mass-degenerate Higgs boson scenario in the Type-II 2HDM can be fully probed by these future experimental searches.Comment: 31 pages, 9 figures, 5 tables, matches with the PRD published versio
    • …
    corecore