26 research outputs found

    KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models

    Full text link
    Large language models (LLMs) demonstrate remarkable performance on knowledge-intensive tasks, suggesting that real-world knowledge is encoded in their model parameters. However, besides explorations on a few probing tasks in limited knowledge domains, it is not well understood how to evaluate LLMs' knowledge systematically and how well their knowledge abilities generalize, across a spectrum of knowledge domains and progressively complex task formats. To this end, we propose KGQuiz, a knowledge-intensive benchmark to comprehensively investigate the knowledge generalization abilities of LLMs. KGQuiz is a scalable framework constructed from triplet-based knowledge, which covers three knowledge domains and consists of five tasks with increasing complexity: true-or-false, multiple-choice QA, blank filling, factual editing, and open-ended knowledge generation. To gain a better understanding of LLMs' knowledge abilities and their generalization, we evaluate 10 open-source and black-box LLMs on the KGQuiz benchmark across the five knowledge-intensive tasks and knowledge domains. Extensive experiments demonstrate that LLMs achieve impressive performance in straightforward knowledge QA tasks, while settings and contexts requiring more complex reasoning or employing domain-specific facts still present significant challenges. We envision KGQuiz as a testbed to analyze such nuanced variations in performance across domains and task formats, and ultimately to understand, evaluate, and improve LLMs' knowledge abilities across a wide spectrum of knowledge domains and tasks

    Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks

    Full text link
    Online movie review platforms are providing crowdsourced feedback for the film industry and the general public, while spoiler reviews greatly compromise user experience. Although preliminary research efforts were made to automatically identify spoilers, they merely focus on the review content itself, while robust spoiler detection requires putting the review into the context of facts and knowledge regarding movies, user behavior on film review platforms, and more. In light of these challenges, we first curate a large-scale network-based spoiler detection dataset LCS and a comprehensive and up-to-date movie knowledge base UKM. We then propose MVSD, a novel Multi-View Spoiler Detection framework that takes into account the external knowledge about movies and user activities on movie review platforms. Specifically, MVSD constructs three interconnecting heterogeneous information networks to model diverse data sources and their multi-view attributes, while we design and employ a novel heterogeneous graph neural network architecture for spoiler detection as node-level classification. Extensive experiments demonstrate that MVSD advances the state-of-the-art on two spoiler detection datasets, while the introduction of external knowledge and user interactions help ground robust spoiler detection. Our data and code are available at https://github.com/Arthur-Heng/Spoiler-DetectionComment: EMNLP 202

    Identification of genes regulated by Wnt/β-catenin pathway and involved in apoptosis via microarray analysis

    Get PDF
    BACKGROUND: Wnt/β-catenin pathway has critical roles in development and oncogenesis. Although significant progress has been made in understanding the downstream signaling cascade of this pathway, little is known regarding Wnt/β-catenin pathway modification of the cellular apoptosis. METHODS: To identify potential genes regulated by Wnt/β-catenin pathway and involved in apoptosis, we used a stably integrated, inducible RNA interference (RNAi) vector to specific inhibit the expression and the transcriptional activity of β-catenin in HeLa cells. Meanwhile, we designed an oligonucleotide microarray covering 1384 apoptosis-related genes. Using oligonucleotide microarrays, a series of differential expression of genes was identified and further confirmed by RT-PCR. RESULTS: Stably integrated inducible RNAi vector could effectively suppress β-catenin expression and the transcriptional activity of β-catenin/TCF. Meanwhile, depletion of β-catenin in this manner made the cells more sensitive to apoptosis. 130 genes involved in some important cell-apoptotic pathways, such as PTEN-PI3K-AKT pathway, NF-κB pathway and p53 pathway, showed significant alteration in their expression level after the knockdown of β-catenin. CONCLUSION: Coupling RNAi knockdown with microarray and RT-PCR analyses proves to be a versatile strategy for identifying genes regulated by Wnt/β-catenin pathway and for a better understanding the role of this pathway in apoptosis. Some of the identified β-catenin/TCF directed or indirected target genes may represent excellent targets to limit tumor growth

    Solar Ring Mission: Building a Panorama of the Sun and Inner-heliosphere

    Full text link
    Solar Ring (SOR) is a proposed space science mission to monitor and study the Sun and inner heliosphere from a full 360{\deg} perspective in the ecliptic plane. It will deploy three 120{\deg}-separated spacecraft on the 1-AU orbit. The first spacecraft, S1, locates 30{\deg} upstream of the Earth, the second, S2, 90{\deg} downstream, and the third, S3, completes the configuration. This design with necessary science instruments, e.g., the Doppler-velocity and vector magnetic field imager, wide-angle coronagraph, and in-situ instruments, will allow us to establish many unprecedented capabilities: (1) provide simultaneous Doppler-velocity observations of the whole solar surface to understand the deep interior, (2) provide vector magnetograms of the whole photosphere - the inner boundary of the solar atmosphere and heliosphere, (3) provide the information of the whole lifetime evolution of solar featured structures, and (4) provide the whole view of solar transients and space weather in the inner heliosphere. With these capabilities, Solar Ring mission aims to address outstanding questions about the origin of solar cycle, the origin of solar eruptions and the origin of extreme space weather events. The successful accomplishment of the mission will construct a panorama of the Sun and inner-heliosphere, and therefore advance our understanding of the star and the space environment that holds our life.Comment: 41 pages, 6 figures, 1 table, to be published in Advances in Space Researc

    Well-posedness and asynchronous exponential growth of solutions of a two-phase cell division model

    No full text
    In this article we study a two-phase cell division model. The cells of the two different phases have different growth rates. We mainly consider the model of equal mitosis. By using the semigroup theory, we prove that this model is well-posed in suitable function spaces and its solutions have the property of asynchronous exponential growth as time approaches infinity. The corresponding model of asymmetric mitosis is also studied and similar results are obtained

    How Do Morphological Factors Influence the Green Nut Yield of Chinese Torreya?

    No full text
    As an important economic tree species, Chinese Torreya (Torreya grandis cv Merrillii) has been widely planted in the subtropical regions of China. However, it remains to be studied whether morphological traits are the key factors reflecting or affecting the green nut yield of Chinese Torreya, which is necessary for breeding research and plantation management. Therefore, in Zhuji in the Zhejiang Province, the central production area of Chinese Torreya, we investigated the morphological traits (height, ground diameter, under-crown height, crown width, and branching amount) and green nut yield of 120 randomly selected Chinese Torreya. Our results indicated that the differences in the morphological traits among Chinese Torreya individuals were relatively small, but those in the green nut yield traits were great. There was highly significant (p r = 0.38; p r = 0.294; p p < 0.01). These findings imply that if the tree height is fixed, increasing the ground diameter and crown area, appropriately increasing the branching amount, and reducing the under-crown height could be potential technical measures to improve the green nut yield of Chinese Torreya. Our study provides background information on green nut yield and its morphological traits in Chinese Torreya

    FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge

    Full text link
    Evaluating the factual consistency of automatically generated summaries is essential for the progress and adoption of reliable summarization systems. Despite recent advances, existing factuality evaluation models are not robust, being especially prone to entity and relation errors in new domains. We propose FactKB, a simple new approach to factuality evaluation that is generalizable across domains, in particular with respect to entities and relations. FactKB is based on language models pretrained using facts extracted from external knowledge bases. We introduce three types of complementary factuality pretraining objectives based on direct entity facts, facts grounded in auxiliary knowledge about entities, and facts constructed compositionally through knowledge base walks. The resulting factuality evaluation model achieves state-of-the-art performance on two in-domain news summarization benchmarks as well as on three out-of-domain scientific literature datasets. Further analysis of FactKB shows improved ability to detect erroneous entities and relations in summaries and is robust and generalizable across domains
    corecore