203 research outputs found

    A User-Centered Concept Mining System for Query and Document Understanding at Tencent

    Concepts embody the knowledge of the world and facilitate the cognitive processes of human beings. Mining concepts from web documents and constructing the corresponding taxonomy are core research problems in text understanding and support many downstream tasks such as query analysis, knowledge base construction, recommendation, and search. However, we argue that most prior studies extract formal and overly general concepts from Wikipedia or static web pages, which do not represent the user perspective. In this paper, we describe our experience of implementing and deploying ConcepT in Tencent QQ Browser. It discovers user-centered concepts at the right granularity conforming to user interests, by mining a large amount of user queries and interactive search click logs. The extracted concepts have the proper granularity, are consistent with user language styles, and are dynamically updated. We further present our techniques to tag documents with user-centered concepts and to construct a topic-concept-instance taxonomy, which has helped to improve search as well as news feed recommendation in Tencent QQ Browser. We performed extensive offline evaluation to demonstrate that our approach could extract concepts of higher quality compared to several other existing methods. Our system has been deployed in Tencent QQ Browser. Results from online A/B testing involving a large number of real users suggest that the Impression Efficiency of feeds users increased by 6.01% after incorporating the user-centered concepts into the recommendation framework of Tencent QQ Browser. Comment: Accepted by KDD 201

    Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes

    Nested Named Entity Recognition (NNER) has been a long-term challenge to researchers as an important sub-area of Named Entity Recognition. NNER is where one entity may be part of a longer entity, and this may happen on multiple levels, as the term nested suggests. These nested structures make traditional sequence labeling methods unable to properly recognize all entities. While recent research focuses on designing better recognition methods for NNER in a variety of languages, Chinese NNER (CNNER) still lacks attention, and a free-for-access, CNNER-specialized benchmark is absent. In this paper, we aim to solve CNNER problems by providing a Chinese dataset and a learning-based model to tackle the issue. To facilitate the research on this task, we release ChiNesE, a CNNER dataset with 20,000 sentences sampled from online passages of multiple domains, containing 117,284 entities falling into 10 categories, where 43.8 percent of those entities are nested. Based on ChiNesE, we propose Mulco, a novel method that can recognize named entities in nested structures through multiple scopes. Each scope uses a designed scope-based sequence labeling method, which predicts an anchor and the length of a named entity to recognize it. Experimental results show that Mulco outperforms several baseline methods with different recognition schemes on ChiNesE. We also conduct extensive experiments on the ACE2005 Chinese corpus, where Mulco achieves the best performance compared with the baseline methods.
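    The anchor-plus-length idea in the Mulco abstract can be illustrated with a small decoding sketch. This is not the authors' implementation: the names `decode_spans` and `is_nested` are hypothetical, the example predictions are made up, and the sketch assumes the anchor is the index of an entity's first token, which the abstract does not specify.

```python
from typing import List, Tuple

# (text, label, start, end) with end exclusive
Span = Tuple[str, str, int, int]


def decode_spans(tokens: List[str],
                 predictions: List[Tuple[int, int, str]]) -> List[Span]:
    """Turn (anchor, length, label) predictions into labeled spans.

    Assumption (not from the paper): the anchor is the start token index.
    Predictions that fall outside the sentence are dropped.
    """
    spans: List[Span] = []
    for anchor, length, label in predictions:
        end = anchor + length
        if anchor < 0 or length <= 0 or end > len(tokens):
            continue
        spans.append(("".join(tokens[anchor:end]), label, anchor, end))
    return spans


def is_nested(inner: Span, outer: Span) -> bool:
    """True if `inner` lies inside `outer` and covers a different range."""
    same_range = (inner[2], inner[3]) == (outer[2], outer[3])
    return outer[2] <= inner[2] and inner[3] <= outer[3] and not same_range


# Illustrative input: "北京大学" (Peking University) contains "北京" (Beijing).
tokens = list("北京大学")
spans = decode_spans(tokens, [(0, 2, "LOC"), (0, 4, "ORG"), (3, 5, "ORG")])
for span in spans:
    print(span)
print(is_nested(spans[0], spans[1]))
```

    Because each entity is described by its own anchor and length, overlapping entities do not compete for a single tag per token, which is the limitation of flat sequence labeling that the abstract points to.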

    Testing the mantle plume hypothesis: An IODP effort to drill into the Kamchatka-Okhotsk Sea system

    The great mantle plume debate (GPD) has been going on for ∼15 years (Foulger and Natland, 2003; Anderson, 2004; Niu, 2005; Davies, 2005; Foulger, 2005; Campbell, 2005; Campbell and Davies, 2006), centered on whether mantle plumes exist as a result of Earth’s cooling or whether their existence is purely required for convenience in explaining certain Earth phenomena (Niu, 2005). Despite the mounting evidence that many of the so-called plumes may be localized melting anomalies, the debate is likely to continue. We recognize that the slow progress of the debate results from communication difficulties. Many debaters may not truly appreciate (1) what the mantle plume hypothesis actually is, and (2) that none of the widely used petrological, geochemical and geophysical methods can actually provide smoking-gun evidence for or against the mantle plume hypothesis. In this short paper, we clarify these issues and elaborate a geologically effective approach to test the hypothesis. According to the mantle plume hypothesis, a thermal mantle plume must originate from the thermal boundary layer at the core-mantle boundary (CMB), and a large mantle plume head is required to carry the material from the deep mantle to the surface. The plume head product in ocean basins is the oceanic plateau, which is a lithospheric terrane that is large (1000s of km across), thick (>200 km), shallow (2–4 km high above the surrounding seafloor), buoyant (∼1% less dense than the surrounding lithosphere), and thus must be preserved in the surface geology (Niu et al., 2003). The Hawaiian volcanism has been considered as the surface expression of a type mantle plume, but it does not seem to have a (known) plume head product. If this is true, the Hawaiian mantle plume in particular and the mantle plume hypothesis in general must be questioned. Therefore, whether there is an oceanic plateau-like product for the Hawaiian volcanism is key to testing the mantle plume hypothesis, and the Kamchatka-Okhotsk Sea basement is the best candidate for finding out whether it is indeed the Hawaiian mantle plume head product (Niu et al., 2003; Niu, 2004).

    ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

    Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker. Previous approaches concentrate on performing multimodal fusion and representation learning based on general knowledge obtained from pretrained models, which neglects the effect of domain-specific knowledge. In this paper, we propose Contrastive Knowledge Injection (ConKI) for multimodal sentiment analysis, where specific-knowledge representations for each modality can be learned together with general knowledge representations via knowledge injection based on an adapter architecture. In addition, ConKI uses a hierarchical contrastive learning procedure performed between knowledge types within every single modality, across modalities within each sample, and across samples to facilitate the effective learning of the proposed representations, hence improving multimodal sentiment predictions. The experiments on three popular multimodal sentiment analysis benchmarks show that ConKI outperforms all prior methods on a variety of performance metrics. Comment: Accepted by ACL Findings 202

    RESEARCH ON CO2 FLOODING FOR IMPROVED OIL RECOVERY IN WATER FLOODING ABANDONED RESERVOIRS

    CO2 injection is an effective technique for improved oil recovery in light oil reservoirs, especially in reservoirs abandoned after water flooding. In this study, the lower part of the Es1 reservoirs in the Pucheng oilfield was taken as the target reservoir. A study of the minimum miscibility pressure for CO2 flooding showed that the reservoir could achieve miscible flooding. Long core displacement experiments proved that water-alternating-CO2 flooding could significantly improve recovery. Based on the reservoir characteristics, anti-corrosion technology for the injection process was researched, and the H-20 inhibitor was selected by screening. A channeling blocking agent combining delayed-expansion gel particles with a cross-linked copolymer was used to control gas mobility. The Pu 1-1 well groups were optimized to conduct a field trial. The cumulative injected liquid CO2 was 19,219.95 t (0.248 PV) and the cumulative incremental oil was 4,520.9 t. The predicted recovery will increase by 8.3%. The successful implementation of the project can serve as a technical trial toward energy succession and toward energy-saving and emission-reduction targets.

    RNA topoisomerase is prevalent in all domains of life and associates with polyribosomes in animals

    DNA topoisomerases are essential to resolve topological problems during DNA metabolism in all species. However, the prevalence and function of RNA topoisomerases remain uncertain. Here, we show that RNA topoisomerase activity is prevalent in Type IA topoisomerases from bacteria, archaea, and eukarya. Moreover, this activity always requires the conserved Type IA core domains and the same catalytic residue used in the DNA topoisomerase reaction; however, it does not absolutely require the non-conserved carboxyl-terminal domain (CTD), which is necessary for relaxation reactions of supercoiled DNA. The RNA topoisomerase activity of human Top3β differs from that of Escherichia coli topoisomerase I in that the former but not the latter requires the CTD, indicating that topoisomerases have developed distinct mechanisms during evolution to catalyze RNA topoisomerase reactions. Notably, Top3β proteins from several animals associate with polyribosomes, which are units of mRNA translation, whereas the Top3 homologs from E. coli and yeast lack the association. The Top3β-polyribosome association requires TDRD3, which directly interacts with Top3β and is present in animals but not bacteria or yeast. We propose that RNA topoisomerases arose in the early RNA world, and that they are retained through all domains of DNA-based life, where they mediate mRNA translation as part of polyribosomes in animals.

    Large-scale Synthesis of β-SiC Nanochains and Their Raman/Photoluminescence Properties

    Although the SiC/SiO2 nanochain heterojunction has been synthesized, the chained homogeneous nanostructure of SiC has not been reported before. Herein, novel β-SiC nanochains are synthesized with the assistance of an AAO template. Characterization results demonstrate that the nanostructures are constructed of spheres 25–30 nm in diameter joined by wires 15–20 nm in diameter. Raman and photoluminescence measurements are used to explore their unique optical properties. A speed-alternating vapor–solid (SA-VS) growth mechanism is proposed to interpret the formation of these nanochains. The achieved nanochains enrich the species of one-dimensional (1D) nanostructures and may hold great potential for applications in nanotechnology.

    Finishing the euchromatic sequence of the human genome

    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.
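    As a rough back-of-envelope reading of these figures, and assuming the stated error rate applies uniformly across the whole sequence (a simplification not made in the abstract), the expected number of residual errors is:

```latex
\[
  2.85 \times 10^{9}\ \text{bases} \times \frac{1\ \text{error}}{10^{5}\ \text{bases}}
  \approx 2.85 \times 10^{4}\ \text{errors}
\]
```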

    Research on the establishment and operation management of flexible organization

    This paper argues that enterprises need to establish flexible organizations to cope with external challenges, analyzes the characteristics and main problems of flexible organization, and offers suggestions on the establishment and operation management of flexible organization. To realize flexible management of the project organization, continuous improvement must be pursued in practice, so that the organization can fulfil its role of effectively revitalizing existing assets and efficiently integrating resources.