48 research outputs found

    Generating Better Concept Hierarchies Using Automatic Document Classification

    This paper presents a hybrid concept hierarchy development technique for web documents retrieved by a meta-search engine. The technique separates the initially retrieved documents into topically oriented categories before the actual concept hierarchy generation. The topical categories correspond to different semantic aspects of the query. The separation is done by applying 1-of-n automatic document classification to the initial set of returned documents. Then, an individual topical concept hierarchy is automatically generated inside each of the resulting categories. Both steps are executed on the fly at retrieval time. Due to the efficiency constraints imposed by the web retrieval context, the algorithm uses only document snippets (rather than full web pages) for both document classification and concept hierarchy generation. Experimental results show that the algorithm improves the quality of the concept hierarchy presented to the searcher while keeping the efficiency parameters within reasonable intervals.
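    The two-stage flow described in this abstract (1-of-n classification of snippets, then one hierarchy per category) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the keyword-overlap classifier, the frequency-based hierarchy, and all names (`CATEGORIES`, `STOPWORDS`, `classify_1_of_n`, `build_hierarchy`, `categorize_then_build`) are hypothetical stand-ins.

```python
from collections import Counter, defaultdict

# Hypothetical category profiles (illustrative keyword sets, not from the paper).
CATEGORIES = {
    "animal": {"cat", "wildlife", "species", "habitat"},
    "car": {"engine", "dealer", "vehicle", "model"},
}
STOPWORDS = {"the", "a", "an", "of", "in", "and", "for", "is", "to"}


def tokenize(snippet):
    """Lowercase the snippet and strip basic punctuation from each word."""
    return [w.strip(".,;:!?").lower() for w in snippet.split()]


def classify_1_of_n(snippet, categories):
    """Assign the snippet to exactly one category: the one with the largest
    keyword overlap. Ties (including zero overlap) go to the category listed
    first. This is an assumed stand-in for a trained classifier."""
    tokens = set(tokenize(snippet))
    return max(categories, key=lambda c: len(tokens & categories[c]))


def build_hierarchy(snippets, top_k=2):
    """Stand-in hierarchy: the top_k most frequent terms become parent
    concepts, each listing the snippets that mention it."""
    counts = Counter(
        t for s in snippets for t in set(tokenize(s)) if t and t not in STOPWORDS
    )
    return {
        term: [s for s in snippets if term in tokenize(s)]
        for term, _ in counts.most_common(top_k)
    }


def categorize_then_build(snippets, categories):
    """Stage 1: split snippets into categories. Stage 2: one hierarchy each."""
    groups = defaultdict(list)
    for s in snippets:
        groups[classify_1_of_n(s, categories)].append(s)
    return {c: build_hierarchy(docs) for c, docs in groups.items()}
```

    For an ambiguous query such as "jaguar", snippets about the animal and about the car land in separate categories first, so each hierarchy is built only from topically related snippets.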

    Incorporating Document Keyphrases in Search Results


    Investigation of Growth Mechanism of Plasma Electrolytic Oxidation Coating on Al-Ti Double-Layer Composite Plate

    The aluminum–titanium (Al-Ti) double-layer composite plate is a promising composite material, but it requires surface protection before it can be applied. In this paper, plasma electrolytic oxidation (PEO) was employed to fabricate a ceramic coating on the surface of an Al-Ti double-layer composite plate. To investigate the coating growth mechanism on the Al-Ti double-layer composite plate, a single-Al plate and a single-Ti plate were introduced for comparison experiments. Results showed that combining Al and Ti accelerated the coating growth rate on the Ti portion of the composite plate and decreased it on the Al portion. Electrochemical impedance spectroscopy analysis indicated that the equivalent circuit of the Al-Ti coating was formed by connecting two different circuits in parallel. The reaction behavior revealed that the electric energy during PEO would leak through the circuit with the weaker blocking effect, and confirmed that the electric energy distribution followed the law of low-resistance distribution. Finally, the mechanism was extended to the PEO treatment of general metal matrix composites to broaden the application theory of the technology.

    A Planar Dielectrophoretic Microdevice for Particle Manipulation

    This paper presents both a theoretical and an experimental study of particle motion in a typical interdigitated electrode array. Both the finite element method and numerical simulation were used to predict the movement of particles. The simulation results indicated that the particle motion and separation behaviors strongly depend on the combined contributions of a number of parameters, such as the frequency of the electric field, the applied voltage, and the dielectric properties of the particles and the surrounding medium.

    Automatically Finding Significant Topical Terms from Documents (Li et al.)

    With the pervasiveness of digital textual data, text mining is becoming more and more important for deriving competitive advantages. One factor in successful text mining applications is the ability to find significant topical terms for discovering interesting patterns or relationships. Document keyphrases are phrases carrying the most important topical concepts of a given document. In many applications, keyphrases as textual elements are better suited for text mining and can provide more discriminating power than single words. This paper describes an automatic keyphrase identification program (KIP). KIP’s algorithm examines the composition of noun phrases and calculates their scores by looking them up in a domain-specific glossary database; the phrases with higher scores are extracted as keyphrases. KIP’s learning function can enrich its glossary database by automatically adding newly identified keyphrases. KIP’s personalization feature allows the user to build a glossary database specifically suited to his or her area of interest.
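    The glossary-based scoring described in this abstract can be illustrated with a short sketch. It is not KIP's actual algorithm, whose scoring formula is not given here: the additive score, the threshold, the default weight for learned phrases, and the names `score_phrase` and `extract_keyphrases` are all assumptions, and candidate noun phrases are assumed to have been extracted upstream.

```python
def score_phrase(phrase, glossary):
    """Assumed scoring rule: sum the glossary weights of the phrase's words,
    plus the weight of the whole phrase when it is a multi-word glossary entry."""
    words = phrase.lower().split()
    score = sum(glossary.get(w, 0.0) for w in words)
    if len(words) > 1:
        score += glossary.get(" ".join(words), 0.0)
    return score


def extract_keyphrases(candidates, glossary, threshold=2.0, learn=True, new_weight=1.0):
    """Return candidates scoring at least `threshold`, highest score first.
    With learn=True, newly found keyphrases are added to the glossary with a
    hypothetical default weight, mimicking the learning function described above."""
    scored = [(p, score_phrase(p, glossary)) for p in candidates]
    scored.sort(key=lambda item: item[1], reverse=True)
    keyphrases = [p for p, s in scored if s >= threshold]
    if learn:
        for p in keyphrases:
            glossary.setdefault(p.lower(), new_weight)
    return keyphrases


# Illustrative glossary; a personalized glossary would simply be a different dict.
glossary = {"text": 1.0, "mining": 1.0, "text mining": 3.0, "database": 1.0}
candidates = ["text mining", "glossary database", "Text Mining Applications"]
keyphrases = extract_keyphrases(candidates, glossary)
```

    Here "text mining" scores 5.0 and "Text Mining Applications" scores 2.0, so both are kept, while "glossary database" scores 1.0 and is dropped; the new phrase is then stored in the glossary for later runs.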

    Incorporating Document Keyphrases in Search Results (Li et al.)

    The effectiveness and efficiency of searching and of presenting the returned results are key to a search engine. Before downloading and examining the document text, users usually first judge the relevance of a returned hit to the query by looking at the document metadata presented in the result. However, the metadata that comes with a returned hit is usually not rich enough for users to predict the content of the document. Keyphrases provide a concise summary of a document’s content, offering subject metadata that characterizes and summarizes the document. In this paper, we propose a mechanism for enriching the metadata of the returned results by incorporating automatically extracted document keyphrases in each returned hit. By looking at the keyphrases in each returned hit, the user can predict the content of the document more easily, quickly, and accurately. The experimental results show that our solution may save users up to 32% of their time and that users would like to use our proposed search interface, with document keyphrases as part of the metadata of a returned hit.

    A Hybrid Classifier Approach for Web Retrieved Documents Classification

    The paper presents a hybrid technique for the classification of web returned hits into concept hierarchies. The technique involves a combination of manual and automatic classifiers. First, all web returned documents are assigned to human-defined categories using manual classifiers; automatic classifiers are then used to generate a concept hierarchy for each of these categories. The results of the evaluation reveal the following: (a) for polysemous queries, our system is able to generate meaningful categories corresponding to (but not limited to) the different semantic facets of the queries; (b) as expected, for non-polysemous queries the system generates fewer categories; (c) the hierarchy precision of the concept hierarchies generated for polysemous queries is found to be significantly better than that obtained using a baseline system.