3 research outputs found

    Causal Graph Discovery For Hydrological Time Series Knowledge Discovery

    Full text link
    Causal inference or causal relationship discovery is an important task in hydrological study to explore the causes of abnormal hydrology phenomena such as drought and flood, which will help improving our prediction and response ability to natural disasters. Different from generic causality study where causalrelation discovery is sufficient, for extreme hydrological situation prediction and modeling, we need not only to construct a causal graph to reveal the contributing factors, but also to provide the lead time of each cause to its effect. Lead time is the time difference between the occurrence of lead and effect. Though causal inference or causal relationship discovery has been a major topic in many science problems, majority of the work has been focused on the validity of such relationship with no knowledge on cause-effect time lead information. Such insight is critical for hydrological modeling and prediction, in which time lead information is desired for knowing how long different factors will affect certain extreme situations such as flood or drought. The most commonly used computational algorithms for causality discovered can be categorized as using regression approaches or Bayesian approaches. Regression based approaches such as Granger\u27s causality assume linear causality and first order causal relationship. Bayesian approaches, such as the PC algorithm from Pearl\u27s causality definition, have exponential runtime complexity which makes it difficult to be applied to hydrological systems with a high number of variables. Furthermore, no existing approaches incorporate the lead time concept in the discovery of causal relationship. In this paper, we propose a new approach, mutual information causal (MI-Causal), for causal relationship discovery, which embodies the advantages of existing approaches and overcomes the limitations to satisfy the hydrologic need. The experimental results from both synthetic and real time hydrological data show that our proposed method outperforms regression approaches and Bayesian based approaches

    Exploring Technical Phrase Frames from Research Paper Titles

    Get PDF
    This paper proposes a method for exploring technical phrase frames by extracting word n-grams that match our information needs and interests from research paper titles. Technical phrase frames, the outcome of our method, are phrases with wildcards that may be substituted for any technical term. Our method, first of all, extracts word trigrams from research paper titles and constructs a co-occurrence graph of the trigrams. Even by simply applying Page Rank algorithm to the co-occurrence graph, we obtain the trigrams that can be regarded as technical key phrases at the higher ranks in terms of Page Rank score. In contrast, our method assigns weights to the edges of the co-occurrence graph based on Jaccard similarity between trigrams and then apply weighted Page Rank algorithm. Consequently, we obtain widely different but more interesting results. While the top-ranked trigrams obtained by unweighted Page Rank have just a self-contained meaning, those obtained by our method are technical phrase frames, i.e., A word sequence that forms a complete technical phrase only after putting a technical word (or words) before or/and after it. We claim that our method is a useful tool for discovering important phrase logical patterns, which can expand query keywords for improving information retrieval performance and can also work as candidate phrasings in technical writing to make our research papers attractive.29th IEEE International Conference on Advanced Information Networking and Applications Workshops, WAINA 2015; Gwangju; South Korea; 25 March 2015 through 27 March 201
    corecore