6 research outputs found

    LearnFCA: A Fuzzy FCA and Probability Based Approach for Learning and Classification

    Get PDF
    Formal concept analysis(FCA) is a mathematical theory based on lattice and order theory used for data analysis and knowledge representation. Over the past several years, many of its extensions have been proposed and applied in several domains including data mining, machine learning, knowledge management, semantic web, software development, chemistry ,biology, medicine, data analytics, biology and ontology engineering. This thesis reviews the state-of-the-art of theory of Formal Concept Analysis(FCA) and its various extensions that have been developed and well-studied in the past several years. We discuss their historical roots, reproduce the original definitions and derivations with illustrative examples. Further, we provide a literature review of it’s applications and various approaches adopted by researchers in the areas of dataanalysis, knowledge management with emphasis to data-learning and classification problems. We propose LearnFCA, a novel approach based on FuzzyFCA and probability theory for learning and classification problems. LearnFCA uses an enhanced version of FuzzyLattice which has been developed to store class labels and probability vectors and has the capability to be used for classifying instances with encoded and unlabelled features. We evaluate LearnFCA on encodings from three datasets - mnist, omniglot and cancer images with interesting results and varying degrees of success. Adviser: Dr Jitender Deogu

    LEARNFCA: A FUZZY FCA AND PROBABILITY BASED APPROACH FOR LEARNING AND CLASSIFICATION

    Get PDF
    Formal concept analysis(FCA) is a mathematical theory based on lattice and order theory used for data analysis and knowledge representation. Over the past several years, many of its extensions have been proposed and applied in several domains including data mining, machine learning, knowledge management, semantic web, software development, chemistry ,biology, medicine, data analytics, biology and ontology engineering. This thesis reviews the state-of-the-art of theory of Formal Concept Analysis(FCA) and its various extensions that have been developed and well-studied in the past several years. We discuss their historical roots, reproduce the original definitions and derivations with illustrative examples. Further, we provide a literature review of it’s applications and various approaches adopted by researchers in the areas of dataanalysis, knowledge management with emphasis to data-learning and classification problems. We propose LearnFCA, a novel approach based on FuzzyFCA and probability theory for learning and classification problems. LearnFCA uses an enhanced version of FuzzyLattice which has been developed to store class labels and probability vectors and has the capability to be used for classifying instances with encoded and unlabelled features. We evaluate LearnFCA on encodings from three datasets - mnist, omniglot and cancer images with interesting results and varying degrees of success. Adviser: Jitender Deogu

    Supporting the Maintenance of Identifier Names: A Holistic Approach to High-Quality Automated Identifier Naming

    Get PDF
    A considerable part of the source code is identifier names-- unique lexical tokens that provide information about entities, and entity interactions, within the code. Identifier names provide human-readable descriptions of classes, functions, variables, etc. Poor or ambiguous identifier names (i.e., names that do not correctly describe the code behavior they are associated with) will lead developers to spend more time working towards understanding the code\u27s behavior. Bad naming can also have detrimental effects on tools that rely on natural language clues; degrading the quality of their output and making them unreliable. Additionally, misinterpretations of the code, caused by poor names, can result in the injection of quality issues into the system under maintenance. Thus, improved identifier naming increases developer effectiveness, higher-quality software, and higher-quality software analysis tools. In this dissertation, I establish several novel concepts that help measure and improve the quality of identifiers. The output of this dissertation work is a set of identifier name appraisal and quality tools that integrate into the developer workflow. Through a sequence of empirical studies, I have formulated a series of heuristics and linguistic patterns to evaluate the quality of identifier names in the code and provide naming structure recommendations. I envision and working towards supporting developers in integrating my contributions, discussed in this dissertation, into their development workflow to significantly improve the process of crafting and maintaining high-quality identifier names in the source code

    Supporting feature-level software maintenance

    Get PDF
    Software maintenance is the process of modifying a software system to fix defects, improve performance, add new functionality, or adapt the system to a new environment. A maintenance task is often initiated by a bug report or a request for new functionality. Bug reports typically describe problems with incorrect behaviors or functionalities. These behaviors or functionalities are known as features. Even in very well-designed systems, the source code that implements features is often not completely modularized. The delocalized nature of features makes maintaining them challenging. Since maintenance tasks are expressed in terms of features, the goal of this dissertation is to support software maintenance at the feature-level. We focus on two tasks in particular: feature location and impact analysis via feature coupling.;Feature location is the process of identifying the source code that implements a feature, and it is an essential first step to any maintenance task. There are many existing techniques for feature location that incorporate various types of analyses such as static, dynamic, and textual. In this dissertation, we recognize the advantages of leveraging several types of analyses and introduce a new approach to feature location based on combining dynamic analysis, textual analysis, and web mining algorithms applied to software. The use of web mining for feature location is a novel contribution, and we show that our new techniques based on web mining are significantly more effective than the current state of the art.;After using feature location to identify a feature\u27s source code, maintenance can be completed on that feature. Impact analysis should then be performed to revalidate the system and determine which other features may have been affected by the modifications. We define three feature coupling metrics that capture the relationship between features based on structural information, textual information, and their combination. Our novel feature coupling metrics can be used for impact analysis to quantify the strength of coupling between pairs of features. We performed three empirical studies on open-source software systems to assess the feature coupling metrics and established three major results. First, there is a moderate to strong statistically significant correlation between feature coupling and faults. Second, feature coupling can be used to correctly determine about half of the other features that would be affected by a change to a given feature. Finally, we found that the metrics align with developers\u27 opinions about pairs of features that are actually coupled

    UVR8 mediated spatial differences as a prerequisite for UV-B induced inflorescence phototropism

    Get PDF
    In Arabidopsis hypocotyls, phototropins are the dominant photoreceptors for the positive phototropism response towards unilateral ultraviolet-B (UV-B) radiation. We report a stark contrast of response mechanism with inflorescence stems with a central role for UV RESISTANCE LOCUS 8 (UVR8). The perception of UV-B occurs mainly in the epidermis and cortex with a lesser contribution of the endodermis. Unilateral UV-B exposure does not lead to a spatial difference in UVR8 protein levels but does cause differential UVR8 signal throughout the stem with at the irradiated side 1) increase of the transcription factor ELONGATED HYPOCOTYL 5 (HY5), 2) an associated strong activation of flavonoid biosynthesis genes and flavonoid accumulation, 3) increased GA2oxidase expression, diminished gibberellin1 levels and accumulation of DELLA protein REPRESSOR OF GA1 (RGA) and, 4) increased expression of the auxin transport regulator, PINOID, contributing to local diminished auxin signalling. Our molecular findings are in support of the Blaauw theory (1919), suggesting that differential growth occurs trough unilateral photomorphogenic growth inhibition. Together the data indicate phototropin independent inflorescence phototropism through multiple locally UVR8-regulated hormone pathways

    Tree Peony Species Are a Novel Resource for Production of α-Linolenic Acid

    Get PDF
    Tree peony is known worldwide for its excellent ornamental and medical values, but recent reports that their seeds contain over 40% α-linolenic acid (ALA), an essential fatty acid for humans drew additional interest of biochemists. To understand the key factors that contribute to this rich accumulation of ALA, we carried out a comprehensive study of oil accumulation in developing seeds of nine wild tree peony species. The fatty acid content and composition was highly variable among the nine species; however, we selected a high- (P. rockii) and low-oil (P. lutea) accumulating species for a comparative transcriptome analysis. Similar to other oilseed transcriptomic studies, upregulation of select genes involved in plastidial fatty acid synthesis, and acyl editing, desaturation and triacylglycerol assembly in the endoplasmic reticulum was noted in seeds of P. rockii relative to P. lutea. Also, in association with the ALA content, transcript levels for fatty acid desaturases (SAD, FAD2 and FAD3), which encode for enzymes necessary for polyunsaturated fatty acid synthesis were higher in P. rockii compared to P. lutea. We further showed that the overexpression of PrFAD2 and PrFAD3 in Arabidopsis increased linoleic and α-linolenic acid content, respectively and modulated their final ratio in the seed oil. In conclusion, we identified the key steps that contribute to efficient ALA synthesis and validated the necessary desaturases in P. rockii that are responsible for not only increasing oil content but also modulating 18:2/18:3 ratio in seeds. Together, these results will aid to improve essential fatty acid content in seeds of tree peonies and other crops of agronomic interest
    corecore