
    Feature selection for chemical sensor arrays using mutual information

    We address the problem of feature selection for classifying a diverse set of chemicals using an array of metal oxide sensors. Our aim is to evaluate a filter approach to feature selection with reference to previous work, which used a wrapper approach on the same data set, and established best features and upper bounds on classification performance. We selected feature sets that exhibit the maximal mutual information with the identity of the chemicals. The selected features closely match those found to perform well in the previous study using a wrapper approach to conduct an exhaustive search of all permitted feature combinations. By comparing the classification performance of support vector machines (using features selected by mutual information) with the performance observed in the previous study, we found that while our approach does not always give the maximum possible classification performance, it always selects features that achieve classification performance approaching the optimum obtained by exhaustive search. We performed further classification using the selected feature set with some common classifiers and found that, for the selected features, Bayesian Networks gave the best performance. Finally, we compared the observed classification performances with the performance of classifiers using randomly selected features. We found that the selected features consistently outperformed randomly selected features for all tested classifiers. The mutual information filter approach is therefore a computationally efficient method for selecting near optimal features for chemical sensor arrays
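The filter idea in the abstract above, scoring features by their mutual information with the chemical identity, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the sensor responses have already been discretised into bins, it ranks features one at a time rather than scoring whole feature sets as the abstract describes, and the names `mutual_information`, `rank_features` and the toy data are hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (count_x[x] * count_y[y]))
    return mi

def rank_features(columns, labels):
    """Score each feature by MI with the labels; return names sorted best first."""
    scores = {name: mutual_information(vals, labels) for name, vals in columns.items()}
    ranking = sorted(scores, key=scores.get, reverse=True)
    return ranking, scores

# Toy data (hypothetical): two chemicals, two binned sensor features.
labels = ["A", "A", "B", "B"]
columns = {
    "informative": [0, 0, 1, 1],  # separates the two classes perfectly
    "noise": [0, 1, 0, 1],        # independent of the class
}
ranking, scores = rank_features(columns, labels)
print(ranking, scores)
```

A filter of this kind never trains a classifier during selection, which is why it is cheaper than a wrapper approach that evaluates each candidate feature combination by classification performance.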

    Ecological Thresholds in the Savanna Landscape: Developing a Protocol for Monitoring the Change in Composition and Utilisation of Large Trees

    BACKGROUND: Acquiring greater understanding of the factors causing changes in vegetation structure -- particularly with the potential to cause regime shifts -- is important in adaptively managed conservation areas. Large trees (≥5 m in height) perform an important ecosystem function, and are associated with a stable ecological state in the African savanna. There is concern that large tree densities are declining in a number of protected areas, including the Kruger National Park, South Africa. In this paper the results of a field study designed to monitor change in a savanna system are presented and discussed. METHODOLOGY/PRINCIPAL FINDINGS: We developed the first phase of a monitoring protocol to measure the change in tree species composition, density and size distribution, whilst also identifying factors driving change. A central issue is the discrete spatial distribution of large trees in the landscape, making point sampling approaches relatively ineffective. Accordingly, fourteen 10 m wide transects were aligned perpendicular to large rivers (3.0-6.6 km in length) and eight transects were located at fixed-point photographic locations (1.0-1.6 km in length). Using accumulation curves, we established that the majority of tree species were sampled within 3 km. Furthermore, the key ecological drivers (e.g. fire, herbivory, drought and disease) which influence large tree use and impact were also recorded within 3 km. CONCLUSIONS/SIGNIFICANCE: The technique presented provides an effective method for monitoring changes in large tree abundance, size distribution and use by the main ecological drivers across the savanna landscape. However, the monitoring of rare tree species would require individual marking approaches due to their low densities and specific habitat requirements. Repeat sampling intervals would vary depending on the factor of concern and proposed management mitigation.
Once a monitoring protocol has been identified and evaluated, the next stage is to integrate that protocol into a decision-making system, which highlights potential leading indicators of change. Frequent monitoring would be required to establish the rate and direction of change. This approach may be useful in generating monitoring protocols for other dynamic systems
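The accumulation-curve step mentioned in the abstract above can be sketched as follows. This is an illustrative reconstruction, not the study's protocol: it assumes a transect split into equal-length segments with a set of species recorded per segment, and the function names, the 1 km segment length, the 75% threshold and the placeholder species labels are all hypothetical.

```python
def accumulation_curve(segments):
    """Cumulative number of distinct species after each successive segment."""
    seen = set()
    curve = []
    for species_in_segment in segments:
        seen.update(species_in_segment)
        curve.append(len(seen))
    return curve

def distance_to_fraction(curve, segment_km, fraction):
    """Transect length (km) at which the given fraction of all species is reached."""
    target = fraction * curve[-1]
    for i, count in enumerate(curve):
        if count >= target:
            return (i + 1) * segment_km
    return len(curve) * segment_km

# Toy transect (hypothetical): five 1 km segments with placeholder species labels.
segments = [
    {"sp1", "sp2"},
    {"sp2", "sp3"},
    {"sp4"},
    {"sp1"},
    {"sp5"},
]
curve = accumulation_curve(segments)
km_needed = distance_to_fraction(curve, segment_km=1.0, fraction=0.75)
print(curve, km_needed)
```

Once the curve flattens, further transect length adds few new species, which is the reasoning behind reporting a distance within which most species were sampled.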

    Microbial Diversity of a Brazilian Coastal Region Influenced by an Upwelling System and Anthropogenic Activity

    BACKGROUND: Upwelling systems are characterised by an intense primary biomass production in the surface (warmest) water after the outcrop of the bottom (coldest) water, which is rich in nutrients. Although it is known that the microbial assemblage plays an important role in the food chain of marine systems and that the upwelling systems that occur in southwest Brazil drive the complex dynamics of the food chain, little is known about the microbial composition present in this region. METHODOLOGY/PRINCIPAL FINDINGS: We carried out a molecular survey based on SSU rRNA gene from the three domains of the phylogenetic tree of life present in a tropical upwelling region (Arraial do Cabo, Rio de Janeiro, Brazil). The aim was to analyse the horizontal and vertical variations of the microbial composition in two geographically close areas influenced by anthropogenic activity (sewage disposal/port activity) and upwelling phenomena, respectively. A lower estimated diversity of microorganisms of the three domains of the phylogenetic tree of life was found in the water of the area influenced by anthropogenic activity compared to the area influenced by upwelling phenomena. We observed a heterogenic distribution of the relative abundance of taxonomic groups, especially in the Archaea and Eukarya domains. The bacterial community was dominated by Proteobacteria, Cyanobacteria and Bacteroidetes phyla, whereas the microeukaryotic community was dominated by Metazoa, Fungi, Alveolata and Stramenopile. The estimated archaeal diversity was the lowest of the three domains and was dominated by uncharacterised marine Crenarchaeota that were most closely related to Marine Group I. 
CONCLUSIONS/SIGNIFICANCE: The variety of conditions and the presence of different microbial assemblages indicated that the area of Arraial do Cabo can be used as a model for detailed studies that contemplate the correlation between pollution-indicating parameters and the depletion of microbial diversity in areas close to anthropogenic activity; functional roles and geochemical processes; phylogeny of the uncharacterised diversity; and seasonal variations of the microbial assemblages

    Variation in Symbiodinium ITS2 Sequence Assemblages among Coral Colonies

    Endosymbiotic dinoflagellates in the genus Symbiodinium are fundamentally important to the biology of scleractinian corals, as well as to a variety of other marine organisms. The genus Symbiodinium is genetically and functionally diverse and the taxonomic nature of the union between Symbiodinium and corals is implicated as a key trait determining the environmental tolerance of the symbiosis. Surprisingly, the question of how Symbiodinium diversity partitions within a species across spatial scales of meters to kilometers has received little attention, but is important to understanding the intrinsic biological scope of a given coral population and adaptations to the local environment. Here we address this gap by describing the Symbiodinium ITS2 sequence assemblages recovered from colonies of the reef building coral Montipora capitata sampled across Kāne'ohe Bay, Hawai'i. A total of 52 corals were sampled in a nested design of Coral Colony(Site(Region)) reflecting spatial scales of meters to kilometers. A diversity of Symbiodinium ITS2 sequences was recovered with the majority of variance partitioning at the level of the Coral Colony. To confirm this result, the Symbiodinium ITS2 sequence diversity in six M. capitata colonies was analyzed in much greater depth with 35 to 55 clones per colony. The ITS2 sequences and quantitative composition recovered from these colonies varied significantly, indicating that each coral hosted a different assemblage of Symbiodinium. The diversity of Symbiodinium ITS2 sequence assemblages retrieved from individual colonies of M. capitata here highlights the problems inherent in interpreting multi-copy and intra-genomically variable molecular markers, and serves as a context for discussing the utility and biological relevance of assigning species names based on Symbiodinium ITS2 genotyping

    The Natural Statistics of Audiovisual Speech

    Humans, like other animals, are exposed to a continuous stream of signals, which are dynamic, multimodal, extended, and time varying in nature. This complex input space must be transduced and sampled by our sensory systems and transmitted to the brain where it can guide the selection of appropriate actions. To simplify this process, it's been suggested that the brain exploits statistical regularities in the stimulus space. Tests of this idea have largely been confined to unimodal signals and natural scenes. One important class of multisensory signals for which a quantitative input space characterization is unavailable is human speech. We do not understand what signals our brain has to actively piece together from an audiovisual speech stream to arrive at a percept versus what is already embedded in the signal structure of the stream itself. In essence, we do not have a clear understanding of the natural statistics of audiovisual speech. In the present study, we identified the following major statistical features of audiovisual speech. First, we observed robust correlations and close temporal correspondence between the area of the mouth opening and the acoustic envelope. Second, we found the strongest correlation between the area of the mouth opening and vocal tract resonances. Third, we observed that both area of the mouth opening and the voice envelope are temporally modulated in the 2–7 Hz frequency range. Finally, we show that the timing of mouth movements relative to the onset of the voice is consistently between 100 and 300 ms. We interpret these data in the context of recent neural theories of speech which suggest that speech communication is a reciprocally coupled, multisensory event, whereby the outputs of the signaler are matched to the neural processes of the receiver
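The kind of measurement summarised above, correlating mouth-opening area with the acoustic envelope and estimating the lag between them, can be sketched with a lagged Pearson correlation. This is only an illustration on synthetic data, not the study's analysis pipeline. The sampling rate, the 4 Hz test signal, the 150 ms delay and the names `pearson` and `best_lag` are assumptions chosen for the example.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def best_lag(mouth, envelope, max_lag):
    """Lag (in samples) by which the envelope trails the mouth signal,
    chosen as the lag in 0..max_lag with the highest correlation."""
    best_lag_samples, best_r = 0, -2.0
    for lag in range(max_lag + 1):
        r = pearson(mouth[:len(mouth) - lag], envelope[lag:])
        if r > best_r:
            best_lag_samples, best_r = lag, r
    return best_lag_samples, best_r

# Synthetic signals (assumed): 2 s at 100 Hz, a 4 Hz modulation, with the
# envelope delayed by 15 samples (150 ms) relative to the mouth signal.
FS = 100
DELAY = 15
mouth = [math.sin(2 * math.pi * 4 * i / FS) for i in range(200)]
envelope = [math.sin(2 * math.pi * 4 * (i - DELAY) / FS) for i in range(200)]
lag, r = best_lag(mouth, envelope, max_lag=20)
print(lag * 1000 // FS, "ms", r)
```

The search is limited to less than one modulation period (25 samples at 4 Hz) because a periodic signal correlates equally well at lags that differ by a whole period.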

    Network Modeling Identifies Molecular Functions Targeted by miR-204 to Suppress Head and Neck Tumor Metastasis

    Due to the large number of putative microRNA gene targets predicted by sequence-alignment databases and the relatively low accuracy of such predictions, which are conducted independently of biological context by design, systematic experimental identification and validation of every functional microRNA target is currently challenging. Consequently, biological studies have yet to identify, on a genome scale, key regulatory networks perturbed by altered microRNA functions in the context of cancer. In this report, we demonstrate for the first time how phenotypic knowledge of inheritable cancer traits and of risk factor loci can be utilized jointly with gene expression analysis to efficiently prioritize deregulated microRNAs for biological characterization. Using this approach we characterize miR-204 as a tumor suppressor microRNA and uncover previously unknown connections between microRNA regulation, network topology, and expression dynamics. Specifically, we validate 18 gene targets of miR-204 that show elevated mRNA expression and are enriched in biological processes associated with tumor progression in squamous cell carcinoma of the head and neck (HNSCC). We further demonstrate the enrichment of bottleneckness, a key molecular network topology, among miR-204 gene targets. Restoration of miR-204 function in HNSCC cell lines inhibits the expression of its functionally related gene targets, leads to reduced adhesion, migration and invasion in vitro, and attenuates experimental lung metastasis in vivo. Equally importantly, our investigation also provides experimental evidence linking the function of microRNAs that are located in the cancer-associated genomic regions (CAGRs) to the observed predisposition to human cancers. Specifically, we show miR-204 may serve as a tumor suppressor gene at the 9q21.1–22.3 CAGR locus, a well-established risk factor locus in head and neck cancers for which tumor suppressor genes have not been identified.
This new strategy, which integrates expression profiling, genetics and novel computational biology approaches, provides improved efficiency in the characterization and modeling of microRNA functions in cancer compared to the state of the art, and is applicable to the investigation of microRNA functions in other biological processes and diseases

    Population Structure of Humpback Whales from Their Breeding Grounds in the South Atlantic and Indian Oceans

    Although humpback whales are among the best-studied of the large whales, population boundaries in the Southern Hemisphere (SH) have remained largely untested. We assess population structure of SH humpback whales using 1,527 samples collected from whales at fourteen sampling sites within the Southwestern and Southeastern Atlantic, the Southwestern Indian Ocean, and Northern Indian Ocean (Breeding Stocks A, B, C and X, respectively). Evaluation of mtDNA population structure and migration rates was carried out under different statistical frameworks. Using all genetic evidence, the results suggest significant degrees of population structure between all ocean basins, with the Southwestern and Northern Indian Ocean most differentiated from each other. Effective migration rates were highest between the Southeastern Atlantic and the Southwestern Indian Ocean, followed by rates within the Southeastern Atlantic, and the lowest between the Southwestern and Northern Indian Ocean. At finer scales, very low gene flow was detected between the two neighbouring sub-regions in the Southeastern Atlantic, compared to high gene flow for whales within the Southwestern Indian Ocean. Our genetic results support the current management designations proposed by the International Whaling Commission of Breeding Stocks A, B, C, and X as four strongly structured populations. The population structure patterns found in this study are likely to have been influenced by a combination of long-term maternally directed fidelity of migratory destinations, along with other ecological and oceanographic features in the region

    Prognostic DNA methylation markers for sporadic colorectal cancer: a systematic review

    Background Biomarkers that can predict the prognosis of colorectal cancer (CRC) patients and that can stratify high-risk early stage patients from low-risk early stage patients are urgently needed for better management of CRC. During the last decades, a large variety of prognostic DNA methylation markers has been published in the literature. However, to date, none of these markers are used in clinical practice. Methods To obtain an overview of the number of published prognostic methylation markers for CRC, the number of markers that were validated independently, and the current level of evidence (LoE), we conducted a systematic review of PubMed, EMBASE, and MEDLINE. In addition, we scored studies based on the REMARK guidelines that were established in order to attain more transparency and complete reporting of prognostic biomarker studies. Eighty-three studies reporting on 123 methylation markers fulfilled the study entry criteria and were scored according to REMARK. Results Sixty-three studies investigated single methylation markers, whereas 20 studies reported combinations of methylation markers. We observed substantial variation regarding the reporting of sample sizes and patient characteristics, statistical analyses, and methodology. The median (range) REMARK score for the studies was 10.7 points (4.5 to 17.5) out of a maximum of 20 possible points. The median REMARK score was lower in studies that reported a p value below 0.05 than in those that did not (p = 0.005). A borderline statistically significant association was observed between the reported p value of the survival analysis and the size of the study population (p = 0.051). Only 23 out of 123 markers (17%) were investigated in two or more study series. For 12 markers, and two multimarker panels, consistent results were reported in two or more study series. For four markers, the current LoE is level II; for all other markers, the LoE is lower.
Conclusion This systematic review reflects that adequate reporting according to REMARK and validation of prognostic methylation markers is absent in the majority of CRC methylation marker studies. However, this systematic review provides a comprehensive overview of published prognostic methylation markers for CRC and highlights the most promising markers that have been published in the last two decades