103 research outputs found

    Stabilité d'une structure spatiale et compromis d'une analyse statistique multi-tableaux : application à la physico-chimie d'un lac réservoir

    Get PDF
    10 paramètres physico-chimiques sont mesurés en surface dans 10 stations à 12 dates sur le lac réservoir de la Sorme (Saône-et-Loire). L'article montre comment une analyse multitableaux peut caractériser la structure spatiale et préciser sa stabilité. Les notions d'interstructure et de compromis sont accessibles par une procédure simple et efficace.This paper is based on the observation that three-dimensional data matrices (sites x times x variables) often used in limnological investigations require statistical analyses fitted to experimental objectives. Many apparently different statistical tools (3-mode PCA of TUCKER, 1964; KROONENBERG, 1983; projection of variables WILLIAMS and STEPHENSON, 1973; DOLEDEC and CHESSEL, 1987) may be useful to clarify limnological problems such as : 1) the temporal variability of a pattern (elimination of spatial heterogenity) : 2) the spatial structure of a pattern (elimination of temporal effects, mapping of an average situation) : 3) the temporal variability of Lake stratification (stability, modification or inversion) : 4) the spatial structure of temporal variability, and 5) the between variables typology of a spatial and temporal structure. Our methodological approach allowed us to assess the temporal stability of the spatial structure of the Lake waters (question 3) using a multitable analysis known as triadic analysis (THIOULOUSE and CHESSEL, 1987).As part of the limnological study of a reservoir Lake (Sorme reservoir Lake, Saône-et-Loire, France) 10 commonly used physical and chemical variables were studied from July 1980 to October 1931. During this period, 12 water samples were taken near the surface at each of the 10 stations scattered along the Sorme Lake (see figure 1). Main morphometric features of the Sorme Lake are : 1) a surface area equal to 230 ha, 2) a 25 km long perimeter and 3) a volume of 9.5 106 m3 with a maximum depth of 13 meters upstream of the dam and an average depth of about 4 meters. Seasonal tidal range was only a few meters.Only 2 of the 3 concepts of triadic analysis stated by THIOULOUSE and CHESSEL, 1987 are developed here : 1) for each of the 12 tables (stations x variables) coming from the 12 sampling dates, data are first centered (elimination of mean) and standardized (division by standard deviation) (see figure 2). The resulting table Y called interstructure matrix, i.e. interstructure between each of the sampling dates matrix, is organized to have sampling dates as columns and the ten physical and chemical variables at each station successively as fines. Principal Component Analysis (PCA on the variance-covariance matrix) is then applied to the interstructure matrix. In our case it is a one-dimensional matrix, i.e. according to physical and chemical variables, there is only one spatial structure common to each sampling date (figure 3 and 2) compromise matrix are associated with the successive PCA factors of the interstructure (figure 4). According to the previous remark, only the first factor is considered. Data are reorganized to have physical and chemical variables as columns, and stations as fines. This last table defines a compromise matrix labelled Z. The mapping of the numerical values of matrix Z renders a ten-dimensional description of the permanent spatial structure (figure 5). To summarize the multivariate description, matrix Z is processed with a PCA on the variance-covariance matrix producing a three-dimensional compromise (figure 6).The interpretation of the compromise table by mapping the factorial scores of the PCA leads to a functional scheme of the reservoir Lake waters distinguishing five sectors (see figure 7) as a function of water depth, influence of tidal range, influence of tributaries and of the Sorme River. 3 stations are periodically isolated from the reservoir and produce 3 sectors with lower pH and temperature values and higher concentrations in ammonia and sulphate according to the influence of tributaries. The 4th sector is associated with the former submerged valley, i.e. main channel of the Sorme River prior to the dam closure, and demonstrated an ionic gradient concerning mainly nitrate and chloride-concentrations. The 5th sector, opposed to the latter, consists in the deeper area of the Sorme Lake which reveals rather homogeneous waters near the surface

    Habitat filtering determines spatial variation of macroinvertebrate community traits in northern headwater streams

    Get PDF
    Although our knowledge of the spatial distribution of stream organisms has been increasing rapidly in the last decades, there is still little consensus about trait-based variability of macroinvertebrate communities within and between catchments in near-pristine systems. Our aim was to examine the taxonomic and trait based stability vs. variability of stream macroinvertebrates in three high-latitude catchments in Finland. The collected taxa were assigned to unique trait combinations (UTCs) using biological traits. We found that only a single or a highly limited number of taxa formed a single UTC, suggesting a low degree of redundancy. Our analyses revealed significant differences in the environmental conditions of the streams among the three catchments. Linear models, rarefaction curves and beta-diversity measures showed that the catchments differed in both alpha and beta diversity. Taxon- and trait-based multivariate analyses also indicated that the three catchments were significantly different in terms of macroinvertebrate communities. All these findings suggest that habitat filtering, i.e., environmental differences among catchments, determines the variability of macroinvertebrate communities, thereby contributing to the significant biological differences among the catchments. The main implications of our study is that the sensitivity of trait-based analyses to natural environmental variation should be carefully incorporated in the assessment of environmental degradation, and that further studies are needed for a deeper understanding of trait-based community patterns across near-pristine streams

    Importance of data structure in comparing two dimension reduction methods for classification of microarray gene expression data

    Get PDF
    BACKGROUND: With the advance of microarray technology, several methods for gene classification and prognosis have been already designed. However, under various denominations, some of these methods have similar approaches. This study evaluates the influence of gene expression variance structure on the performance of methods that describe the relationship between gene expression levels and a given phenotype through projection of data onto discriminant axes. RESULTS: We compared Between-Group Analysis and Discriminant Analysis (with prior dimension reduction through Partial Least Squares or Principal Components Analysis). A geometric approach showed that these two methods are strongly related, but differ in the way they handle data structure. Yet, data structure helps understanding the predictive efficiency of these methods. Three main structure situations may be identified. When the clusters of points are clearly split, both methods perform equally well. When the clusters superpose, both methods fail to give interesting predictions. In intermediate situations, the configuration of the clusters of points has to be handled by the projection to improve prediction. For this, we recommend Discriminant Analysis. Besides, an innovative way of simulation generated the three main structures by modelling different partitions of the whole variance into within-group and between-group variances. These simulated datasets were used in complement to some well-known public datasets to investigate the methods behaviour in a large diversity of structure situations. To examine the structure of a dataset before analysis and preselect an a priori appropriate method for its analysis, we proposed a two-graph preliminary visualization tool: plotting patients on the Between-Group Analysis discriminant axis (x-axis) and on the first and the second within-group Principal Components Analysis component (y-axis), respectively. CONCLUSION: Discriminant Analysis outperformed Between-Group Analysis because it allows for the dataset structure. An a priori knowledge of that structure may guide the choice of the analysis method. Simulated datasets with known properties are valuable to assess and compare the performance of analysis methods, then implementation on real datasets checks and validates the results. Thus, we warn against the use of unchallenging datasets for method comparison, such as the Golub dataset, because their structure is such that any method would be efficient

    Traits and stress: keys to identify community effects of low levels of toxicants in test systems

    Get PDF
    Community effects of low toxicant concentrations are obscured by a multitude of confounding factors. To resolve this issue for community test systems, we propose a trait-based approach to detect toxic effects. An experiment with outdoor stream mesocosms was established 2-years before contamination to allow the development of biotic interactions within the community. Following pulse contamination with the insecticide thiacloprid, communities were monitored for additional 2 years to observe long-term effects. Applying a priori ecotoxicological knowledge species were aggregated into trait-based groups that reflected stressor-specific vulnerability of populations to toxicant exposure. This reduces inter-replicate variation that is not related to toxicant effects and enables to better link exposure and effect. Species with low intrinsic sensitivity showed only transient effects at the highest thiacloprid concentration of 100 μg/l. Sensitive multivoltine species showed transient effects at 3.3 μg/l. Sensitive univoltine species were affected at 0.1 μg/l and did not recover during the year after contamination. Based on these results the new indicator SPEARmesocosm was calculated as the relative abundance of sensitive univoltine taxa. Long-term community effects of thiacloprid were detected at concentrations 1,000 times below those detected by the PRC (Principal Response Curve) approach. We also found that those species, characterised by the most vulnerable trait combination, that were stressed were affected more strongly by thiacloprid than non-stressed species. We conclude that the grouping of species according to toxicant-related traits enables identification and prediction of community response to low levels of toxicants and that additionally the environmental context determines species sensitivity to toxicants

    Relating Habitat and Climatic Niches in Birds

    Get PDF
    Predicting species' responses to the combined effects of habitat and climate changes has become a major challenge in ecology and conservation biology. However, the effects of climatic and habitat gradients on species distributions have generally been considered separately. Here, we explore the relationships between the habitat and thermal dimensions of the ecological niche in European common birds. Using data from the French Breeding Bird Survey, a large-scale bird monitoring program, we correlated the habitat and thermal positions and breadths of 74 bird species, controlling for life history traits and phylogeny. We found that cold climate species tend to have niche positions in closed habitats, as expected by the conjunction of the biogeographic history of birds' habitats, and their current continent-scale gradients. We also report a positive correlation between thermal and habitat niche breadths, a pattern consistent with macroecological predictions concerning the processes shaping species' distributions. Our results suggest that the relationships between the climatic and habitat components of the niche have to be taken into account to understand and predict changes in species' distributions

    Geometagenomics illuminates the impact of agriculture on the distribution and prevalence of plant viruses at the ecosystem scale

    Get PDF
    Disease emergence events regularly result from human activities such as agriculture, which frequently brings large populations of genetically uniform hosts into contact with potential pathogens. Although viruses cause nearly 50% of emerging plant diseases, there is little systematic information about virus distribution across agro-ecological interfaces and large gaps in understanding of virus diversity in nature. Here we applied a novel landscape-scale geometagenomics approach to examine relationships between agricultural land use and distributions of plantassociated viruses in two Mediterranean-climate biodiversity hotspots (Western Cape region of South Africa and Rhône river delta region of France). In total, we analysed 1725 geo-referenced plant samples collected over two years from 4.5 × 4.5 km2 grids spanning farmlands and adjacent uncultivated vegetation. We found substantial virus prevalence (25.8–35.7%) in all ecosystems, but prevalence and identified family-level virus diversity were greatest in cultivated areas, with some virus families displaying strong agricultural associations. Our survey revealed 94 previously unknown virus species, primarily from uncultivated plants. This is the first effort to systematically evaluate plant-associated viromes across broad agro-ecological interfaces. Our findings indicate that agriculture substantially influences plant virus distributions and highlight the extent of current ignorance about the diversity and roles of viruses in nature

    Methods of nutrition surveillance in low-income countries

    Get PDF
    Background In 1974 a joint FAO/UNICEF/WHO Expert Committee met to develop methods for nutrition surveillance. There has been much interest and activity in this topic since then, however there is a lack of guidance for practitioners and confusion exists around the terminology of nutrition surveillance. In this paper we propose a classification of data collection activities, consider the technical issues for each category, and examine the potential applications and challenges related to information and communication technology. Analysis There are three major approaches used to collect primary data for nutrition surveillance: repeated cross-sectional surveys; community-based sentinel monitoring; and the collection of data in schools. There are three major sources of secondary data for surveillance: from feeding centres, health facilities, and community-based data collection, including mass screening for malnutrition in children. Surveillance systems involving repeated surveys are suitable for monitoring and comparing national trends and for planning and policy development. To plan at a local level, surveys at district level or in programme implementation areas are ideal, but given the usually high cost of primary data collection, data obtained from health systems are more appropriate provided they are interpreted with caution and with contextual information. For early warning, data from health systems and sentinel site assessments may be valuable, if consistent in their methods of collection and any systematic bias is deemed to be steady. For evaluation purposes, surveillance systems can only give plausible evidence of whether a programme is effective. However the implementation of programmes can be monitored as long as data are collected on process indicators such as access to, and use of, services. Surveillance systems also have an important role to provide information that can be used for advocacy and for promoting accountability for actions or lack of actions, including service delivery. Conclusion This paper identifies issues that affect the collection of nutrition surveillance data, and proposes definitions of terms to differentiate between diverse sources of data of variable accuracy and validity. Increased interest in nutrition globally has resulted in high level commitments to reduce and prevent undernutrition. This review helps to address the need for accurate and regular data to convert these commitments into practice
    • …
    corecore