289 research outputs found
Strategies for converting to a DBMS environment
The conversion to data base management systems processing techniques consists of three different strategies - one for each of the major stages in the development process. Each strategy was chosen for its approach in bringing about a smooth evolutionary type transition from one mode of operation to the next. The initial strategy of the indoctrination stage consisted of: (1) providing maximum access to current administrative data as soon as possible; (2) select and developing small prototype systems; (3) establishing a user information center as a central focal point for user training and assistance; and (4) developing a training program for programmers, management and ad hoc users in DBMS application and utilization. Security, the rate of the data dictionary, and data base tuning and capacity planning, and the development of a change of attitude in an automated office are issues meriting consideration
Penalized composite link models for aggregated spatial count data: a mixed model approach
Mortality data provide valuable information for the study of the spatial distri- bution of mortality risk, in disciplines such as spatial epidemiology and public health. However, they are frequently available in an aggregated form over irreg- ular geographical units, hindering the visualization of the underlying mortality risk. Also, it can be of interest to obtain mortality risk estimates on a finer spatial resolution, such that they can be linked to potential risk factors that are usually measured in a different spatial resolution. In this paper, we propose the use of the penalized composite link model and its mixed model representation. This model considers the nature of mortality rates by incorporating the population size at the finest resolution, and allows the creation of mortality maps at a finer scale, thus reducing the visual bias resulting from the spatial aggrega- tion within original units. We also extend the model by considering individual random effects at the aggregated scale, in order to take into account the overdis- persion. We illustrate our novel proposal using two datasets: female deaths by lung cancer in Indiana, USA, and male lip cancer incidence in Scotland counties. We also compare the performance of our proposal with the area-to-point Poisson kriging approach
Penalized composite link mixed models for two-dimensional count data
Mortality data provide valuable information for the study of the spatial distribution of mortality risk, in disciplines such as spatial epidemiology, medical demography, and public health. However, they are often available in an aggregated form over irregular geographical units, hindering the visualization of the underlying mortality risk and the detection of meaningful patterns. Also, it could be of interest to obtain mortality risk estimates on a finer spatial resolution, such that they can be linked with potential risk factors — in a posterior correlation analysis — that are usually measured in a different spatial resolution than mortality data. In this paper, we propose the use of the penalized composite link model and its representation as a mixed model to deal with these issues. This model takes into account the nature of mortality rates by incorporating the population size at the finest resolution, and allows the creation of mortality maps at a desirable scale, reducing the visual bias resulting from the spatial aggregation within original units. We illustrate our proposal with the analysis of several datasets related with deaths by respiratory diseases, cardiovascular diseases, and lung cancer.MTM2011-28285-C02-02
MTM2014-52184-
Modeling latent spatio-temporal disease incidence using penalized composite link models
Epidemiological data are frequently recorded at coarse spatio-temporal resolutions to protect confidential information or to summarize it in a compact manner. However, the detailed patterns followed by the source data, which may be of interest to researchers and public health officials, are overlooked. We propose to use the penalized composite link model (Eilers PCH (2007)), combined with spatio-temporal P-splines methodology (Lee D.-J., Durban M (2011)) to estimate the underlying trend within data that have been aggregated not only in space, but also in time. Model estimation is carried out within a generalized linear mixed model framework, and sophisticated algorithms are used to speed up computations that otherwise would be unfeasible. The model is then used to analyze data obtained during the largest outbreak of Q-fever in the Netherlands.Grant No. MTM2014-52184-P awarded to MD, and DA, and by Agencia Estatal de Investigació
Modelling latent trends from spatio-temporally grouped data using composite link mixed models
Epidemiological data are frequently recorded at coarse spatio-temporal resolutions. The aggregation process is done for several reasons: to protect confidential patients' information, to compare with other datasets at a coarser resolution than the original, or to summarize data in a compact manner. However, we lose detailed patterns that follow the original data, which can be of interest for researchers and public health officials. In this paper we propose the use of the penalized composite link model (Eilers, 2007), together with its mixed model representation, to estimate the underlying trend behind grouped data at a finer spatio-temporal resolution. Also, this model allows the incorporation of fine-scale population into the estimation procedure. We assume the underlying trend is smooth across space and time. The mixed model representation enables the use of sophisticated algorithms such as the SAP algorithm of Rodríguez- Álvarez et al. (2015) for fast estimation of the amount of smoothness. We illustrate our proposal with the analysis of data obtained during the largest outbreak of Q fever in the Netherlands.MTM2011-28285-C02-02, MTM2014-52184-
Contrasting abundance and residency patterns of two sympatric populations of transient killer whales (Orcinus orca) in the northern Gulf of Alaska
Two sympatric populations of “transient” (mammal-eating)
killer whales were photo-identified over 27 years (1984–2010) in Prince William Sound and Kenai Fjords, coastal waters of the northern Gulf of Alaska (GOA). A total of 88 individuals were identified during 203 encounters with “AT1” transients (22 individuals) and 91 encounters with “GOA” transients (66 individuals). The median number of individuals identified annually was similar for both populations (AT1=7; GOA=8), but mark-recapture estimates showed the AT1 whales to have much higher fidelity to the study area, whereas the GOA whales had a higher exchange of
individuals. Apparent survival estimates were generally high for both populations, but there was a significant
reduction in the survival of AT1 transients after the Exxon Valdez oil spill in 1989, with an abrupt decline in estimated abundance from a high of 22 in 1989 to a low of seven whales at the end of 2010. There was no detectable decline in GOA population abundance or survival over the same period, but abundance ranged from just 6 to 18 whales annually. Resighting data from adjacent coastal waters
and movement tracks from satellite tags further indicated that the GOA whales are part of a larger population with a more extensive range, whereas AT1 whales are resident to
the study area
Fast estimation of multidimensional adaptive P-spline models
A fast and stable algorithm for estimating multidimensional adaptive P-spline models is presented. We call it as Separation of Overlapping Penalties (SOP) as it is an extension of the Separation of Anisotropic Penalties (SAP) algorithm. SAP was originally derived for the estimation of the smoothing parameters of a multidimensional tensor product P-spline model with anisotropic penalties.MTM2014-55966-P
MTM2014-52184-
On the estimation of variance parameters in non-standard generalised linear mixed models: Application to penalised smoothing
We present a novel method for the estimation of variance parameters in generalised linear mixed models. The method has its roots in Harville (1977)'s work, but it is able to deal with models that have a precision matrix for the random-effect vector that is linear in the inverse of the variance parameters (i.e., the precision parameters). We call the method SOP (Separation of Overlapping Precision matrices). SOP is based on applying the method of successive approximations to easy-to-compute estimate updates of the variance parameters. These estimate updates have an appealing form: they are the ratio of a (weighted) sum of squares to a quantity related to effective degrees of freedom. We provide the sufficient and necessary conditions for these estimates to be strictly positive. An important application field of SOP is penalised regression estimation of models where multiple quadratic penalties act on the same regression coefficients. We discuss in detail two of those models: penalised splines for locally adaptive smoothness and for hierarchical curve data. Several data examples in these settings are presented.MTM2014-55966-P
MTM2014-52184-
Smooth additive mixed models for predicting aboveground biomass
Aboveground biomass estimation in short-rotation forestry plantations is an essential step in the development of crop management strategies as well as allowing the economic viability of the crop to be determined prior to harvesting. Hence, it is important to develop new methodologies that improve the accuracy of predictions, using only a minimum set of easily obtainable information i.e. diameter and height. Many existing models base their predictions only on diameter (mainly due to the complexity of including further covariates), or rely on complicated equations to obtain biomass predictions. However, in tree species, it is important to include height when estimating aboveground biomass because this will vary from one genotype to another. This work proposes the use of a more flexible and easy to implement model for predicting aboveground biomass (stem, branches and total) as a smooth function of height and diameter using smooth additive mixed models which preserve the additive property necessary to model the relationship within wood fractions, and allows the inclusion of random effects and interaction terms. The model is applied to the analysis of three trials carried out in Spain, where nine clones at three different sites are compared. Also, an analysis of slash pine data is carried out in order to compared with the approach proposed by Parresol (2001
Spatio-temporal adaptive penalized splines with application to Neuroscience
Data analysed here derive from experiments conducted to study neurons' activity in the visual cortex of behaving monkeys. We consider a spatio-temporal adaptive penalized spline (P-spline) approach for modelling the firing rate of visual neurons. To the best of our knowledge, this is the first attempt in the statistical literature for locally adaptive smoothing in three dimensions. Estimation is based on the Separation of Overlapping Penalties (SOP) algorithm, which provides the stability and speed we look for.MTM2014-55966-P
MTM2014-52184-P
RETICS, Oftared - RD12/0034/001
- …