39 research outputs found
Change point analysis of historical battle deaths
It has been claimed and disputed that World War II has been followed by a
`long peace', an unprecedented decline of war. We conduct a full changepoint
analysis of well-documented, publicly-available battle deaths datasets, using
new techniques that enable the robust detection of changes in the statistical
properties of such heavy-tailed data. We first test and calibrate these
techniques. We then demonstrate the existence of changes, independent of data
presentation, at around 1910 and 1950 CE, bracketing the World Wars, and around
the 1830s and 1994 CE. Our analysis provides a methodology for future
investigations and an empirical basis for political and historical discussions.Comment: 24 pages, 11 figure
A genomic and transcriptomic approach for a differential diagnosis between primary and secondary ovarian carcinomas in patients with a previous history of breast cancer
<p>Abstract</p> <p>Background</p> <p>The distinction between primary and secondary ovarian tumors may be challenging for pathologists. The purpose of the present work was to develop genomic and transcriptomic tools to further refine the pathological diagnosis of ovarian tumors after a previous history of breast cancer.</p> <p>Methods</p> <p>Sixteen paired breast-ovary tumors from patients with a former diagnosis of breast cancer were collected. The genomic profiles of paired tumors were analyzed using the Affymetrix GeneChip<sup>Âź </sup>Mapping 50 K Xba Array or Genome-Wide Human SNP Array 6.0 (for one pair), and the data were normalized with ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) algorithm or Partek Genomic Suite, respectively. The transcriptome of paired samples was analyzed using Affymetrix GeneChip<sup>Âź </sup>Human Genome U133 Plus 2.0 Arrays, and the data were normalized with gc-Robust Multi-array Average (gcRMA) algorithm. A hierarchical clustering of these samples was performed, combined with a dataset of well-identified primary and secondary ovarian tumors.</p> <p>Results</p> <p>In 12 of the 16 paired tumors analyzed, the comparison of genomic profiles confirmed the pathological diagnosis of primary ovarian tumor (n = 5) or metastasis of breast cancer (n = 7). Among four cases with uncertain pathological diagnosis, genomic profiles were clearly distinct between the ovarian and breast tumors in two pairs, thus indicating primary ovarian carcinomas, and showed common patterns in the two others, indicating metastases from breast cancer. In all pairs, the result of the transcriptomic analysis was concordant with that of the genomic analysis.</p> <p>Conclusions</p> <p>In patients with ovarian carcinoma and a previous history of breast cancer, SNP array analysis can be used to distinguish primary and secondary ovarian tumors. Transcriptomic analysis may be used when primary breast tissue specimen is not available.</p
Identifying stationary phases in multivariate time series for highlighting behavioural modes and home range settlements
Recent advances in biologging open promising perspectives in the study of animal movements at numerous scales. It is now possible to record time series of animal locations and ancillary data (e.g. activity level derived from onâboard accelerometers) over extended areas and long durations with a high spatial and temporal resolution. Such time series are often piecewise stationary, as the animal may alternate between different stationary phases (i.e. characterized by a specific mean and variance of some key parameter for limited periods). Identifying when these phases start and end is a critical first step to understand the dynamics of the underlying movement processes.
We introduce a new segmentationâclustering method we called segclust2d (available as a r package at cran.r-project.org/package=segclust2d). It can segment bivariate (or more generally multivariate) time series and possibly cluster the various segments obtained, corresponding to different phases assumed to be stationary. This method is easy to use, as it only requires specifying a minimum segment length (to prevent overâsegmentation), based on biological rather than statistical considerations.
This method can be applied to bivariate piecewise time series of any nature. We focus here on two types of time series related to animal movement, corresponding to (a) at large scale, series of bivariate coordinates of relocations, to highlight temporary home ranges, and (b) at smaller scale, bivariate series derived from relocations data, such as speed and turning angle, to highlight different behavioural modes such as transit, feeding and resting.
Using computer simulations, we show that segclust2d can rival and even outperform previous, more complex methods, which were specifically developed to highlight changes of movement modes or home range shifts (based on hidden Markov and OrnsteinâUhlenbeck modelling), which, contrary to our method, usually require the user to provide relevant initial guesses to be efficient. Furthermore, we demonstrate it on actual examples involving a zebra's smallâscale movements and an elephant's largeâscale movements, to illustrate how various movement modes and home range shifts, respectively, can be identified.The code of the method is publicly available as an R package (cran.râproje ct.org/packa ge=segcl ust2d ). The data used in the examples (24âhr GPS track of a plains zebra and GPS track of an African el-ephant recorded for >2.5 years) are available on Dryad: https: //doi. org/10.5061/dryad.2j63369 (Patin, Etienne, Lebarbier, ChamaillĂ©âJammes, & Benhamou, 2019).The Grant ANRâ16âCE02â0001â01 of the French âAgence Nationale de la Recherche', and the Zone Atelier program of the CNRS.http://wileyonlinelibrary.com/journal/jane2020-09-20hj2019Zoology and Entomolog