357 research outputs found
A double-helix neutron detector using micron-size B-10 powder
A double-helix electrode configuration is combined with a B powder
coating technique to build large-area (9 in 36 in) neutron detectors.
The neutron detection efficiency for each of the four prototypes is comparable
to a single 2-bar He drift tube of the same length (36 in). One unit has
been operational continuously for 18 months and the change of efficiency is
less than 1%. An analytic model for pulse heigh spectra is described and the
predicted mean film thickness agrees with the experiment to within 30%. Further
detector optimization is possible through film texture, power size, moderator
box and gas. The estimated production cost per unit is less than 3k US\$ and
the technology is thus suitable for deployment in large numbers
The uniqueness of flow in probing the aggregation behavior of clinically relevant antibodies
The development of therapeutic monoclonal antibodies (mAbs) can be hindered by their tendency to aggregate throughout their lifetime, which can illicit immunogenic responses and render mAb manufacturing unfeasible. Consequently, there is a need to identify mAbs with desirable thermodynamic stability, solubility, and lack of selfâassociation. These behaviors are assessed using an array of in silico and in vitro assays, as no single assay can predict aggregation and developability. We have developed an extensional and shear flow device (EFD), which subjects proteins to defined hydrodynamic forces which mimic those experienced in bioprocessing. Here, we utilize the EFD to explore the aggregation propensity of 33 IgG1 mAbs, whose variable domains are derived from clinical antibodies. Using submilligram quantities of material per replicate, wideâranging EFDâinduced aggregation (9â81% protein in pellet) was observed for these mAbs, highlighting the EFD as a sensitive method to assess aggregation propensity. By comparing the EFDâinduced aggregation data to those obtained previously from 12 other biophysical assays, we show that the EFD provides distinct information compared with current measures of adverse biophysical behavior. Assessing a candidate's liability to hydrodynamic force thus adds novel insight into the rational selection of developable mAbs that complements other assays
Statistical Consequences of Devroye Inequality for Processes. Applications to a Class of Non-Uniformly Hyperbolic Dynamical Systems
In this paper, we apply Devroye inequality to study various statistical
estimators and fluctuations of observables for processes. Most of these
observables are suggested by dynamical systems. These applications concern the
co-variance function, the integrated periodogram, the correlation dimension,
the kernel density estimator, the speed of convergence of empirical measure,
the shadowing property and the almost-sure central limit theorem. We proved in
\cite{CCS} that Devroye inequality holds for a class of non-uniformly
hyperbolic dynamical systems introduced in \cite{young}. In the second appendix
we prove that, if the decay of correlations holds with a common rate for all
pairs of functions, then it holds uniformly in the function spaces. In the last
appendix we prove that for the subclass of one-dimensional systems studied in
\cite{young} the density of the absolutely continuous invariant measure belongs
to a Besov space.Comment: 33 pages; companion of the paper math.DS/0412166; corrected version;
to appear in Nonlinearit
Mechanical Strength of 17 134 Model Proteins and Cysteine Slipknots
A new theoretical survey of proteins' resistance to constant speed stretching
is performed for a set of 17 134 proteins as described by a structure-based
model. The proteins selected have no gaps in their structure determination and
consist of no more than 250 amino acids. Our previous studies have dealt with
7510 proteins of no more than 150 amino acids. The proteins are ranked
according to the strength of the resistance. Most of the predicted top-strength
proteins have not yet been studied experimentally. Architectures and folds
which are likely to yield large forces are identified. New types of potent
force clamps are discovered. They involve disulphide bridges and, in
particular, cysteine slipknots. An effective energy parameter of the model is
estimated by comparing the theoretical data on characteristic forces to the
corresponding experimental values combined with an extrapolation of the
theoretical data to the experimental pulling speeds. These studies provide
guidance for future experiments on single molecule manipulation and should lead
to selection of proteins for applications. A new class of proteins, involving
cystein slipknots, is identified as one that is expected to lead to the
strongest force clamps known. This class is characterized through molecular
dynamics simulations.Comment: 40 pages, 13 PostScript figure
Cooperative folding of intrinsically disordered domains drives assembly of a strong elongated protein.
Bacteria exploit surface proteins to adhere to other bacteria, surfaces and host cells. Such proteins need to project away from the bacterial surface and resist significant mechanical forces. SasG is a protein that forms extended fibrils on the surface of Staphylococcus aureus and promotes host adherence and biofilm formation. Here we show that although monomeric and lacking covalent cross-links, SasG maintains a highly extended conformation in solution. This extension is mediated through obligate folding cooperativity of the intrinsically disordered E domains that couple non-adjacent G5 domains thermodynamically, forming interfaces that are more stable than the domains themselves. Thus, counterintuitively, the elongation of the protein appears to be dependent on the inherent instability of its domains. The remarkable mechanical strength of SasG arises from tandemly arrayed 'clamp' motifs within the folded domains. Our findings reveal an elegant minimal solution for the assembly of monomeric mechano-resistant tethers of variable length.This research was supported by Biotechnology and Biological Research Council Grants BB/J006459/1 (D.T.G. and J.C.), BB/J005029/1 (F.W. and J.R.P), BB/G019452/1 (O.E.F and D.J.B) and BB/G020671/1 (C.G.B. and J.R.P.). H.K.H.F. is supported by a studentship from a Wellcome Trust 4-year PhD programme (WT095024MA). C.M.J. is supported by the German Federal Ministry of Education and Research (BMBF), grant BIOSCAT (contract N° 05K12YE1). J.C. is a Wellcome Trust Senior Research
Fellow (WT/095195). J.R.P holds a British Heart Foundation Senior Basic Science Fellowship (FS/12/36/29588). The authors acknowledge the use of EMBL SAXS beamline P12 at Petra-3 (DESY, Hamburg, Germany) and Maxim Petoukhov (EMBL) for providing a modified version of SASR EF. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under BioStruct-X (grant agreement N°283570). The authors would like to thank Diamond Light Source for beamtime (proposal mx-7864) and Johan Turkenburg and Sam Hart for assistance with crystal testing and data collection.This is the final version of the article. It first appeared from NPG via http://dx.doi.org/10.1038/ncomms827
Sampling constrained probability distributions using Spherical Augmentation
Statistical models with constrained probability distributions are abundant in
machine learning. Some examples include regression models with norm constraints
(e.g., Lasso), probit, many copula models, and latent Dirichlet allocation
(LDA). Bayesian inference involving probability distributions confined to
constrained domains could be quite challenging for commonly used sampling
algorithms. In this paper, we propose a novel augmentation technique that
handles a wide range of constraints by mapping the constrained domain to a
sphere in the augmented space. By moving freely on the surface of this sphere,
sampling algorithms handle constraints implicitly and generate proposals that
remain within boundaries when mapped back to the original space. Our proposed
method, called {Spherical Augmentation}, provides a mathematically natural and
computationally efficient framework for sampling from constrained probability
distributions. We show the advantages of our method over state-of-the-art
sampling algorithms, such as exact Hamiltonian Monte Carlo, using several
examples including truncated Gaussian distributions, Bayesian Lasso, Bayesian
bridge regression, reconstruction of quantized stationary Gaussian process, and
LDA for topic modeling.Comment: 41 pages, 13 figure
Cooperative folding of intrinsically disordered domains drives assembly of a strong elongated protein
Bacteria exploit surface proteins to adhere to other bacteria, surfaces and host cells. Such proteins need to project away from the bacterial surface and resist significant mechanical forces. SasG is a protein that forms extended fibrils on the surface of Staphylococcus aureus and promotes host adherence and biofilm formation. Here we show that although monomeric and lacking covalent cross-links, SasG maintains a highly extended conformation in solution. This extension is mediated through obligate folding cooperativity of the intrinsically disordered E domains that couple non-adjacent G5 domains thermodynamically, forming interfaces that are more stable than the domains themselves. Thus, counterintuitively, the elongation of the protein appears to be dependent on the inherent instability of its domains. The remarkable mechanical strength of SasG arises from tandemly arrayed 'clamp' motifs within the folded domains. Our findings reveal an elegant minimal solution for the assembly of monomeric mechano-resistant tethers of variable length
BSDB: the biomolecule stretching database
We describe the Biomolecule Stretching Data Base that has been recently set up at http://www.ifpan.edu.pl/BSDB/. It provides information about mechanostability of proteins. Its core is based on simulations of stretching of 17â134 proteins within a structure-based model. The primary information is about the heights of the maximal force peaks, the forceâdisplacement patterns, and the sequencing of the contact-rupturing events. We also summarize the possible types of the mechanical clamps, i.e. the motifs which are responsible for a protein's resistance to stretching
Discrimination of water quality monitoring sites in River Vouga using a mixed-effect state space model
The surface water quality monitoring is an important concern of public organizations due to its relevance to the public health. Statistical methods are taken as consistent and essential tools in the monitoring procedures in order to prevent and identify environmental problems. This work presents the study case of the hydrological basin of the river Vouga, in Portugal. The main goal is discriminate the water monitoring sites using the monthly dissolved oxygen concentration dataset between January 2002 and May 2013. This is achieved through the extraction of trend and seasonal components in a linear mixed-effect state space model. The parameters estimation is performed with both maximum likelihood method and distribution-free estimators in a two-step procedure. The application of the Kalman smoother algorithm allows to obtain predictions of the structural components as trend and seasonality. The water monitoring sites are discriminated through the structural components by a hierarchical agglomerative clustering procedure. This procedure identified different homogenous groups relatively to the trend and seasonality components and some characteristics of the hydrological basin are presented in order to support the results
Probabilistic change detection and visualization methods for the assessment of temporal stability in biomedical data quality
The final publication is available at Springer via http://dx.doi.org/DOI 10.1007/s10618-014-0378-6. Published online.Knowledge discovery on biomedical data can be based on on-line, data-stream analyses, or using retrospective, timestamped, off-line datasets. In both cases, changes in the processes that generate data or in their quality features through time may hinder either the knowledge discovery process or the generalization of past knowledge. These problems can be seen as a lack of data temporal stability. This work establishes the temporal stability as a data quality dimension and proposes new methods for its assessment based on a probabilistic framework. Concretely, methods are proposed for (1) monitoring changes, and (2) characterizing changes, trends and detecting temporal subgroups. First, a probabilistic change detection algorithm is proposed based on the Statistical Process Control of the posterior Beta distribution of the JensenâShannon distance, with a memoryless forgetting mechanism. This algorithm (PDF-SPC) classifies the degree of current change in three states: In-Control, Warning, and Out-of-Control. Second, a novel method is proposed to visualize and characterize the temporal changes of data based on the projection of a non-parametric information-geometric statistical manifold of time windows. This projection facilitates the exploration of temporal trends using the proposed IGT-plot and, by means of unsupervised learning methods, discovering conceptually-related temporal subgroups. Methods are evaluated using real and simulated data based on the National Hospital Discharge Survey (NHDS) dataset.The work by C Saez has been supported by an Erasmus Lifelong Learning Programme 2013 Grant. This work has been supported by own IBIME funds. The authors thank Dr. Gregor Stiglic, from the Univeristy of Maribor, Slovenia, for his support on the NHDS data.SĂĄez Silvestre, C.; Pereira Rodrigues, P.; Gama, J.; Robles Viejo, M.; GarcĂa GĂłmez, JM. (2014). Probabilistic change detection and visualization methods for the assessment of temporal stability in biomedical data quality. Data Mining and Knowledge Discovery. 28:1-1. doi:10.1007/s10618-014-0378-6S1128Aggarwal C (2003) A framework for diagnosing changes in evolving data streams. In Proceedings of the International Conference on Management of Data ACM SIGMOD, pp 575â586Amari SI, Nagaoka H (2007) Methods of information geometry. American Mathematical Society, Providence, RIArias E (2014) United states life tables, 2009. Natl Vital Statist Rep 62(7): 1â63Aspden P, Corrigan JM, Wolcott J, Erickson SM (2004) Patient safety: achieving a new standard for care. Committee on data standards for patient safety. The National Academies Press, Washington, DCBasseville M, Nikiforov IV (1993) Detection of abrupt changes: theory and application. Prentice-Hall Inc, Upper Saddle River, NJBorg I, Groenen PJF (2010) Modern multidimensional scaling: theory and applications. Springer, BerlinBowman AW, Azzalini A (1997) Applied smoothing techniques for data analysis: the Kernel approach with S-plus illustrations (Oxford statistical science series). Oxford University Press, OxfordBrandes U, Pich C (2007) Eigensolver methods for progressive multidimensional scaling of large data. In: Kaufmann M, Wagner D (eds) Graph drawing. Lecture notes in computer science, vol 4372. Springer, Berlin, pp 42â53Brockwell P, Davis R (2009) Time series: theory and methods., Springer series in statisticsSpringer, BerlinCesario SK (2002) The âChristmas Effectâ and other biometeorologic influences on childbearing and the health of women. J Obstet Gynecol Neonatal Nurs 31(5):526â535Chakrabarti K, Garofalakis M, Rastogi R, Shim K (2001) Approximate query processing using wavelets. VLDB J 10(2â3):199â223Cruz-Correia RJ, Pereira Rodrigues P, Freitas A, Canario Almeida F, Chen R, Costa-Pereira A (2010) Data quality and integration issues in electronic health records. Information discovery on electronic health records, pp 55â96CsiszĂĄr I (1967) Information-type measures of difference of probability distributions and indirect observations. Studia Sci Math Hungar 2:299â318Dasu T, Krishnan S, Lin D, Venkatasubramanian S, Yi K (2009) Change (detection) you can believe. In: Finding distributional shifts in data streams. In: Proceedings of the 8th international symposium on intelligent data analysis: advances in intelligent data analysis VIII, IDA â09. Springer, Berlin, pp 21â34Endres D, Schindelin J (2003) A new metric for probability distributions. IEEE Trans Inform Theory 49(7):1858â1860Gama J, Gaber MM (2007) Learning from data streams: processing techniques in sensor networks. Springer, BerlinGama J, Medas P, Castillo G, Rodrigues P (2004) Learning with drift detection. In: Bazzan A, Labidi S (eds) Advances in artificial intelligenceâSBIA 2004., Lecture notes in computer scienceSpringer, Berlin, pp 286â295Gama J (2010) Knowledge discovery from data streams, 1st edn. Chapman & Hall, LondonGehrke J, Korn F, Srivastava D (2001) On computing correlated aggregates over continual data streams. SIGMOD Rec 30(2):13â24Guha S, Shim K, Woo J (2004) Rehist: relative error histogram construction algorithms. In: Proceedings of the thirtieth international conference on very large data bases VLDB, pp 300â311Han J, Kamber M, Pei J (2012) Data mining: concepts and techniques. Morgan Kaufmann, Elsevier, Burlington, MAHowden LM, Meyer JA, (2011) Age and sex composition. 2010 Census Briefs US Department of Commerce. Economics and Statistics Administration, US Census BureauHrovat G, Stiglic G, Kokol P, Ojstersek M (2014) Contrasting temporal trend discovery for large healthcare databases. Comput Methods Program Biomed 113(1):251â257Keim DA (2000) Designing pixel-oriented visualization techniques: theory and applications. IEEE Trans Vis Comput Graph 6(1):59â78Kifer D, Ben-David S, Gehrke J (2004) Detecting change in data streams. In: Proceedings of the thirtieth international conference on Very large data bases, VLDB Endowment, VLDB â04, vol 30, pp 180â191Klinkenberg R, Renz I (1998) Adaptive information filtering: Learning in the presence of concept drifts. In: Workshop notes of the ICML/AAAI-98 workshop learning for text categorization. AAAI Press, Menlo Park, pp 33â40Kohonen T (1982) Self-organized formation of topologically correct feature maps. Biolog Cybern 43(1):59â69Lin J (1991) Divergence measures based on the Shannon entropy. IEEE Trans Inform Theory 37:145â151Mitchell TM, Caruana R, Freitag D, McDermott J, Zabowski D (1994) Experience with a learning personal assistant. Commun ACM 37(7):80â91Mouss H, Mouss D, Mouss N, Sefouhi L (2004) Test of page-hinckley, an approach for fault detection in an agro-alimentary production system. In: Proceedings of the 5th Asian Control Conference, vol 2, pp 815â818National Research Council (2011) Explaining different levels of longevity in high-income countries. The National Academies Press, Washington, DCNHDS (2010) United states department of health and human services. Centers for disease control and prevention. National center for health statistics. National hospital discharge survey codebookNHDS (2014) National Center for Health Statistics, National Hospital Discharge Survey (NHDS) data, US Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics, Hyattsville, Maryland. http://www.cdc.gov/nchs/nhds.htmPapadimitriou S, Sun J, Faloutsos C (2005) Streaming pattern discovery in multiple time-series. In: Proceedings of the 31st international conference on very large data bases, VLDB endowment, VLDB â05, pp 697â708Parzen E (1962) On estimation of a probability density function and mode. Ann Math Statist 33(3):1065â1076Ramsay JO, Silverman BW (2005) Functional data analysis. Springer, New YorkRodrigues P, Correia R (2013) Streaming virtual patient records. In: Krempl G, Zliobaite I, Wang Y, Forman G (eds) Real-world challenges for data stream mining. University Magdeburg, Otto-von-Guericke, pp 34â37Rodrigues P, Gama J, Pedroso J (2008) Hierarchical clustering of time-series data streams. IEEE Trans Knowl Data Eng 20(5):615â627Rodrigues PP, Gama Ja (2010) A simple dense pixel visualization for mobile sensor data mining. In: Proceedings of the second international conference on knowledge discovery from sensor data, sensor-KDDâ08. Springer, Berlin, pp 175â189Rodrigues PP, Gama J, SebastiĂŁ o R (2010) Memoryless fading windows in ubiquitous settings. In Proceedings of ubiquitous data mining (UDM) workshop in conjunction with the 19th european conference on artificial intelligenceâECAI 2010, pp 27â32Rodrigues PP, SebastiĂŁ o R, Santos CC (2011) Improving cardiotocography monitoring: a memory-less stream learning approach. In: Proceedings of the learning from medical data streams workshop. Bled, SloveniaRubner Y, Tomasi C, Guibas L (2000) The earth moverâs distance as a metric for image retrieval. Int J Comput Vision 40(2):99â121SebastiĂŁo R, Gama J (2009) A study on change detection methods. In: 4th Portuguese conference on artificial intelligenceSebastiĂŁo R, Gama J, Rodrigues P, Bernardes J (2010) Monitoring incremental histogram distribution for change detection in data streams. In: Gaber M, Vatsavai R, Omitaomu O, Gama J, Chawla N, Ganguly A (eds) Knowledge discovery from sensor data, vol 5840., Lecture notes in computer science. Springer, Berlin, pp 25â42SebastiĂŁo R, Silva M, Rabiço R, Gama J, Mendonça T (2013) Real-time algorithm for changes detection in depth of anesthesia signals. Evol Syst 4(1):3â12SĂĄez C, MartĂnez-Miranda J, Robles M, GarcĂa-GĂłmez JM (2012) O rganizing data quality assessment of shifting biomedical data. Stud Health Technol Inform 180:721â725SĂĄez C, Robles M, GarcĂa-GĂłmez JM (2013) Comparative study of probability distribution distances to define a metric for the stability of multi-source biomedical research data. In: Engineering in medicine and biology society (EMBC), 2013 35th annual international conference of the IEEE, pp 3226â3229SĂĄez C, Robles M, GarcĂa-GĂłmez JM (2014) Stability metrics for multi-source biomedical data based on simplicial projections from probability distribution distances. Statist Method Med Res (forthcoming)Shewhart WA, Deming WE (1939) Statistical method from the viewpoint of quality control. Graduate School of the Department of Agriculture, Washington, DCShimazaki H, Shinomoto S (2010) Kernel bandwidth optimization in spike rate estimation. J Comput Neurosci 29(1â2):171â182Solberg LI, Engebretson KI, Sperl-Hillen JM, Hroscikoski MC, OâConnor PJ (2006) Are claims data accurate enough to identify patients for performance measures or quality improvement? the case of diabetes, heart disease, and depression. Am J Med Qual 21(4):238â245Spiliopoulou M, Ntoutsi I, Theodoridis Y, Schult R (2006) monic: modeling and monitoring cluster transitions. In: Proceedings of the 12th ACm SIGKDD international conference on knowledge discovery and data mining, KDD â06. ACm, New York, NY, pp 706â711Stiglic G, Kokol P (2011) Interpretability of sudden concept drift in medical informatics domain. In Proceedings of the 2010 IEEE international conference on data mining workshops, pp 609â613Torgerson W (1952) Multidimensional scaling: I theory and method. Psychometrika 17(4):401â419Wang RY, Strong DM (1996) Beyond accuracy: what data quality means to data consumers. J Manage Inform Syst 12(4):5â33Weiskopf NG, Weng C (2013) M ethods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc 20(1):144â151Wellings K, Macdowall W, Catchpole M, Goodrich J (1999) Seasonal variations in sexual activity and their implications for sexual health promotion. J R Soc Med 92(2):60â64Westgard JO, Barry PL (2010) Basic QC practices: training in statistical quality control for medical laboratories. Westgard Quality Corporation, Madison, WIWidmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23(1):69â10
- âŚ