Search CORE

15,789 research outputs found

Farm-nonfarm linkages in rural sub-Saharan Africa

Author: Brown James
Haggblade Steven
Hazell Peter B.
Publication venue
Publication date
Field of study

This paper is an accumulation, over the past 25 years, of a body of detailed work examining the structure of Africa's rural, nonfarm economy. First, it systematically reviews empirical evidence on the nature and magnitude of the African rural, nonfarm economy. It then explores differences across locality and size, across countries and over time, in an effort to assess likely patterns of growth. A subsequent review of key production and consumption parameters allows an estimate of the magnitude of the agricultural growth multipliers in Africa. The paper concludes with a brief discussion of policies and programs that will be necessary if farm-nonfarm growth linkages are to achieve their full potential.Banks&Banking Reform,Agricultural Knowledge&Information Systems,Municipal Financial Management,Environmental Economics&Policies,Economic Theory&Research

Research Papers in Economics

Iterative Random Forests to detect predictive and stable high-order interactions

Author: Basu Sumanta
Brown James B.
Kumbier Karl
Yu Bin
Publication venue
Publication date: 23/12/2017
Field of study

Genomics has revolutionized biology, enabling the interrogation of whole transcriptomes, genome-wide binding sites for proteins, and many other molecular processes. However, individual genomic assays measure elements that interact in vivo as components of larger molecular machines. Understanding how these high-order interactions drive gene expression presents a substantial statistical challenge. Building on Random Forests (RF), Random Intersection Trees (RITs), and through extensive, biologically inspired simulations, we developed the iterative Random Forest algorithm (iRF). iRF trains a feature-weighted ensemble of decision trees to detect stable, high-order interactions with same order of computational cost as RF. We demonstrate the utility of iRF for high-order interaction discovery in two prediction problems: enhancer activity in the early Drosophila embryo and alternative splicing of primary transcripts in human derived cell lines. In Drosophila, among the 20 pairwise transcription factor interactions iRF identifies as stable (returned in more than half of bootstrap replicates), 80% have been previously reported as physical interactions. Moreover, novel third-order interactions, e.g. between Zelda (Zld), Giant (Gt), and Twist (Twi), suggest high-order relationships that are candidates for follow-up experiments. In human-derived cells, iRF re-discovered a central role of H3K36me3 in chromatin-mediated splicing regulation, and identified novel 5th and 6th order interactions, indicative of multi-valent nucleosomes with specific roles in splicing regulation. By decoupling the order of interactions from the computational cost of identification, iRF opens new avenues of inquiry into the molecular mechanisms underlying genome biology

arXiv.org e-Print Archive

Crossref

University of Birmingham Research Portal

eScholarship - University of California

Measuring reproducibility of high-throughput experiments

Author: Bickel Peter J.
Brown James B.
Huang Haiyan
Li Qunhua
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 21/10/2011
Field of study

Reproducibility is essential to reliable scientific discovery in high-throughput experiments. In this work we propose a unified approach to measure the reproducibility of findings identified from replicate experiments and identify putative discoveries using reproducibility. Unlike the usual scalar measures of reproducibility, our approach creates a curve, which quantitatively assesses when the findings are no longer consistent across replicates. Our curve is fitted by a copula mixture model, from which we derive a quantitative reproducibility score, which we call the "irreproducible discovery rate" (IDR) analogous to the FDR. This score can be computed at each set of paired replicate ranks and permits the principled setting of thresholds both for assessing reproducibility and combining replicates. Since our approach permits an arbitrary scale for each replicate, it provides useful descriptive measures in a wide variety of situations to be explored. We study the performance of the algorithm using simulations and give a heuristic analysis of its theoretical properties. We demonstrate the effectiveness of our method in a ChIP-seq experiment.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS466 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Refining interaction search through signed iterative Random Forests

Author: Basu Sumanta
Brown James B.
Celniker Susan
Kumbier Karl
Yu Bin
Publication venue
Publication date: 16/10/2018
Field of study

Advances in supervised learning have enabled accurate prediction in biological systems governed by complex interactions among biomolecules. However, state-of-the-art predictive algorithms are typically black-boxes, learning statistical interactions that are difficult to translate into testable hypotheses. The iterative Random Forest algorithm took a step towards bridging this gap by providing a computationally tractable procedure to identify the stable, high-order feature interactions that drive the predictive accuracy of Random Forests (RF). Here we refine the interactions identified by iRF to explicitly map responses as a function of interacting features. Our method, signed iRF, describes subsets of rules that frequently occur on RF decision paths. We refer to these rule subsets as signed interactions. Signed interactions share not only the same set of interacting features but also exhibit similar thresholding behavior, and thus describe a consistent functional relationship between interacting features and responses. We describe stable and predictive importance metrics to rank signed interactions. For each SPIM, we define null importance metrics that characterize its expected behavior under known structure. We evaluate our proposed approach in biologically inspired simulations and two case studies: predicting enhancer activity and spatial gene expression patterns. In the case of enhancer activity, s-iRF recovers one of the few experimentally validated high-order interactions and suggests novel enhancer elements where this interaction may be active. In the case of spatial gene expression patterns, s-iRF recovers all 11 reported links in the gap gene network. By refining the process of interaction recovery, our approach has the potential to guide mechanistic inquiry into systems whose scale and complexity is beyond human comprehension

arXiv.org e-Print Archive

Interferometric data for a shock-wave/boundary-layer interaction

Author: Brown James L.
Dunagan Stephen E.
Miles John B.
Publication venue
Publication date
Field of study

An experimental study of the axisymmetric shock-wave / boundary-layer strong interaction flow generated in the vicinity of a cylinder-cone intersection was conducted. The study data are useful in the documentation and understanding of compressible turbulent strong interaction flows, and are part of a more general effort to improve turbulence modeling for compressible two- and three-dimensional strong viscous/inviscid interactions. The nominal free stream Mach number was 2.85. Tunnel total pressures of 1.7 and 3.4 atm provided Reynolds number values of 18 x 10(6) and 36 x 10(6) based on model length. Three cone angles were studied giving negligible, incipient, and large scale flow separation. The initial cylinder boundary layer upstream of the interaction had a thickness of 1.0 cm. The subsonic layer of the cylinder boundary layer was quite thin, and in all cases, the shock wave penetrated a significant portion of the boundary layer. Owing to the thickness of the cylinder boundary layer, considerable structural detail was resolved for the three shock-wave / boundary-layer interaction cases considered. The primary emphasis was on the application of the holographic interferometry technique. The density field was deduced from an interferometric analysis based on the Able transform. Supporting data were obtained using a 2-D laser velocimeter, as well as mean wall pressure and oil flow measurements. The attached flow case was observed to be steady, while the separated cases exhibited shock unsteadiness. Comparisons with Navier-Stokes computations using a two-equation turbulence model are presented

NASA Technical Reports Server

Recommended from our members

*-DCC: A platform to collect, annotate, and explore a large variety of sequencing experiments.

Author: Brown James B
Daub Carsten O
Hörtenhuber Matthias
Mukarram Abdul K
Stoiber Marcus H
Publication venue: eScholarship, University of California
Publication date: 01/03/2020
Field of study

BackgroundOver the past few years the variety of experimental designs and protocols for sequencing experiments increased greatly. To ensure the wide usability of the produced data beyond an individual project, rich and systematic annotation of the underlying experiments is crucial.FindingsWe first developed an annotation structure that captures the overall experimental design as well as the relevant details of the steps from the biological sample to the library preparation, the sequencing procedure, and the sequencing and processed files. Through various design features, such as controlled vocabularies and different field requirements, we ensured a high annotation quality, comparability, and ease of annotation. The structure can be easily adapted to a large variety of species. We then implemented the annotation strategy in a user-hosted web platform with data import, query, and export functionality.ConclusionsWe present here an annotation structure and user-hosted platform for sequencing experiment data, suitable for lab-internal documentation, collaborations, and large-scale annotation efforts

eScholarship - University of California