1,748 research outputs found
Towards Reverse Engineering of PDF Documents
summary:We present a progress report on our ongoing project of reverse engineering scientific PDF documents. The aim is to obtain mathematical markup that can be used as source for regenerating a document that resembles the original as closely as possible. This source can then be a basis for further document processing. Our current tool uses specialised PDF extraction together with image analysis to produce near perfect input for parsing mathematical formula. Applying a linear grammar and specific drivers for each output format to this input, we can produce an accurate reproduction of formulae when presented with their coordinates. In this paper we will show how this information can be exploited to discover the locations of both inline and display formulae, and also to perform rudimentary layout analysis of the whole document, identifying structures such as headings and paragraphs
Regular Expression Matching and Operational Semantics
Many programming languages and tools, ranging from grep to the Java String
library, contain regular expression matchers. Rather than first translating a
regular expression into a deterministic finite automaton, such implementations
typically match the regular expression on the fly. Thus they can be seen as
virtual machines interpreting the regular expression much as if it were a
program with some non-deterministic constructs such as the Kleene star. We
formalize this implementation technique for regular expression matching using
operational semantics. Specifically, we derive a series of abstract machines,
moving from the abstract definition of matching to increasingly realistic
machines. First a continuation is added to the operational semantics to
describe what remains to be matched after the current expression. Next, we
represent the expression as a data structure using pointers, which enables
redundant searches to be eliminated via testing for pointer equality. From
there, we arrive both at Thompson's lockstep construction and a machine that
performs some operations in parallel, suitable for implementation on a large
number of cores, such as a GPU. We formalize the parallel machine using process
algebra and report some preliminary experiments with an implementation on a
graphics processor using CUDA.Comment: In Proceedings SOS 2011, arXiv:1108.279
Instability in the Molecular Dynamics Step of Hybrid Monte Carlo in Dynamical Fermion Lattice QCD Simulations
We investigate instability and reversibility within Hybrid Monte Carlo simulations using a non-perturbatively improved Wilson action. We demonstrate the onset of instability as tolerance parameters and molecular dynamics step sizes are varied. We compare these findings with theoretical expectations and present limits on simulation parameters within which a stable and reversible algorithm is obtained for physically relevant simulations. Results of optimisation experiments with respect to tolerance prarameters are also presented
Instability in the molecular dynamics step of a hybrid Monte Carlo algorithm in dynamical fermion lattice QCD simulations
We investigate instability and reversibility within Hybrid Monte Carlo
simulations using a non-perturbatively improved Wilson action. We demonstrate
the onset of instability as tolerance parameters and molecular dynamics step
sizes are varied. We compare these findings with theoretical expectations and
present limits on simulation parameters within which a stable and reversible
algorithm is obtained for physically relevant simulations. Results of
optimisation experiments with respect to tolerance prarameters are also
presented.Comment: RevTeX, Some results here were presented at Vielat 99, Vienna,
Austria, Sept 1999 22 Pages, 10 figures, to be submitted to PR
Development and Implementation of a High Throughput Screen for the Human Sperm-Specific Isoform of Glyceraldehyde 3-Phosphate Dehydrogenase (GAPDHS)
Glycolytic isozymes that are restricted to the male germline are potential targets for the development of reversible, non-hormonal male contraceptives. GAPDHS, the sperm-specific isoform of glyceraldehyde-3-phosphate dehydrogenase, is an essential enzyme for glycolysis making it an attractive target for rational drug design. Toward this goal, we have optimized and validated a high-throughput spectrophotometric assay for GAPDHS in 384-well format. The assay was stable over time and tolerant to DMSO. Whole plate validation experiments yielded Zâ values >0.8 indicating a robust assay for HTS. Two compounds were identified and confirmed from a test screen of the Prestwick collection. This assay was used to screen a diverse chemical library and identified fourteen small molecules that modulated the activity of recombinant purified GAPDHS with confirmed IC50 values ranging from 1.8 to 42 ”M. These compounds may provide useful scaffolds as molecular tools to probe the role of GAPDHS in sperm motility and long term to develop potent and selective GAPDHS inhibitors leading to novel contraceptive agents
Recombinant human sperm-specific glyceraldehyde-3-phosphate dehydrogenase (GAPDHS) is expressed at high yield as an active homotetramer in baculovirus-infected insect cells
The sperm-specific glyceraldehyde-3-phosphate dehydrogenase (GAPDHS) isoform is a promising contraceptive target because it is specific to male germ cells, essential for sperm motility and male fertility, and well suited to pharmacological inhibition. However, GAPDHS is difficult to isolate from native sources and recombinant expression frequently results in high production of insoluble enzyme. We chose to use the Bac-to-Bac baculovirus-insect cell system to express a His-tagged form of human GAPDHS (Hu his-GAPDHS) lacking the proline-rich N-terminal sequence. This recombinant Hu his-GAPDHS was successfully produced in Spodoptera frugiperda 9 (Sf9) cells by infection with recombinant virus as a soluble, enzymatically active form in high yield, >35mg/L culture. Biochemical characterization of the purified enzyme by mass spectrometry and size exclusion chromatography confirmed the presence of the tetrameric form. Further characterization by peptide ion matching mass spectrometry and Edman sequencing showed that unlike the mixed tetramer forms produced in bacterial expression systems, human his-GAPDHS expressed in baculovirus-infected insect cells is homotetrameric. The ability to express and purify active human GAPDHS as homotetramers in high amounts will greatly aid in drug discovery efforts targeting this enzyme for discovery of novel contraceptives and three compounds were identified as inhibitors of Hu his-GAPDHS from a pilot screen of 1120 FDA-approved compounds
The impact of donor and recipient common clinical and genetic variation on estimated glomerular filtration rate in a European renal transplant population
Genetic variation across the HLA is known to influence renalâtransplant outcome. However, the impact of genetic variation beyond the HLA is less clear. We tested the association of common genetic variation and clinical characteristics, from both the donor and recipient, with postâtransplant eGFR at different timeâpoints, out to 5âyears postâtransplantation.
We conducted GWAS metaâanalyses across 10,844 donors and recipients from five European ancestry cohorts. We also analysed the impact of polygenic risk scores (PRS), calculated using genetic variants associated with nonâtransplant eGFR, on postâtransplant eGFR.
PRS calculated using the recipient genotype alone, as well as combined donor and recipient genotypes were significantly associated with eGFR at 1âyear postâtransplant. 32% of the variability in eGFR at 1âyear postâtransplant was explained by our model containing clinical covariates (including weights for death/graftâfailure), principal components and combined donorârecipient PRS, with 0.3% contributed by the PRS. No individual genetic variant was significantly associated with eGFR postâtransplant in the GWAS.
This is the first study to examine PRS, composed of variants that impact kidney function in the general population, in a postâtransplant context. Despite PRS being a significant predictor of eGFR postâtransplant, the effect size of common genetic factors is limited compared to clinical variables
Recommended from our members
Spectatorsâ Negotiations of Risk, Masculinity and Performative Mobilities at the TT Races
This paper explores the particular assemblage of place, event and individual identity performances that occur each year in the Isle of Man in and through the TT (Tourist Trophy) motorcycle races. These road races are associated with a high degree of risk for the racers and the confluence of over 30,000 visitors and 10,000 motorcycles also presents potential risks for spectators and residents alike. Both motorcycling and risk-taking have been associated with particular forms of masculinity, notably hegemonic, working class and youthful masculinities. Using detailed surveys of spectators we argue that the TT races, while undoubtedly
dominated by men and predicated on a cultural privileging of speed and skill, are grounded in varying combinations of determinate and reflexive attitudes to risk, reflecting the performance of a variety of gendered, âbikerâ and wider identity-based positionalities. Findings also highlight a particular inter-relation of mobilities and place identities at the TT races and bring to light the highly significant and under-researched embodied, performative and emotional mobilities of spectators. The conceptual and methodological importance of (a) situated research of both mobilities and gender in specific place-temporalities and (b) wider surveys of motorcyclists to complement ethnographic studies of small cohorts are also stressed
Application of Diffusion Tensor Imaging Parameters to Detect Change in Longitudinal Studies in Cerebral Small Vessel Disease.
Cerebral small vessel disease (SVD) is the major cause of vascular cognitive impairment, resulting in significant disability and reduced quality of life. Cognitive tests have been shown to be insensitive to change in longitudinal studies and, therefore, sensitive surrogate markers are needed to monitor disease progression and assess treatment effects in clinical trials. Diffusion tensor imaging (DTI) is thought to offer great potential in this regard. Sensitivity of the various parameters that can be derived from DTI is however unknown. We aimed to evaluate the differential sensitivity of DTI markers to detect SVD progression, and to estimate sample sizes required to assess therapeutic interventions aimed at halting decline based on DTI data. We investigated 99 patients with symptomatic SVD, defined as clinical lacunar syndrome with MRI confirmation of a corresponding infarct as well as confluent white matter hyperintensities over a 3 year follow-up period. We evaluated change in DTI histogram parameters using linear mixed effect models and calculated sample size estimates. Over a three-year follow-up period we observed a decline in fractional anisotropy and increase in diffusivity in white matter tissue and most parameters changed significantly. Mean diffusivity peak height was the most sensitive marker for SVD progression as it had the smallest sample size estimate. This suggests disease progression can be monitored sensitively using DTI histogram analysis and confirms DTI's potential as surrogate marker for SVD
Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context
Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts
- âŠ