1,693 research outputs found

    Towards Reverse Engineering of PDF Documents

    Get PDF
    summary:We present a progress report on our ongoing project of reverse engineering scientific PDF documents. The aim is to obtain mathematical markup that can be used as source for regenerating a document that resembles the original as closely as possible. This source can then be a basis for further document processing. Our current tool uses specialised PDF extraction together with image analysis to produce near perfect input for parsing mathematical formula. Applying a linear grammar and specific drivers for each output format to this input, we can produce an accurate reproduction of formulae when presented with their coordinates. In this paper we will show how this information can be exploited to discover the locations of both inline and display formulae, and also to perform rudimentary layout analysis of the whole document, identifying structures such as headings and paragraphs

    Regular Expression Matching and Operational Semantics

    Full text link
    Many programming languages and tools, ranging from grep to the Java String library, contain regular expression matchers. Rather than first translating a regular expression into a deterministic finite automaton, such implementations typically match the regular expression on the fly. Thus they can be seen as virtual machines interpreting the regular expression much as if it were a program with some non-deterministic constructs such as the Kleene star. We formalize this implementation technique for regular expression matching using operational semantics. Specifically, we derive a series of abstract machines, moving from the abstract definition of matching to increasingly realistic machines. First a continuation is added to the operational semantics to describe what remains to be matched after the current expression. Next, we represent the expression as a data structure using pointers, which enables redundant searches to be eliminated via testing for pointer equality. From there, we arrive both at Thompson's lockstep construction and a machine that performs some operations in parallel, suitable for implementation on a large number of cores, such as a GPU. We formalize the parallel machine using process algebra and report some preliminary experiments with an implementation on a graphics processor using CUDA.Comment: In Proceedings SOS 2011, arXiv:1108.279

    Instability in the Molecular Dynamics Step of Hybrid Monte Carlo in Dynamical Fermion Lattice QCD Simulations

    Get PDF
    We investigate instability and reversibility within Hybrid Monte Carlo simulations using a non-perturbatively improved Wilson action. We demonstrate the onset of instability as tolerance parameters and molecular dynamics step sizes are varied. We compare these findings with theoretical expectations and present limits on simulation parameters within which a stable and reversible algorithm is obtained for physically relevant simulations. Results of optimisation experiments with respect to tolerance prarameters are also presented

    Instability in the molecular dynamics step of a hybrid Monte Carlo algorithm in dynamical fermion lattice QCD simulations

    Get PDF
    We investigate instability and reversibility within Hybrid Monte Carlo simulations using a non-perturbatively improved Wilson action. We demonstrate the onset of instability as tolerance parameters and molecular dynamics step sizes are varied. We compare these findings with theoretical expectations and present limits on simulation parameters within which a stable and reversible algorithm is obtained for physically relevant simulations. Results of optimisation experiments with respect to tolerance prarameters are also presented.Comment: RevTeX, Some results here were presented at Vielat 99, Vienna, Austria, Sept 1999 22 Pages, 10 figures, to be submitted to PR

    Development and Implementation of a High Throughput Screen for the Human Sperm-Specific Isoform of Glyceraldehyde 3-Phosphate Dehydrogenase (GAPDHS)

    Get PDF
    Glycolytic isozymes that are restricted to the male germline are potential targets for the development of reversible, non-hormonal male contraceptives. GAPDHS, the sperm-specific isoform of glyceraldehyde-3-phosphate dehydrogenase, is an essential enzyme for glycolysis making it an attractive target for rational drug design. Toward this goal, we have optimized and validated a high-throughput spectrophotometric assay for GAPDHS in 384-well format. The assay was stable over time and tolerant to DMSO. Whole plate validation experiments yielded Z’ values >0.8 indicating a robust assay for HTS. Two compounds were identified and confirmed from a test screen of the Prestwick collection. This assay was used to screen a diverse chemical library and identified fourteen small molecules that modulated the activity of recombinant purified GAPDHS with confirmed IC50 values ranging from 1.8 to 42 ”M. These compounds may provide useful scaffolds as molecular tools to probe the role of GAPDHS in sperm motility and long term to develop potent and selective GAPDHS inhibitors leading to novel contraceptive agents

    Recombinant human sperm-specific glyceraldehyde-3-phosphate dehydrogenase (GAPDHS) is expressed at high yield as an active homotetramer in baculovirus-infected insect cells

    Get PDF
    The sperm-specific glyceraldehyde-3-phosphate dehydrogenase (GAPDHS) isoform is a promising contraceptive target because it is specific to male germ cells, essential for sperm motility and male fertility, and well suited to pharmacological inhibition. However, GAPDHS is difficult to isolate from native sources and recombinant expression frequently results in high production of insoluble enzyme. We chose to use the Bac-to-Bac baculovirus-insect cell system to express a His-tagged form of human GAPDHS (Hu his-GAPDHS) lacking the proline-rich N-terminal sequence. This recombinant Hu his-GAPDHS was successfully produced in Spodoptera frugiperda 9 (Sf9) cells by infection with recombinant virus as a soluble, enzymatically active form in high yield, >35mg/L culture. Biochemical characterization of the purified enzyme by mass spectrometry and size exclusion chromatography confirmed the presence of the tetrameric form. Further characterization by peptide ion matching mass spectrometry and Edman sequencing showed that unlike the mixed tetramer forms produced in bacterial expression systems, human his-GAPDHS expressed in baculovirus-infected insect cells is homotetrameric. The ability to express and purify active human GAPDHS as homotetramers in high amounts will greatly aid in drug discovery efforts targeting this enzyme for discovery of novel contraceptives and three compounds were identified as inhibitors of Hu his-GAPDHS from a pilot screen of 1120 FDA-approved compounds

    The impact of donor and recipient common clinical and genetic variation on estimated glomerular filtration rate in a European renal transplant population

    Get PDF
    Genetic variation across the HLA is known to influence renal‐transplant outcome. However, the impact of genetic variation beyond the HLA is less clear. We tested the association of common genetic variation and clinical characteristics, from both the donor and recipient, with post‐transplant eGFR at different time‐points, out to 5‐years post‐transplantation. We conducted GWAS meta‐analyses across 10,844 donors and recipients from five European ancestry cohorts. We also analysed the impact of polygenic risk scores (PRS), calculated using genetic variants associated with non‐transplant eGFR, on post‐transplant eGFR. PRS calculated using the recipient genotype alone, as well as combined donor and recipient genotypes were significantly associated with eGFR at 1‐year post‐transplant. 32% of the variability in eGFR at 1‐year post‐transplant was explained by our model containing clinical covariates (including weights for death/graft‐failure), principal components and combined donor‐recipient PRS, with 0.3% contributed by the PRS. No individual genetic variant was significantly associated with eGFR post‐transplant in the GWAS. This is the first study to examine PRS, composed of variants that impact kidney function in the general population, in a post‐transplant context. Despite PRS being a significant predictor of eGFR post‐transplant, the effect size of common genetic factors is limited compared to clinical variables

    Application of Diffusion Tensor Imaging Parameters to Detect Change in Longitudinal Studies in Cerebral Small Vessel Disease.

    Get PDF
    Cerebral small vessel disease (SVD) is the major cause of vascular cognitive impairment, resulting in significant disability and reduced quality of life. Cognitive tests have been shown to be insensitive to change in longitudinal studies and, therefore, sensitive surrogate markers are needed to monitor disease progression and assess treatment effects in clinical trials. Diffusion tensor imaging (DTI) is thought to offer great potential in this regard. Sensitivity of the various parameters that can be derived from DTI is however unknown. We aimed to evaluate the differential sensitivity of DTI markers to detect SVD progression, and to estimate sample sizes required to assess therapeutic interventions aimed at halting decline based on DTI data. We investigated 99 patients with symptomatic SVD, defined as clinical lacunar syndrome with MRI confirmation of a corresponding infarct as well as confluent white matter hyperintensities over a 3 year follow-up period. We evaluated change in DTI histogram parameters using linear mixed effect models and calculated sample size estimates. Over a three-year follow-up period we observed a decline in fractional anisotropy and increase in diffusivity in white matter tissue and most parameters changed significantly. Mean diffusivity peak height was the most sensitive marker for SVD progression as it had the smallest sample size estimate. This suggests disease progression can be monitored sensitively using DTI histogram analysis and confirms DTI's potential as surrogate marker for SVD

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts
    • 

    corecore