65 research outputs found

    Corpus Conversion Service: A Machine Learning Platform to Ingest Documents at Scale

    Full text link
    Over the past few decades, the amount of scientific articles and technical literature has increased exponentially in size. Consequently, there is a great need for systems that can ingest these documents at scale and make the contained knowledge discoverable. Unfortunately, both the format of these documents (e.g. the PDF format or bitmap images) as well as the presentation of the data (e.g. complex tables) make the extraction of qualitative and quantitive data extremely challenging. In this paper, we present a modular, cloud-based platform to ingest documents at scale. This platform, called the Corpus Conversion Service (CCS), implements a pipeline which allows users to parse and annotate documents (i.e. collect ground-truth), train machine-learning classification algorithms and ultimately convert any type of PDF or bitmap-documents to a structured content representation format. We will show that each of the modules is scalable due to an asynchronous microservice architecture and can therefore handle massive amounts of documents. Furthermore, we will show that our capability to gather ground-truth is accelerated by machine-learning algorithms by at least one order of magnitude. This allows us to both gather large amounts of ground-truth in very little time and obtain very good precision/recall metrics in the range of 99\% with regard to content conversion to structured output. The CCS platform is currently deployed on IBM internal infrastructure and serving more than 250 active users for knowledge-engineering project engagements.Comment: Accepted paper at KDD 2018 conferenc

    An extreme-scale implicit solver for complex PDEs: highly heterogeneous flow in earth's mantle

    Get PDF
    Mantle convection is the fundamental physical process within earth's interior responsible for the thermal and geological evolution of the planet, including plate tectonics. The mantle is modeled as a viscous, incompressible, non-Newtonian fluid. The wide range of spatial scales, extreme variability and anisotropy in material properties, and severely nonlinear rheology have made global mantle convection modeling with realistic parameters prohibitive. Here we present a new implicit solver that exhibits optimal algorithmic performance and is capable of extreme scaling for hard PDE problems, such as mantle convection. To maximize accuracy and minimize runtime, the solver incorporates a number of advances, including aggressive multi-octree adaptivity, mixed continuous-discontinuous discretization, arbitrarily-high-order accuracy, hybrid spectral/geometric/algebraic multigrid, and novel Schur-complement preconditioning. These features present enormous challenges for extreme scalability. We demonstrate that---contrary to conventional wisdom---algorithmically optimal implicit solvers can be designed that scale out to 1.5 million cores for severely nonlinear, ill-conditioned, heterogeneous, and anisotropic PDEs

    Long-term outcome and patterns of failure in patients with advanced head and neck cancer

    Get PDF
    <p>Abstract</p> <p>Purpose</p> <p>To access the long-time outcome and patterns of failure in patients with advanced head and neck squamous cell carcinoma (HNSCC).</p> <p>Methods and materials</p> <p>Between 1992 and 2005 127 patients (median age 55 years, UICC stage III n = 6, stage IV n = 121) with primarily inoperable, advanced HNSCC were treated with definite platinum-based radiochemotherapy (median dose 66.4 Gy). Analysed end-points were overall survival (OS), disease-free survival (DFS), loco-regional progression-free survival (LPFS), development of distant metastases (DM), prognostic factors and causes of death.</p> <p>Results</p> <p>The mean follow-up time was 34 months (range, 3-156 months), the 3-, 5- and 10-year OS rates were 39%, 28% and 14%, respectively. The median OS was 23 months. Forty-seven patients achieved a complete remission and 78 patients a partial remission. The median LPFS was 17 months, the 3-, 5- and 10-year LPFS rates were 41%, 33% and 30%, respectively. The LPFS was dependent on the nodal stage (p = 0.029). The median DFS was 11 months (range, 2-156 months), the 3-, 5- and 10-year DFS rates were 30%, 24% and 22%, respectively. Prognostic factors in univariate analyses were alcohol abuse (n = 102, p = 0.015), complete remission (n = 47, p < 0.001), local recurrence (n = 71, p < 0.001), development of DM (n = 45, p < 0.001; median OS 16 months) and borderline significance in nodal stage N2 versus N3 (p = 0.06). Median OS was 26 months with lung metastases (n = 17). Nodal stage was a predictive factor for the development of DM (p = 0.025). Cause of death was most commonly tumor progression.</p> <p>Conclusions</p> <p>In stage IV HNSCC long-term survival is rare and DM is a significant predictor for mortality. If patients developed DM, lung metastases had the most favourable prognosis, so intensified palliative treatment might be justified in DM limited to the lungs.</p

    Disease control and functional outcome in three modern combined organ preserving regimens for locally advanced squamous cell carcinoma of the head and neck (SCCHN)

    Get PDF
    <p>Abstract</p> <p>Purpose</p> <p>To report our experience on disease control and functional outcome using three modern combined-modality approaches for definitive radiochemotherapy of locally advanced SCCHN with modern radiotherapy techniques: radiochemotherapy (RChT), radioimmunotherapy (RIT) with cetuximab, or induction chemotherapy with docetaxel, cisplatin, and 5-FU (TPF) combined with either RChT or RIT.</p> <p>Methods</p> <p>Toxicity and outcome was retrospectively analysed in patients receiving definitive RChT, RIT, or induction chemotherapy followed by RChT or RIT between 2006 and 2009. Outcome was estimated using Kaplan-Meier analyses, toxicity was analysed according to CTCAE v 3.0.</p> <p>Results</p> <p>Thirty-eight patients were treated with RChT, 38 patients with RIT, 16 patients received TPF followed by either RChT or RIT. Radiotherapy was mostly applied as IMRT (68%). Long-term toxicity was low, only one case of grad III dysphagia requiring oesophageal dilatation, no case of either xerostomia ≥ grade II or cervical plexopathy were observed. Median overall survival (OS) was 25.7 months (RChT) and 27.7 months (RIT), median locoregional progression-free survival (PFS) was not reached yet. Subgroup analysis showed no significant differences between TPF, RChT, and RIT despite higher age and co-morbidities in the RIT group. Results suggested improved OS, distant and overall PFS for the TPF regimen.</p> <p>Conclusion</p> <p>Late radiation effects in our cohort are rare. No significant differences in outcome between RChT and RIT were observed. Adding TPF suggests improved progression-free and overall survival, impact of TPF on locoregional PFS was marginal, therefore radiotherapeutic options for intensification of local treatment should be explored.</p

    A meta-analysis of hyperfractionated and accelerated radiotherapy and combined chemotherapy and radiotherapy regimens in unresected locally advanced squamous cell carcinoma of the head and neck

    Get PDF
    BACKGROUND: Former meta-analyses have shown a survival benefit for the addition of chemotherapy (CHX) to radiotherapy (RT) and to some extent also for the use of hyperfractionated radiation therapy (HFRT) and accelerated radiation therapy (AFRT) in locally advanced squamous cell carcinoma (SCC) of the head and neck. However, the publication of new studies and the fact that many older studies that were included in these former meta-analyses used obsolete radiation doses, CHX schedules or study designs prompted us to carry out a new analysis using strict inclusion criteria. METHODS: Randomised trials testing curatively intended RT (≥60 Gy in >4 weeks/>50 Gy in <4 weeks) on SCC of the oral cavity, oropharynx, hypopharynx, and larynx published as full paper or in abstract form between 1975 and 2003 were eligible. Trials comparing RT alone with concurrent or alternating chemoradiation (5-fluorouracil (5-FU), cisplatin, carboplatin, mitomycin C) were analyzed according to the employed radiation schedule and the used CHX regimen. Studies comparing conventionally fractionated radiotherapy (CFRT) with either HFRT or AFRT without CHX were separately examined. End point of the meta-analysis was overall survival. RESULTS: Thirty-two trials with a total of 10 225 patients were included into the meta-analysis. An overall survival benefit of 12.0 months was observed for the addition of simultaneous CHX to either CFRT or HFRT/AFRT (p < 0.001). Separate analyses by cytostatic drug indicate a prolongation of survival of 24.0 months, 16.8 months, 6.7 months, and 4.0 months, respectively, for the simultaneous administration of 5-FU, cisplatin-based, carboplatin-based, and mitomycin C-based CHX to RT (each p < 0.01). Whereas no significant gain in overall survival was observed for AFRT in comparison to CFRT, a substantial prolongation of median survival (14.2 months, p < 0.001) was seen for HFRT compared to CFRT (both without CHX). CONCLUSION: RT combined with simultaneous 5-FU, cisplatin, carboplatin, and mitomycin C as single drug or combinations of 5-FU with one of the other drugs results in a large survival advantage irrespective the employed radiation schedule. If radiation therapy is used as single modality, hyperfractionation leads to a significant improvement of overall survival. Accelerated radiation therapy alone, especially when given as split course radiation schedule or extremely accelerated treatments with decreased total dose, does not increase overall survival
    corecore