116,416 research outputs found

    Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images

    Full text link
    Automatic table detection in PDF documents has achieved a great success but tabular data extraction are still challenging due to the integrity and noise issues in detected table areas. The accurate data extraction is extremely crucial in finance area. Inspired by this, the aim of this research is proposing an automated table detection and tabular data extraction from financial PDF documents. We proposed a method that consists of three main processes, which are detecting table areas with a Faster R-CNN (Region-based Convolutional Neural Network) model with Feature Pyramid Network (FPN) on each page image, extracting contents and structures by a compounded layout segmentation technique based on optical character recognition (OCR) and formulating regular expression rules for table header separation. The tabular data extraction feature is embedded with rule-based filtering and restructuring functions that are highly scalable. We annotate a new Financial Documents dataset with table regions for the experiment. The excellent table detection performance of the detection model is obtained from our customized dataset. The main contributions of this paper are proposing the Financial Documents dataset with table-area annotations, the superior detection model and the rule-based layout segmentation technique for the tabular data extraction from PDF files

    StoryDroid: Automated Generation of Storyboard for Android Apps

    Full text link
    Mobile apps are now ubiquitous. Before developing a new app, the development team usually endeavors painstaking efforts to review many existing apps with similar purposes. The review process is crucial in the sense that it reduces market risks and provides inspiration for app development. However, manual exploration of hundreds of existing apps by different roles (e.g., product manager, UI/UX designer, developer) in a development team can be ineffective. For example, it is difficult to completely explore all the functionalities of the app in a short period of time. Inspired by the conception of storyboard in movie production, we propose a system, StoryDroid, to automatically generate the storyboard for Android apps, and assist different roles to review apps efficiently. Specifically, StoryDroid extracts the activity transition graph and leverages static analysis techniques to render UI pages to visualize the storyboard with the rendered pages. The mapping relations between UI pages and the corresponding implementation code (e.g., layout code, activity code, and method hierarchy) are also provided to users. Our comprehensive experiments unveil that StoryDroid is effective and indeed useful to assist app development. The outputs of StoryDroid enable several potential applications, such as the recommendation of UI design and layout code

    Analytical methods and simulation models to assess innovative operational measures and technologies for rail port terminals: the case of Valencia Principe Felipe terminal

    Get PDF
    The topic of freight transport by rail is a complex theme and, in recent years, a main issue of European policy. The legislation evolution and the White Paper 2011 have demonstrated the European intention to re-launch this sector. The challenge is to promote the intermodal transport system to the detriment of road freight transport. In this context intermodal freight terminals, play a primary role for the supply chain, they are the connection point between the various transport nodes and the nodal points where the freight are handled, stored and transferred between different modes to final customer. To achieve the purpose, it is strengthen the improvement of existing intermodal freight terminals and the development of innovative intermodal freight terminals towards higher performance (ERRAC, 2012). Many terminal performances improvements have been proposed and sometime experimented. They are normally basing on combinations of operational measures and innovative technologies (e.g. automatic horizontal and parallel storage and handling, automated gate and sensors for tracking systems data exchange) tested in various terminals, with often-contradictory results. The research work described in this paper (developed within the Capacity4Rail EU project) focusses on the assessment of effects that these innovations can have in the intermodal freight terminals combined in various alternative consistent effective scenarios. The methodological framework setup to assess these innovations is basing on a combination of analytical methods based on sequential algorithms and discrete events simulation models. The output of this assessment method are key performance indicators (KPIs) selected according to terminals typologies and related to different aspects (e.g. management, operation and organization). The present paper illustrates the application of the methodological framework, tuned on the operation of various intermodal terminals, for the validation on today operation and the assessment of possible future scenarios to the case study of the Principe Felipe sea-rail terminal in Valencia

    FraudDroid: Automated Ad Fraud Detection for Android Apps

    Get PDF
    Although mobile ad frauds have been widespread, state-of-the-art approaches in the literature have mainly focused on detecting the so-called static placement frauds, where only a single UI state is involved and can be identified based on static information such as the size or location of ad views. Other types of fraud exist that involve multiple UI states and are performed dynamically while users interact with the app. Such dynamic interaction frauds, although now widely spread in apps, have not yet been explored nor addressed in the literature. In this work, we investigate a wide range of mobile ad frauds to provide a comprehensive taxonomy to the research community. We then propose, FraudDroid, a novel hybrid approach to detect ad frauds in mobile Android apps. FraudDroid analyses apps dynamically to build UI state transition graphs and collects their associated runtime network traffics, which are then leveraged to check against a set of heuristic-based rules for identifying ad fraudulent behaviours. We show empirically that FraudDroid detects ad frauds with a high precision (93%) and recall (92%). Experimental results further show that FraudDroid is capable of detecting ad frauds across the spectrum of fraud types. By analysing 12,000 ad-supported Android apps, FraudDroid identified 335 cases of fraud associated with 20 ad networks that are further confirmed to be true positive results and are shared with our fellow researchers to promote advanced ad fraud detectionComment: 12 pages, 10 figure
    • …
    corecore