658 research outputs found

    Using object detection to extract structured content from documents

    Get PDF
    Structured content such as figures, tables, graphs, captions, and other graphical material often capture the essence of a document. Experienced readers often review the graphical material in a document first to quickly grasp the contents of the document. It is thus evident that identifying and extracting the structured content of a document, e.g., graphical components, is important in building a deeper semantic understanding of the document. Techniques presented herein automatically extract the structured content of documents. Machine-learning techniques, e.g., object detection, computer vision, etc., are used to recognize and extract the structured content. The techniques work well regardless of the tool used to create the document. For example, the document can be a PDF file, captured via screenshot, generated by a computer-aided design tool, etc. The techniques work across fields of study, across publishing conventions, languages and written scripts, and are robust to different formats of graphical content, e.g., vector/raster graphics

    Hashing for Similarity Search: A Survey

    Full text link
    Similarity search (nearest neighbor search) is a problem of pursuing the data items whose distances to a query item are the smallest from a large database. Various methods have been developed to address this problem, and recently a lot of efforts have been devoted to approximate search. In this paper, we present a survey on one of the main solutions, hashing, which has been widely studied since the pioneering work locality sensitive hashing. We divide the hashing algorithms two main categories: locality sensitive hashing, which designs hash functions without exploring the data distribution and learning to hash, which learns hash functions according the data distribution, and review them from various aspects, including hash function design and distance measure and search scheme in the hash coding space

    Semi-Supervised Classification Using Object Metadata

    Get PDF
    Generally, the present disclosure is directed to classification of data objects (e.g. documents, images, graphs, etc.). In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to classify a data object based on object data and/or metadata associated with the object

    Suggesting Deletion of Blurry Photos

    Get PDF
    Generally, the present disclosure is directed to identifying and suggesting deletion of blurry photos. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict a blurriness characteristic of an image based on image data. For example, the blurriness characteristic can describe a percentage of the image that is blurry
    • …
    corecore