15 research outputs found

    Full Page Handwriting Recognition via Image to Sequence Extraction

    Full text link
    We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on an Image to Sequence architecture, it can be trained to extract text present in an image and sequence it correctly without imposing any constraints on language, shape of characters or orientation and layout of text and non-text. The model can also be trained to generate auxiliary markup related to formatting, layout and content. We use character level token vocabulary, thereby supporting proper nouns and terminology of any subject. The model achieves a new state-of-art in full page recognition on the IAM dataset and when evaluated on scans of real world handwritten free form test answers - a dataset beset with curved and slanted lines, drawings, tables, math, chemistry and other symbols - it performs better than all commercially available HTR APIs. It is deployed in production as part of a commercial web application

    Interlayer exchange coupling in Pt/Co/Ru and Pt/Co/Ir superlattices

    No full text

    Anytime Recognition of Objects and Scenes

    No full text
    Humans are capable of perceiving a scene at a glance, and obtain deeper understanding with additional time. Sim-ilarly, visual recognition deployments should be robust to varying computational budgets. Such situations require Anytime recognition ability, which is rarely considered in computer vision research. We present a method for learn-ing dynamic policies to optimize Anytime performance in visual architectures. Our model sequentially orders feature computation and performs subsequent classification. Cru-cially, decisions are made at test time and depend on ob-served data and intermediate results. We show the applica-bility of this system to standard problems in scene and ob-ject recognition. On suitable datasets, we can incorporate a semantic back-off strategy that gives maximally specific predictions for a desired level of accuracy; this provides a new view on the time course of human visual perception. 1

    Dzyaloshinskii–Moriya interaction in Pt/Co/Ir and Pt/Co/Ru multilayer films

    No full text
    The interfacial Dzyaloshinskii–Moriya interaction (iDMI) in asymmetric magnetic multilayer films has displayed increasingly important roles in the modification of domain walls, stabilization of Skyrmions, and realization of new topological spin textures such as magnetic radial vortices. Unlike magnetization and magnetic anisotropy which can be readily measured, iDMI is difficult to measure. In this work, we measured the iDMI in Pt/Co/Ir and Pt/Co/Ru multilayer films by exploring the spin-orbit torque induced effective field under an in-plane bias magnetic field. Skyrmions have been reported to exist in Pt/Co/Ir multilayers. We found that Pt/Co/Ru multilayers have a similar magnitude of the iDMI for Pt/Co/Ir multilayers, suggesting that Pt/Co/Ru is a good candidate to host Skyrmions

    Charged line segments and ellipsoidal equipotentials

    No full text
    This is a survey of the electrostatic potentials produced by charged straight-line segments, in various numbers of spatial dimensions, with comparisons between uniformly charged segments and those having non-uniform linear charge distributions that give rise to ellipsoidal equipotentials surrounding the segments. A uniform linear distribution of charge is compatible with ellipsoidal equipotentials only for three dimensions. In higher dimensions, the linear charge density giving rise to ellipsoidal equipotentials is counter-intuitive --- the charge distribution has a maximum at the center of the segment and vanishes at the ends of the segment. Only in two dimensions is the continuous charge distribution intuitive --- for that one case of ellipsoidal equipotentials, the charge is peaked at the ends of the segment and minimized at the center
    corecore