807 research outputs found

    CONTINUOUS DEPTH MAP RECONSTRUCTION FROM LIGHT FIELDS

    Get PDF
    Light field analysis recently received growing interest, since its rich structure information benefits many computer vision tasks. This paper presents a novel method to reconstruct continuous depth maps from light field data. Conventional approaches usually treat depth map reconstruction as an optimization problem with discrete labels. On the contrary, our proposed method can obtain continuous depth maps by solving a linear system, which preserves richer details compared with conventional discrete approaches. Structure tensor is employed to extract raw depth information and corresponding confidence levels from the light field data. We introduce a method to reduce the adverse effect of unreliable local estimations, which helps to get rid of errors in specular areas and edges where depth values are discontinuous. Experiments on both synthetic and real light field data demonstrate the effectiveness of the proposed method. Index Terms — Depth map reconstruction, light field, linear system 1

    A Unified Approach to Robust Inference for Genetic Covariance

    Full text link
    Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits. Many complex traits are found to have shared genetic etiology. Genetic covariance is defined as the underlying covariance of genetic values and can be used to measure the shared genetic architecture. The data of two outcomes may be collected from the same group or different groups of individuals and the outcomes can be of different types or collected based on different study designs. This paper proposes a unified approach to robust estimation and inference for genetic covariance of general outcomes that may be associated with genetic variants nonlinearly. We provide the asymptotic properties of the proposed estimator and show that our proposal is robust under certain model mis-specification. Our method under linear working models provides robust inference for the narrow-sense genetic covariance, even when both linear models are mis-specified. Various numerical experiments are performed to support the theoretical results. Our method is applied to an outbred mice GWAS data set to study the overlapping genetic effects between the behavioral and physiological phenotypes. The real data results demonstrate the robustness of the proposed method and reveal interesting genetic covariance among different mice developmental traits

    Soil Liquid Limit and Plastic Limit Treating System Based on Analytic Method

    Get PDF
    AbstractAccording to two present China national standards, a software as Soil Liquid Limit and Plastic Limit Data Treating System, with analytic method, was developed using object-oriented visual programming tool. The analytic method used in the developed system was different to traditional method of treating soil liquid limit and plastic limit data. N-S algorithm flowchart demonstrated that switch statement and condition statement were taken as main algorithm and second level select nested structure was taken as main frame for the developed system. Three kinds of soil specimens were tested with liquid and plastic limit combined test and the test data was treated with graphic method, Excel software and Soil Liquid Limit and Plastic Limit Data Treating System. The comparative conclusion indicated that Soil Liquid Limit and Plastic Limit Data Treating System improved efficiency and accuracy evidently for treating soil liquid and plastic limit data and had advantages of easy operation and high reliability

    Convolutional Initialization for Data-Efficient Vision Transformers

    Full text link
    Training vision transformer networks on small datasets poses challenges. In contrast, convolutional neural networks (CNNs) can achieve state-of-the-art performance by leveraging their architectural inductive bias. In this paper, we investigate whether this inductive bias can be reinterpreted as an initialization bias within a vision transformer network. Our approach is motivated by the finding that random impulse filters can achieve almost comparable performance to learned filters in CNNs. We introduce a novel initialization strategy for transformer networks that can achieve comparable performance to CNNs on small datasets while preserving its architectural flexibility.Comment: 14 pages, 9 figures, 8 table

    Combinando video y programación: el papel de la programación en un video musical interactivo

    Full text link
    [ES] La tecnología de programación ha jugado un papel importante en el desarrollo de los videojuegos en los últimos años. Por otro lado, estamos asistiendo a un cambio paulatino en el modo de ver películas, el sistema tradicional no puede satisfacer la demanda de entretenimiento de la audiencia, más acorde con el planteamiento del vídeo musical interactivo que surge en cierta medida como alternativa al tradicional. La idea de este proyecto está inspirada en el proyecto de película interactiva de ¿Black Mirror: Bandersnatch¿ lanzada por Netflix a finales de 2018. En ella se aplica un método interactivo relacionado con el videojuego ofreciéndonos un engañoso control sobre los personajes y el desarrollo de la historia. Este trabajo se enmarca dentro de la creación de un producto interactivo, realizado junto a Zakaria Bouassa (un alumno también del máster), en el que la audiencia podrá experimentar distintas sensaciones emocionales observando diferentes vídeos. En el desarrollo de las diferentes opciones se utilizará una descripción de los ambientes psicológicos en dos frecuencias de energía (alta o baja) y de afecto (positivo o negativo) para ir generando diferentes combinaciones de vídeo que se asociará a los diferentes elementos del producto interactivo. La responsabilidad principal de la parte de programación es organizar y fusionar los videoclips a través del software de programación. Se usará el software Unity para crear interfaces de ventana, botones de opción y guiar la dirección de conexión de videoclips. La estructura utilizada para el Desarrollo del programa ha sido la de árbol binario.[EN] Programming technology has played an important role in the development of video games in recent years. On the other hand, we are witnessing a gradual change in the way we watch movies, the traditional system cannot satisfy the audience's demand for entertainment, more in line with the approach of interactive music video that arises to some extent as an alternative to the traditional one. The idea of this project is inspired by the interactive film project of "Black Mirror: Bandersnatch" launched by Netflix at the end of 2018, in which an interactive method related to the video game is applied, offering us virtual control over the characters and the development of the history. This work is part of the creation of an interactive product, carried out together with Zakaria Bouassa (another student from the same master), in which the audience will be able to experience different emotional sensations through watching different videos. For the development of these different options, a description of the psychological environments in two frequencies of energy (high or low) and affect (positive or negative) will be used to generate different video combinations that will be associated with the different elements of the interactive product. The main responsibility of the programming part is to organize and merge the video clips through the programming software. Unity software will be used to create window interfaces, buttons, and guide the direction of the video clip. The binary tree structure was used for the development of this software.Li, J. (2020). Combinando video y programación: el papel de la programación en un video musical interactivo. Universitat Politècnica de València. http://hdl.handle.net/10251/150000TFG

    A unified formulation for circle and polygon concrete-filled steel tube columns under axial compression

    Get PDF
    Current design practice of concrete-filled steel tube (CFST) columns uses different formulas for different section profiles to predict the axial load bearing capacity. It has always been a challenge and practically important issue for researchers and design engineers who want to find a unified formula that can be used in the design of the columns with various sections, including solid, hollow, circular and polygonal sections. This has been driven by modern design requirements for continuous optimization of structures in terms of not only the use of materials, but also the topology of structural components. This paper extends the authors’ previous work [1] on a unified formulation of the axial load bearing capacity for circular hollow and solid CFST columns to, now, including hollow and solid CFST columns with regular polygonal sections. This is done by taking a circular section as a special case of a polygonal one. Finally, a unified formula is proposed for calculating the axial load bearing capacity of solid and hollow CFST columns with either circular or polygonal sections. In addition, laboratory tests on hollow circular and square CFST long columns are reported. These results are useful addition to the very limited open literature on testing these columns, and are also as a part of the validation process of the proposed analytical formulas

    NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation

    Full text link
    In this paper, we propose a Chinese multi-turn topic-driven conversation dataset, NaturalConv, which allows the participants to chat anything they want as long as any element from the topic is mentioned and the topic shift is smooth. Our corpus contains 19.9K conversations from six domains, and 400K utterances with an average turn number of 20.1. These conversations contain in-depth discussions on related topics or widely natural transition between multiple topics. We believe either way is normal for human conversation. To facilitate the research on this corpus, we provide results of several benchmark models. Comparative results show that for this dataset, our current models are not able to provide significant improvement by introducing background knowledge/topic. Therefore, the proposed dataset should be a good benchmark for further research to evaluate the validity and naturalness of multi-turn conversation systems. Our dataset is available at https://ai.tencent.com/ailab/nlp/dialogue/#datasets.Comment: Accepted as a main track paper at AAAI 202

    Robust Point Cloud Processing through Positional Embedding

    Full text link
    End-to-end trained per-point embeddings are an essential ingredient of any state-of-the-art 3D point cloud processing such as detection or alignment. Methods like PointNet, or the more recent point cloud transformer -- and its variants -- all employ learned per-point embeddings. Despite impressive performance, such approaches are sensitive to out-of-distribution (OOD) noise and outliers. In this paper, we explore the role of an analytical per-point embedding based on the criterion of bandwidth. The concept of bandwidth enables us to draw connections with an alternate per-point embedding -- positional embedding, particularly random Fourier features. We present compelling robust results across downstream tasks such as point cloud classification and registration with several categories of OOD noise.Comment: 18 pages, 13 figures, 5 table

    Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

    Full text link
    Recent advances in large-scale pre-training provide large models with the potential to learn knowledge from the raw text. It is thus natural to ask whether it is possible to leverage these large models as knowledge bases for downstream tasks. In this work, we answer the aforementioned question in unsupervised knowledge-grounded conversation. We explore various methods that best elicit knowledge from large models. Our human study indicates that, though hallucinations exist, large models post the unique advantage of being able to output common sense and summarize facts that cannot be directly retrieved from the search engine. To better exploit such generated knowledge in dialogue generation, we treat the generated knowledge as a noisy knowledge source and propose the posterior-based reweighing as well as the noisy training strategy. Empirical results on two benchmarks show advantages over the state-of-the-art methods.Comment: Accepted to EMNLP 2022 Main Conference. The code is publicly available at https://github.com/lyy1994/PLM_as_KB/tree/main/projects/plm_as_k
    corecore