59 research outputs found

    Visibility And Confidence Estimation Of An Onboard-Camera Image For An Intelligent Vehicle

    Get PDF
    More and more drivers nowadays enjoy the convenience brought by advanced driver assistances system (ADAS) including collision detection, lane keeping and ACC. However, many assistant functions are still constrained by weather and terrain. In the way towards automated driving, the need of an automatic condition detector is inevitable, since many solutions only work for certain conditions. When it comes to camera, which is most commonly used tool in lane detection, obstacle detection, visibility estimation is one of such important parameters we need to analyze. Although many papers have proposed their own ways to estimate visibility range, there is little research on the question of how to estimate the confidence of an image. In this thesis, we introduce a new way to detect visual distance based on a monocular camera, and thereby we calculate the overall image confidence. Much progresses has been achieved in the past ten years from restoration of foggy images, real-time fog detection to weather classification. However, each method has its own drawbacks, ranging from complexity, cost, and inaccuracy. According to these considerations, the new way we proposed to estimate visibility range is based on a single vision system. In addition, this method can maintain a relatively robust estimation and produce a more accurate result

    Improving attention model based on cognition grounded data for sentiment analysis

    Get PDF
    Attention models are proposed in sentiment analysis and other classification tasks because some words are more important than others to train the attention models. However, most existing methods either use local context based information, affective lexicons, or user preference information. In this work, we propose a novel attention model trained by cognition grounded eye-tracking data. First,a reading prediction model is built using eye-tracking data as dependent data and other features in the context as independent data. The predicted reading time is then used to build a cognition grounded attention layer for neural sentiment analysis. Our model can capture attentions in context both in terms of words at sentence level as well as sentences at document level. Other attention mechanisms can also be incorporated together to capture other aspects of attentions, such as local attention, and affective lexicons. Results of our work include two parts. The first part compares our proposed cognition ground attention model with other state-of-the-art sentiment analysis models. The second part compares our model with an attention model based on other lexicon based sentiment resources. Evaluations show that sentiment analysis using cognition grounded attention model outperforms the state-of-the-art sentiment analysis methods significantly. Comparisons to affective lexicons also indicate that using cognition grounded eye-tracking data has advantages over other sentiment resources by considering both word information and context information. This work brings insight to how cognition grounded data can be integrated into natural language processing (NLP) tasks

    Learning Heterogeneous Network Embedding From Text and Links

    Get PDF
    Finding methods to represent multiple types of nodes in heterogeneous networks is both challenging and rewarding, as there is much less work in this area compared with that of homogeneous networks. In this paper, we propose a novel approach to learn node embedding for heterogeneous networks through a joint learning framework of both network links and text associated with nodes. A novel attention mechanism is also used to make good use of text extended through links to obtain much larger network context. Link embedding is first learned through a random-walk-based method to process multiple types of links. Text embedding is separately learned at both sentence level and document level to capture salient semantic information more comprehensively. Then, both types of embeddings are jointly fed into a hierarchical neural network model to learn node representation through mutual enhancement. The attention mechanism follows linked edges to obtain context of adjacent nodes to extend context for node representation. The evaluation on a link prediction task in a heterogeneous network data set shows that our method outperforms the current state-of-the-art method by 2.5%-5.0% in AUC values with p-value less than 10 -9 , indicating very significant improvement

    UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons

    Full text link
    The automatic co-speech gesture generation draws much attention in computer animation. Previous works designed network structures on individual datasets, which resulted in a lack of data volume and generalizability across different motion capture standards. In addition, it is a challenging task due to the weak correlation between speech and gestures. To address these problems, we present UnifiedGesture, a novel diffusion model-based speech-driven gesture synthesis approach, trained on multiple gesture datasets with different skeletons. Specifically, we first present a retargeting network to learn latent homeomorphic graphs for different motion capture standards, unifying the representations of various gestures while extending the dataset. We then capture the correlation between speech and gestures based on a diffusion model architecture using cross-local attention and self-attention to generate better speech-matched and realistic gestures. To further align speech and gesture and increase diversity, we incorporate reinforcement learning on the discrete gesture units with a learned reward function. Extensive experiments show that UnifiedGesture outperforms recent approaches on speech-driven gesture generation in terms of CCA, FGD, and human-likeness. All code, pre-trained models, databases, and demos are available to the public at https://github.com/YoungSeng/UnifiedGesture.Comment: 16 pages, 11 figures, ACM MM 202

    Applications of various range shifters for proton pencil beam scanning radiotherapy

    Get PDF
    Background A range pull-back device, such as a machine-related range shifter (MRS) or a universal patient-related range shifter (UPRS), is needed in pencil beam scanning technique to treat shallow tumors. Methods Three UPRS made by QFix (Avondale, PA, USA) allow treating targets across the body: U-shaped bolus (UB), anterior lateral bolus (ALB), and couch top bolus. Head-and-neck (HN) patients who used the UPRS were tested. The in-air spot sizes were measured and compared in this study at air gaps: 6 cm, 16 cm, and 26 cm. Measurements were performed in a solid water phantom using a single-field optimization pencil beam scanning field with the ALB placed at 0, 10, and 20 cm air gaps. The two-dimensional dose maps at the middle of the spread-out Bragg peak were measured using ion chamber array MatriXX PT (IBA-Dosimetry, Schwarzenbruck, Germany) located at isocenter and compared with the treatment planning system. Results A UPRS can be consistently placed close to the patient and maintains a relatively small spot size resulting in improved dose distributions. However, when a UPRS is non-removable (e.g. thick couch top), the quality of volumetric imaging is degraded due to their high Z material construction, hindering the value of Image-Guided Radiation Therapy (IGRT). Limitations of using UPRS with small air gaps include reduced couch weight limit, potential collision with patient or immobilization devices, and challenges using non-coplanar fields with certain UPRS. Our experience showed the combination of a U-shaped bolus exclusively for an HN target and an MRS as the complimentary device for head-and-neck targets as well as for all other treatment sites may be ideal to preserve the dosimetric advantages of pencil beam scanning proton treatments across the body. Conclusion We have described how to implement UPRS and MRS for various clinical indications using the PBS technique, and comprehensively reviewed the advantage and disadvantages of UPRS and MRS. We recommend the removable UB only to be employed for the brain and HN treatments while an automated MRS is used for all proton beams that require RS but not convenient or feasible to use UB

    Daily streamflow simulation based on the improved machine learning method

    Get PDF
    Kan, G., He, X., Ding, L., Li, J., Hong, Y., Ren, M., Lei, T., Liang, K., Zuo, D., & Huang, P. (March-April, 2017). Daily streamflow simulation based on the improved machine learning method. Water Technology and Sciences (in Spanish), 8(2), 51-60. Daily streamflow simulation has usually been implemented by conceptual or distributed hydrological models. Nowadays, hydrological data, which can be easily obtained from automatic measuring systems, are more than enough. Therefore, machine learning turns into an effective and popular tool which is highly suited for the streamflow simulation task. In this paper, we propose an improved machine learning method referred to as PKEK model based on the previously proposed NU-PEK model for the purpose of generating daily streamflow simulation results with better accuracy and stability. Comparison results between the PKEK model and the NU-PEK model indicated that the improved model has better accuracy and stability and has a bright application prospect for daily streamflow simulation tasks

    PSR J1926-0652: A Pulsar with Interesting Emission Properties Discovered at FAST

    Get PDF
    We describe PSR J1926-0652, a pulsar recently discovered with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Using sensitive single-pulse detections from FAST and long-term timing observations from the Parkes 64-m radio telescope, we probed phenomena on both long and short time scales. The FAST observations covered a wide frequency range from 270 to 800 MHz, enabling individual pulses to be studied in detail. The pulsar exhibits at least four profile components, short-term nulling lasting from 4 to 450 pulses, complex subpulse drifting behaviours and intermittency on scales of tens of minutes. While the average band spacing P3 is relatively constant across different bursts and components, significant variations in the separation of adjacent bands are seen, especially near the beginning and end of a burst. Band shapes and slopes are quite variable, especially for the trailing components and for the shorter bursts. We show that for each burst the last detectable pulse prior to emission ceasing has different properties compared to other pulses. These complexities pose challenges for the classic carousel-type models.Comment: 13pages with 12 figure

    Phrase embedding learning based on external and internal context with compositionality constraint

    Get PDF
    Different methods are proposed to learn phrase embedding, which can be mainly divided into two strands. The first strand is based on the distributional hypothesis to treat a phrase as one non-divisible unit and to learn phrase embedding based on its external context similar to learn word embedding. However, distributional methods cannot make use of the information embedded in component words and they also face data spareness problem. The second strand is based on the principle of compositionality to infer phrase embedding based on the embedding of its component words. Compositional methods would give erroneous result if a phrase is non-compositional. In this paper, we propose a hybrid method by a linear combination of the distributional component and the compositional component with an individualized phrase compositionality constraint. The phrase compositionality is automatically computed based on the distributional embedding of the phrase and its component words. Evaluation on five phrase level semantic tasks and experiments show that our proposed method has overall best performance. Most importantly, our method is more robust as it is less sensitive to datasets
    corecore