20 research outputs found

    An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

    Full text link
    Despite making significant progress in multi-modal tasks, current Multi-modal Large Language Models (MLLMs) encounter the significant challenge of hallucination, which may lead to harmful consequences. Therefore, evaluating MLLMs' hallucinations is becoming increasingly important in model improvement and practical application deployment. Previous works are limited in high evaluation costs (e.g., relying on humans or advanced LLMs) and insufficient evaluation dimensions (e.g., types of hallucination and task). In this paper, we propose an LLM-free multi-dimensional benchmark AMBER, which can be used to evaluate both generative task and discriminative task including object existence, object attribute and object relation hallucination. Based on AMBER, we design a low-cost and efficient evaluation pipeline. Additionally, we conduct a comprehensive evaluation and detailed analysis of mainstream MLLMs including GPT-4V(ision), and also give guideline suggestions for mitigating hallucinations. The data and code of AMBER are available at https://github.com/junyangwang0410/AMBER.Comment: 11 pages, 4 figure

    Evaluation and Analysis of Hallucination in Large Vision-Language Models

    Full text link
    Large Vision-Language Models (LVLMs) have recently achieved remarkable success. However, LVLMs are still plagued by the hallucination problem, which limits the practicality in many scenarios. Hallucination refers to the information of LVLMs' responses that does not exist in the visual input, which poses potential risks of substantial consequences. There has been limited work studying hallucination evaluation in LVLMs. In this paper, we propose Hallucination Evaluation based on Large Language Models (HaELM), an LLM-based hallucination evaluation framework. HaELM achieves an approximate 95% performance comparable to ChatGPT and has additional advantages including low cost, reproducibility, privacy preservation and local deployment. Leveraging the HaELM, we evaluate the hallucination in current LVLMs. Furthermore, we analyze the factors contributing to hallucination in LVLMs and offer helpful suggestions to mitigate the hallucination problem. Our training data and human annotation hallucination data will be made public soon.Comment: 11 pages, 5 figure

    Retrospective seroepidemiology indicated that human enterovirus 71 and coxsackievirus A16 circulated wildly in central and southern China before large-scale outbreaks from 2008

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Large nationwide outbreaks of hand, foot, and mouth disease (HFMD) occurred in China from 2008; most of the cases were in children under 5 years. This study aims to identify the situation of natural human enterovirus 71 (HEV71) and coxsackievirus A16 (CVA16) infections in children before 2008 in China.</p> <p>Results</p> <p>Retrospective seroepidemiologic studies of HEV71 and CVA16 were performed with 900 serum samples collected from children ≤5 years of age in 2005. The samples were collected from 6 different geographical areas (Anhui, Guangdong, Hunan, Xinjiang, Yunnan, and Heilongjiang provinces) in mainland China. Of the 900 samples, 288 were positive for HEV71; the total positive rate was 32.0% and the geometric mean titer (GMT) was 1:8.5. Guangdong (43.7% and 1:10.8), Xinjiang (45.4% and 1:11.1), and Yunnan (43.4% and 1:12.0) provinces had relatively high rates of infection, while Heilongjiang province (8.1% and 1:4.9) had the lowest rate of infection. On the other hand, 390 samples were positive for CVA16; the total positive rate was 43.4% and the GMT was 1:9.5. Anhui (62.2% and 1:16.0) and Hunan (61.1% and 1:23.1) had relatively high rates, while Heilongjiang (8.0% and 1:4.6) had the lowest rate. Although there is a geographical difference in HEV71 and CVA16 infections, low neutralizing antibody positive rate and titer of both viruses were found in all 6 provinces.</p> <p>Conclusions</p> <p>This report confirmed that HEV71 and CVA16 had wildly circulated in a couple provinces in China before the large-scale outbreaks from 2008. This finding also suggests that public health measures to control the spread of HEV71 and CVA16 should be devised according to the different regional characteristics.</p

    Gigahertz-rate-switchable wavefront shaping through integration of metasurfaces with photonic integrated circuit

    Full text link
    Achieving spatiotemporal control of light at high-speeds presents immense possibilities for various applications in communication, computation, metrology, and sensing. The integration of subwavelength metasurfaces and optical waveguides offers a promising approach to manipulate light across multiple degrees of freedom at high-speed in compact photonic integrated circuit (PICs) devices. Here, we demonstrate a gigahertz-rate-switchable wavefront shaping by integrating metasurface, lithium niobite on insulator (LNOI) photonic waveguide and electrodes within a PIC device. As proofs of concept, we showcase the generation of a focus beam with reconfigurable arbitrary polarizations, switchable focusing with lateral focal positions and focal length, orbital angular momentum light beams (OAMs) as well as Bessel beams. Our measurements indicate modulation speeds of up to gigahertz rate. This integrated platform offers a versatile and efficient means of controlling light field at high-speed within a compact system, paving the way for potential applications in optical communication, computation, sensing, and imaging

    Amino Acid Homeostasis Modulates Salicylic Acid–Associated Redox Status and Defense Responses in Arabidopsis[C][W][OA]

    No full text
    This study investigates the relationship between nitrogen metabolism and disease responses in Arabidopsis and shows that knockout of Arabidopsis LHT1, a single amino acid transporter, imparts broad-spectrum resistance to pathogens

    Neural network assisted high-spatial-resolution polarimetry with non-interleaved chiral metasurfaces

    No full text
    Abstract Polarimetry plays an indispensable role in modern optics. Nevertheless, the current strategies generally suffer from bulky system volume or spatial multiplexing scheme, resulting in limited performances when dealing with inhomogeneous polarizations. Here, we propose a non-interleaved, interferometric method to analyze the polarizations based on a tri-channel chiral metasurface. A deep convolutional neural network is also incorporated to enable fast, robust and accurate polarimetry. Spatially uniform and nonuniform polarizations are both measured through the metasurface experimentally. Distinction between two semblable glasses is also demonstrated. Our strategy features the merits of compactness and high spatial resolution, and would inspire more intriguing design for detecting and sensing

    Bound-extended mode transition in type-II synthetic photonic Weyl heterostructures

    No full text
    Photonic structures with Weyl points (WPs), including type-I and type-II, promise nontrivial surface modes and intriguing light manipulations for their three-dimensional topological bands. While previous studies mainly focus on exploring WPs in a uniform Weyl structure, here we establish Weyl heterostructures (i.e., a nonuniform Weyl lattice) with different rotational orientations in the synthetic dimension by nanostructured photonic waveguides. In this work, we unveil a transition between bound and extended modes on the interface of type-II Weyl heterostructures by tuning their rotational phases, despite the reversed topological order across the interface. This mode transition is also manifested from the total transmission to total reflection at the interface. All of these unconventional effects are attributed to the tilted dispersion of type-II Weyl band structure that can lead to mismatched bands and gaps across the interface. As a comparison, the type-I Weyl heterostructures lack the phase transition due to the untilted band structure. This work establishes a flexible scheme of artificial Weyl heterostructures that opens a new avenue towards high-dimensional topological effects and significantly enhances our capabilities in on-chip light manipulations
    corecore