2 research outputs found
VCD: Knowledge Base Guided Visual Commonsense Discovery in Images
Visual commonsense contains knowledge about object properties, relationships,
and behaviors in visual data. Discovering visual commonsense can provide a more
comprehensive and richer understanding of images, and enhance the reasoning and
decision-making capabilities of computer vision systems. However, the visual
commonsense defined in existing visual commonsense discovery studies is
coarse-grained and incomplete. In this work, we draw inspiration from a
commonsense knowledge base ConceptNet in natural language processing, and
systematically define the types of visual commonsense. Based on this, we
introduce a new task, Visual Commonsense Discovery (VCD), aiming to extract
fine-grained commonsense of different types contained within different objects
in the image. We accordingly construct a dataset (VCDD) from Visual Genome and
ConceptNet for VCD, featuring over 100,000 images and 14 million
object-commonsense pairs. We furthermore propose a generative model (VCDM) that
integrates a vision-language model with instruction tuning to tackle VCD.
Automatic and human evaluations demonstrate VCDM's proficiency in VCD,
particularly outperforming GPT-4V in implicit commonsense discovery. The value
of VCD is further demonstrated by its application to two downstream tasks,
including visual commonsense evaluation and visual question answering. The data
and code will be made available on GitHub
Elemental Geochemical Characterization of Sedimentary Conditions and Organic Matter Enrichment for Lower Cambrian Shale Formations in Northern Guizhou, South China
The black shale deposited in the Niutitang Formation and its adjacent strata is considered to be a favorable source rock in northern Guizhou of south China and has become a target horizon for shale gas exploration in recent years. Based on SQ-1 and CY-1 core samples, the organic matter properties and geochemical elements were obtained through experimental analysis. Provenance, paleoredox, paleoclimate, paleoproductivity and deposition conditions were analyzed, and the sedimentary effects on organic matter enrichment were discussed. The results show that total organic carbon (TOC) is between 0.22–10.10 wt.% in SQ-1, with an average of 2.60 wt.%, and TOC is between 0.23–7.7 wt.% in CY-1, with an average of 1.45 wt.%. The geochemical data of the samples indicate that the black shale of the Niutitang Formation and adjacent strata are deposited in the tectonic background of the passive continental margin. The provenance shows moderate weathering, with hot and humid paleoenvironmental characteristics and fast deposition rate. Using multiple ancient redox indicators, it is concluded that the formation has undergone changes in the oxidizing environment and anaerobic environment during deposition. According to the (La/Yb)N value (the average value of SQ-1 is 1.23 and the average value of CY-1 is 1.26), it shows a faster deposition rate of the two wells and shortens the residence time of organic matter in the microbial degradation zone. The Babio indicates that the bottom has a high paleoproductivity when deposited. Considering the influencing factors, the paleoproductivity mainly controls the organic matter enrichment, followed by ancient redox conditions and the deposition rate. The research results provide a reference for deepening sedimentary understanding and shale gas exploration in the study area