PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition
Multi-label image recognition is a fundamental task in computer vision.
Recently, vision-language models have made notable advancements in this area.
However, previous methods often failed to effectively leverage the rich
knowledge within language models and instead incorporated label semantics into
visual features in a unidirectional manner. In this paper, we propose a
Prompt-driven Visual-Linguistic Representation Learning (PVLR) framework to
better leverage the capabilities of the linguistic modality. In PVLR, we first
introduce a dual-prompting strategy comprising Knowledge-Aware Prompting (KAP)
and Context-Aware Prompting (CAP). KAP utilizes fixed prompts to capture the
intrinsic semantic knowledge and relationships across all labels, while CAP
employs learnable prompts to capture context-aware label semantics and
relationships. Next, we propose an Interaction and Fusion Module (IFM) that
lets the representations obtained from KAP and CAP interact and fuses them. In contrast to
the unidirectional fusion in previous works, we introduce a Dual-Modal
Attention (DMA) that enables bidirectional interaction between textual and
visual features, yielding context-aware label representations and
semantic-related visual representations, which are subsequently used to
calculate similarities and generate final predictions for all labels. Extensive
experiments on three popular datasets including MS-COCO, Pascal VOC 2007, and
NUS-WIDE demonstrate the superiority of PVLR.
Comment: 15 pages, 8 figures
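The bidirectional interaction described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the single-head attention without learned projections, the residual connection, the max-pooling over patches, and all names (`cross_attend`, `bidirectional_scores`, `label_feats`, `patch_feats`) are assumptions made for illustration. It only shows the general idea of letting label (text) features attend to image patches, letting patches attend to labels, and then scoring each label by cosine similarity.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attend(queries, keys_values):
    # Single-head scaled dot-product cross-attention with a residual
    # connection. No learned projections are used (an assumption made
    # to keep the sketch short).
    d = queries.shape[-1]
    weights = softmax(queries @ keys_values.T / np.sqrt(d), axis=-1)
    return queries + weights @ keys_values


def l2_normalize(x, eps=1e-8):
    # Scale each row to (approximately) unit length.
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def bidirectional_scores(label_feats, patch_feats):
    # Text -> visual: each label embedding attends over image patches,
    # giving context-aware label representations of shape (L, d).
    labels_ctx = cross_attend(label_feats, patch_feats)
    # Visual -> text: each patch attends over label embeddings,
    # giving semantic-related visual representations of shape (P, d).
    patches_sem = cross_attend(patch_feats, label_feats)
    # Cosine similarity between every label and every patch, shape (L, P).
    sim = l2_normalize(labels_ctx) @ l2_normalize(patches_sem).T
    # Hypothetical pooling choice: score each label by its best patch.
    return sim.max(axis=1)


rng = np.random.default_rng(0)
label_feats = rng.normal(size=(5, 16))   # 5 labels, 16-dim embeddings
patch_feats = rng.normal(size=(49, 16))  # 7x7 grid of patch features
scores = bidirectional_scores(label_feats, patch_feats)
print(scores.shape)
```

In a trained model the scores would typically be passed through a learned scaling and a sigmoid to obtain per-label probabilities; that step is omitted here because the abstract does not specify it.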
Benefits of developing mental hospital with the mode of "combined psychiatry and comprehensive medical"
The outlook for specialized psychiatric hospitals is not encouraging. Developing such hospitals under the "combined psychiatry and comprehensive medical" model can raise the standard of hospital management, improve finances, remedy shortages of equipment and skilled staff, strengthen the capacity to treat critically and severely ill patients, raise the level of research and teaching, and promote the hospital's overall development.
Bis(4-aminobenzenesulfonato-κN)diaquabis(dimethylformamide-κO)nickel(II) dihydrate
In the title compound, [Ni(C6H6NO3S)2(C3H7NO)2(H2O)2]·2H2O, the Ni(II) ion (site symmetry ) is coordinated by two –NH2 groups from two 4-aminobenzenesulfonate anions, two O atoms from two dimethylformamide molecules and two water molecules, forming a slightly distorted trans-NiN2O4 octahedral geometry. In the crystal structure, intermolecular O—H⋯O, O—H⋯(O,O) and N—H⋯O hydrogen bonds link the components into a three-dimensional network. The O atoms of the sulfonate group are disordered over two sets of sites in a 0.833 (4):0.167 (4) ratio and the O atom of the uncoordinated water molecule is disordered over two sites in a 0.637 (18):0.363 (18) ratio.