26 research outputs found
Relation Rectification in Diffusion Model
Despite their exceptional generative abilities, large text-to-image diffusion
models, much like skilled but careless artists, often struggle with accurately
depicting visual relationships between objects. This issue, as we uncover
through careful analysis, arises from a misaligned text encoder that struggles
to interpret specific relationships and differentiate the logical order of
associated objects. To resolve this, we introduce a novel task termed Relation
Rectification, aiming to refine the model to accurately represent a given
relationship it initially fails to generate. To address this, we propose an
innovative solution utilizing a Heterogeneous Graph Convolutional Network
(HGCN). It models the directional relationships between relation terms and
corresponding objects within the input prompts. Specifically, we optimize the
HGCN on a pair of prompts with identical relational words but reversed object
orders, supplemented by a few reference images. The lightweight HGCN adjusts
the text embeddings generated by the text encoder, ensuring the accurate
reflection of the textual relation in the embedding space. Crucially, our
method retains the parameters of the text encoder and diffusion model,
preserving the model's robust performance on unrelated descriptions. We
validated our approach on a newly curated dataset of diverse relational data,
demonstrating both quantitative and qualitative enhancements in generating
images with precise visual relations. Project page:
https://wuyinwei-hah.github.io/rrnet.github.io/
MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding
Given an untrimmed video and natural language query, video sentence grounding
aims to localize the target temporal moment in the video. Existing methods
mainly tackle this task by matching and aligning semantics of the descriptive
sentence and video segments on a single temporal resolution, while neglecting
the temporal consistency of video content in different resolutions. In this
work, we propose a novel multi-resolution temporal video sentence grounding
network: MRTNet, which consists of a multi-modal feature encoder, a
Multi-Resolution Temporal (MRT) module, and a predictor module. MRT module is
an encoder-decoder network, and output features in the decoder part are in
conjunction with Transformers to predict the final start and end timestamps.
Particularly, our MRT module is hot-pluggable, which means it can be seamlessly
incorporated into any anchor-free models. Besides, we utilize a hybrid loss to
supervise cross-modal features in MRT module for more accurate grounding in
three scales: frame-level, clip-level and sequence-level. Extensive experiments
on three prevalent datasets have shown the effectiveness of MRTNet.Comment: work in progres
Methods to Measure the Network Path Connectivity
The functionalities, such as connectivity and communication capability of complex networks, are related to the number and length of paths between node pairs in the networks. In this paper, we propose a new path connectivity measure by considering the number and length of paths of the network (PCNL) to evaluate network path connectivity. By comparing the PCNL with the typical natural connectivity, we prove the effectiveness of the PCNL to measure the path connectivity of networks. Because of the importance of the shortest paths, we further propose the shortest paths connectivity measure (SPCNL) based on the number and length of the shortest paths. Then, we use edge-betweenness-based malicious attacks to study the relationship between the SPCNL and network topology in five types of networks. The results show that the SPCNLs of the networks have a significant corresponding relationship and similar changing trend with their network topology heterogeneities with the increase of the number of deleted edges. These findings mean that the SPCNL is positively correlated with the heterogeneity of the network topology, which provides a new perspective for designing complex networks with high path connectivity
Recommended from our members
Pressure-induced structural modulations in coesite
Silica phases, SiO2, have attracted significant attention as important phases in the fields of condensed matter physics, materials science, and (in view of their abundance in Earth’s crust) geoscience. Here, we experimentally and theoretically demonstrate that coesite undergoes structural modulations under high pressure. Coesite transforms to a distorted modulated structure, coesite-II, at 22-25 GPa with modulation wave vector q = 0.5b*. Coesite-II displays further commensurate modulation along the y-axis at 36-40 GPa and the long-range ordered crystalline structure collapses beyond ~40 GPa and starts amorphizing. First-principles calculations illuminate the nature of the modulated phase transitions of coesite and elucidate the modulated structures of coesite caused by modulations along y-axis direction. The structural modulations are demonstrated to result from phonon instability, preceding pressured-induced amorphization. The recovered sample after decompression develops a rim of crystalline coesite structure, but its interior remains low crystalline or partially amorphous. Our results not only clarify that the pressure-induced reversible phase transitions and amorphization in coesite originate from structural modulations along y-axis direction, but also shed light on the densification mechanism of silica under high pressure.NERC grant NE/P012167/
A High-Frequency Vibration Error Compensation Method for Terahertz SAR Imaging Based on Short-Time Fourier Transform
High-frequency vibration error of a moving radar platform easily introduces a non-negligible phase of periodic modulation in radar echoes and greatly degrades terahertz synthetic aperture radar (THz-SAR) image quality. For solving the problem of THz-SAR image-quality degradation, the paper proposes a multi-component high-frequency vibration error estimation and compensation approach based on the short-time Fourier transform (STFT). To improve the robustness of the method against noise effects, STFT is used to extract the instantaneous frequency (IF) of a high-frequency vibration error signal, and the vibration parameters are coarsely obtained by the least square (LS) method. To reduce the influence of the STFT window widths, a method based on the maximum likelihood function (MLF) is developed for determining the optimal window width by a one-dimensional search of the window widths. In the case of high noise, many IF estimation values seriously deviate from the true ones. To avoid the singular values of IF estimation in the LS regression, the random sample consensus (RANSAC) is introduced to improve estimation accuracy. Then, performing the STFT with the optimal window width, the accurate vibration parameters are estimated by LS regression, where the singular values of IF estimation are excluded. Finally, the vibration error is reconstructed to compensate for the non-negligible phase of the platform-induced periodic modulation. The simulation results prove that the error compensation method can meet THz-SAR imaging requirements, even at a low signal-to-noise ratio (SNR)
A Systematic Overview of Android Malware Detection
Due to the completely open-source nature of Android, the exploitable vulnerability of malware attacks is increasing. To stay ahead of other similar review work attempting to deal with the serious security problem of the Android environment, this work not only summarizes the approaches in the malware classification phase but also lays emphasis on the Android feature selection algorithm and presents some areas neglected in previous works in the field of Android malware detection, like limitations and commonly applied datasets in machine learning-based models. In this paper, the Android OS environment, feature selection, classification models, and confronted challenges of machine learning detection are described in detail. Based on the brief introduction to Android background knowledge, feature selection methods are elaborated from key perspectives as feature extraction, raw data preprocessing, valid feature subsets selection, and machine learning-based selection models. For the algorithms of the malware classification, machine learning methods are categorized according to different standards to present an all-around view. Furthermore, this paper focuses on the study of deterioration problems and evasion attacks in machine learning detectors
<i>Mecp2</i> Deficiency in Peripheral Sensory Neuron Improves Cognitive Function by Enhancing Hippocampal Dendritic Spine Densities in Mice
Methyl-CpG-binding protein 2 (Mecp2) is an epigenetic modulator and numerous studies have explored its impact on the central nervous system manifestations. However, little attention has been given to its potential contributions to the peripheral nervous system (PNS). To investigate the regulation of Mecp2 in the PNS on specific central regions, we generated Mecp2fl/flAdvillincre mice with the sensory-neuron-specific deletion of the Mecp2 gene and found the mutant mice had a heightened sensitivity to temperature, which, however, did not affect the sense of motion, social behaviors, and anxiety-like behavior. Notably, in comparison to Mecp2fl/fl mice, Mecp2fl/flAdvillincre mice exhibited improved learning and memory abilities. The levels of hippocampal synaptophysin and PSD95 proteins were higher in Mecp2fl/flAdvillincre mice than in Mecp2fl/fl mice. Golgi staining revealed a significant increase in total spine density, and dendritic arborization in the hippocampal pyramidal neurons of Mecp2fl/flAdvillincre mice compared to Mecp2fl/fl mice. In addition, the activation of the BDNF-TrkB-CREB1 pathway was observed in the hippocampus and spinal cord of Mecp2fl/flAdvillincre mice. Intriguingly, the hippocampal BDNF/CREB1 signaling pathway in mutant mice was initiated within 5 days after birth. Our findings suggest a potential therapeutic strategy targeting the BDNF-TrkB-CREB1 signaling pathway and peripheral somasensory neurons to treat learning and cognitive deficits associated with Mecp2 disorders
Establishment and Verification of the UAV Coupled Rotor Airflow Backward Tilt Angle Controller
At present, all the flight controllers of agricultural UAVs cannot accurately and quickly control the influencing factors of the UAV coupled rotor airflow backward tilt angle during the application process. To solve the above problem, a Rotor Airflow Backward Tilt Angle (RABTA) controller is established in this paper. The RABTA controller integrates advanced sensor technology with a novel algorithmic approach, utilizing real-time data acquisition and state–space analysis to dynamically adjust the UAV’s rotor airflow, ensuring precise control of the backward tilt angle. The control effect of the traditional flight controller and RABTA controller in the process of pesticide application and the corresponding operation effect are compared and analyzed. The comparison results show that the RABTA controller reduces the control error to less than 1 degree, achieving a 48.3% improvement in the uniformity of the distribution of pesticides droplets across the crop canopy, which means that the UAV field application effect is implemented and the innovation of the UAV field application control mode is realized
An outbreak of norovirus-associated acute gastroenteritis associated with contaminated barrelled water in many schools in Zhejiang, China.
OBJECTIVES:More than 900 students and teachers at many schools in Jiaxing city developed acute gastroenteritis in February 2014. An immediate epidemiological investigation was conducted to identify the pathogen, infection sources and route of transmission. METHODS:The probable cases and confirmed cases were defined as students or teachers with diarrhoea or vomiting present since the term began in February 2014. An active search was conducted for undiagnosed cases among students and teachers. Details such as demographic characteristics, gastrointestinal symptoms, and drinking water preference and frequency were collected via a uniform epidemiological questionnaire. A case-control study was implemented, and odds ratios (ORs) and 95% confidence intervals were calculated. Rectal swabs from several patients, food handlers and barrelled water factory workers, as well as water and food samples, were collected to test for potential bacteria and viruses. RESULTS:A total of 924 cases fit the definition of the probable case, including 8 cases of laboratory-confirmed norovirus infection at 13 schools in Jiaxing city between February 12 and February 21, 2014. The case-control study demonstrated that barrelled water was a risk factor (OR: 20.15, 95% CI: 2.59-156.76) and that bottled water and boiled barrelled water were protective factors (OR: 0.31, 95% CI: 0.13-0.70, and OR: 0.36, 95% CI: 0.16-0.77). A total of 11 rectal samples and 8 barrelled water samples were detected as norovirus-positive, and the genotypes of viral strains were the same (GII). The norovirus that contaminated the barrelled water largely came from the asymptomatic workers. CONCLUSIONS:This acute gastroenteritis outbreak was caused by barrelled water contaminated by norovirus. The outbreak was controlled after stopping the supply of barrelled water. The barrelled water supply in China represents a potential source of acute gastroenteritis outbreaks due to the lack of surveillance and supervision. Therefore, more attention should be paid to this area