129 research outputs found
Distributionally Robust Ground Delay Programs with Learning-Driven Airport Capacity Predictions
Strategic Traffic Management Initiatives (TMIs) such as Ground Delay Programs
(GDPs) play a crucial role in mitigating operational costs associated with
demand-capacity imbalances. However, GDPs can only be planned (e.g., duration,
delay assignments) with confidence if the future capacities at constrained
resources (i.e., airports) are predictable. In reality, such future capacities
are uncertain, and predictive models may provide forecasts that are vulnerable
to errors and distribution shifts. Motivated by the goal of planning optimal
GDPs that are \emph{distributionally robust} against airport capacity
prediction errors, we study a fully integrated learning-driven optimization
framework. We design a deep learning-based prediction model capable of
forecasting arrival and departure capacity distributions across a network of
airports. We then integrate the forecasts into a distributionally robust
formulation of the multi-airport ground holding problem (\textsc{dr-MAGHP}). We
show how \textsc{dr-MAGHP} can outperform stochastic optimization when
distribution shifts occur, and conclude with future research directions to
improve both the learning and optimization stages.Comment: 8 pages, 6 figure
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision
Visual Grounding (VG) aims at localizing target objects from an image based
on given expressions and has made significant progress with the development of
detection and vision transformer. However, existing VG methods tend to generate
false-alarm objects when presented with inaccurate or irrelevant descriptions,
which commonly occur in practical applications. Moreover, existing methods fail
to capture fine-grained features, accurate localization, and sufficient context
comprehension from the whole image and textual descriptions. To address both
issues, we propose an Iterative Robust Visual Grounding (IR-VG) framework with
Masked Reference based Centerpoint Supervision (MRCS). The framework introduces
iterative multi-level vision-language fusion (IMVF) for better alignment. We
use MRCS to ahieve more accurate localization with point-wised feature
supervision. Then, to improve the robustness of VG, we also present a
multi-stage false-alarm sensitive decoder (MFSD) to prevent the generation of
false-alarm objects when presented with inaccurate expressions. The proposed
framework is evaluated on five regular VG datasets and two newly constructed
robust VG datasets. Extensive experiments demonstrate that IR-VG achieves new
state-of-the-art (SOTA) results, with improvements of 25\% and 10\% compared to
existing SOTA approaches on the two newly proposed robust VG datasets.
Moreover, the proposed framework is also verified effective on five regular VG
datasets. Codes and models will be publicly at
https://github.com/cv516Buaa/IR-VG
A Dataset And Benchmark Of Underwater Object Detection For Robot Picking
Underwater object detection for robot picking has attracted a lot of
interest. However, it is still an unsolved problem due to several challenges.
We take steps towards making it more realistic by addressing the following
challenges. Firstly, the currently available datasets basically lack the test
set annotations, causing researchers must compare their method with other SOTAs
on a self-divided test set (from the training set). Training other methods lead
to an increase in workload and different researchers divide different datasets,
resulting there is no unified benchmark to compare the performance of different
algorithms. Secondly, these datasets also have other shortcomings, e.g., too
many similar images or incomplete labels. Towards these challenges we introduce
a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark,
based on the collection and re-annotation of all relevant datasets. DUO
contains a collection of diverse underwater images with more rational
annotations. The corresponding benchmark provides indicators of both efficiency
and accuracy of SOTAs (under the MMDtection framework) for academic research
and industrial applications, where JETSON AGX XAVIER is used to assess detector
speed to simulate the robot-embedded environment
Fairness in Recommendation: Foundations, Methods and Applications
As one of the most pervasive applications of machine learning, recommender
systems are playing an important role on assisting human decision making. The
satisfaction of users and the interests of platforms are closely related to the
quality of the generated recommendation results. However, as a highly
data-driven system, recommender system could be affected by data or algorithmic
bias and thus generate unfair results, which could weaken the reliance of the
systems. As a result, it is crucial to address the potential unfairness
problems in recommendation settings. Recently, there has been growing attention
on fairness considerations in recommender systems with more and more literature
on approaches to promote fairness in recommendation. However, the studies are
rather fragmented and lack a systematic organization, thus making it difficult
to penetrate for new researchers to the domain. This motivates us to provide a
systematic survey of existing works on fairness in recommendation. This survey
focuses on the foundations for fairness in recommendation literature. It first
presents a brief introduction about fairness in basic machine learning tasks
such as classification and ranking in order to provide a general overview of
fairness research, as well as introduce the more complex situations and
challenges that need to be considered when studying fairness in recommender
systems. After that, the survey will introduce fairness in recommendation with
a focus on the taxonomies of current fairness definitions, the typical
techniques for improving fairness, as well as the datasets for fairness studies
in recommendation. The survey also talks about the challenges and opportunities
in fairness research with the hope of promoting the fair recommendation
research area and beyond.Comment: Accepted by ACM Transactions on Intelligent Systems and Technology
(TIST
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Open-vocabulary learning has emerged as a cutting-edge research area,
particularly in light of the widespread adoption of vision-based foundational
models. Its primary objective is to comprehend novel concepts that are not
encompassed within a predefined vocabulary. One key facet of this endeavor is
Visual Grounding, which entails locating a specific region within an image
based on a corresponding language description. While current foundational
models excel at various visual language tasks, there's a noticeable absence of
models specifically tailored for open-vocabulary visual grounding. This
research endeavor introduces novel and challenging OV tasks, namely
Open-Vocabulary Visual Grounding and Open-Vocabulary Phrase Localization. The
overarching aim is to establish connections between language descriptions and
the localization of novel objects. To facilitate this, we have curated a
comprehensive annotated benchmark, encompassing 7,272 OV-VG images and 1,000
OV-PL images. In our pursuit of addressing these challenges, we delved into
various baseline methodologies rooted in existing open-vocabulary object
detection, VG, and phrase localization frameworks. Surprisingly, we discovered
that state-of-the-art methods often falter in diverse scenarios. Consequently,
we developed a novel framework that integrates two critical components:
Text-Image Query Selection and Language-Guided Feature Attention. These modules
are designed to bolster the recognition of novel categories and enhance the
alignment between visual and linguistic information. Extensive experiments
demonstrate the efficacy of our proposed framework, which consistently attains
SOTA performance across the OV-VG task. Additionally, ablation studies provide
further evidence of the effectiveness of our innovative models. Codes and
datasets will be made publicly available at https://github.com/cv516Buaa/OV-VG
Spectrum Superposition Based Chromatic Dispersion Estimation for Digital Coherent Receivers
We propose and experimentally demonstrate a fast blind CD estimation method based on signal spectrum superposition. With only 4096 samples, a maximum estimation error of 0.25% of the accumulated CD for 7 x 112 Gbps DP-QPSK WDM signal is verified
One-Step Preparation of High Performance TiO 2 /CNT/CQD Nanocomposites Bactericidal Coating with Ultrasonic Radiation
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).As an environmental semiconductor material, TiO2 has important applications in the fields of environmental protection and water treatment. The preparation of P25 particles into nano-functional material films with a high specific surface area has always been a bottleneck limiting its large-scale application. In this paper, a one-step method of preparing TiO2 nanocomposites by doping carbon nanotube (CNT) and carbon quantum dots (CQD) with tetrabutyltitanate and P25 TiO2 under ultrasonic radiation is proposed to synthesize a novel antifouling material, which both eliminates the bacterium of Escherichia coli and shows good photoelectric properties, indicating a great value for the industrial promotion of TiO2/CNT. This mesoporous composite exhibits a high specific surface area of 78.07 M2/g (BET) and a tested pore width range within 10–120 nm. The surface morphology of this composite is characterized by TEM and the microstructure is characterized through XRD. This preparation method can fabricate P25 particles into a nano-functional material film with a high specific surface area at a very low cost.Peer reviewe
- …