178 research outputs found
Beyond Static Datasets: A Deep Interaction Approach to LLM Evaluation
Large Language Models (LLMs) have made progress in various real-world tasks,
which stimulates requirements for the evaluation of LLMs. Existing LLM
evaluation methods are mainly supervised signal-based which depends on static
datasets and cannot evaluate the ability of LLMs in dynamic real-world
scenarios where deep interaction widely exists. Other LLM evaluation methods
are human-based which are costly and time-consuming and are incapable of
large-scale evaluation of LLMs. To address the issues above, we propose a novel
Deep Interaction-based LLM-evaluation framework. In our proposed framework,
LLMs' performances in real-world domains can be evaluated from their deep
interaction with other LLMs in elaborately designed evaluation tasks.
Furthermore, our proposed framework is a general evaluation method that can be
applied to a host of real-world tasks such as machine translation and code
generation. We demonstrate the effectiveness of our proposed method through
extensive experiments on four elaborately designed evaluation tasks
Combating tracking drift : developing robust object tracking methods
University of Technology Sydney. Faculty of Engineering and Information Technology.Visual object tracking plays an important role in many computer vision applications, such as video surveillance, unmanned aerial vehicle image processing, human computer interaction and automatic control. This research aims to develop robust object tracking methods, which are capable of tracking general object without the prior knowledge of the target. Tracker drift is one of the most challenging issues in object tracking due to target deformations, illumination variations, abrupt motions, occlusions and background clutters. This thesis focuses on the tracking drift problem, and adopts three main solutions. These include: designing an efficient target shape feature extraction method, comparing target features with metric learning and using the ensemble tracking method to tackle the tracking drift during tracker online update
Deep Cooking: Predicting Relative Food Ingredient Amounts from Images
In this paper, we study the novel problem of not only predicting ingredients
from a food image, but also predicting the relative amounts of the detected
ingredients. We propose two prediction-based models using deep learning that
output sparse and dense predictions, coupled with important semi-automatic
multi-database integrative data pre-processing, to solve the problem.
Experiments on a dataset of recipes collected from the Internet show the models
generate encouraging experimental results
Algorithmic subsampling under multiway clustering
This paper proposes a novel method of algorithmic subsampling (data
sketching) for multiway cluster dependent data. We establish a new uniform weak
law of large numbers and a new central limit theorem for the multiway
algorithmic subsample means. Consequently, we discover an additional advantage
of the algorithmic subsampling that it allows for robustness against potential
degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution
under multiway clustering. Simulation studies support this novel result, and
demonstrate that inference with the algorithmic subsampling entails more
accuracy than that without the algorithmic subsampling. Applying these basic
asymptotic theories, we derive the consistency and the asymptotic normality for
the multiway algorithmic subsampling generalized method of moments estimator
and for the multiway algorithmic subsampling M-estimator. We illustrate an
application to scanner data
Fast Graph Condensation with Structure-based Neural Tangent Kernel
The rapid development of Internet technology has given rise to a vast amount
of graph-structured data. Graph Neural Networks (GNNs), as an effective method
for various graph mining tasks, incurs substantial computational resource costs
when dealing with large-scale graph data. A data-centric manner solution is
proposed to condense the large graph dataset into a smaller one without
sacrificing the predictive performance of GNNs. However, existing efforts
condense graph-structured data through a computational intensive bi-level
optimization architecture also suffer from massive computation costs. In this
paper, we propose reforming the graph condensation problem as a Kernel Ridge
Regression (KRR) task instead of iteratively training GNNs in the inner loop of
bi-level optimization. More specifically, We propose a novel dataset
condensation framework (GC-SNTK) for graph-structured data, where a
Structure-based Neural Tangent Kernel (SNTK) is developed to capture the
topology of graph and serves as the kernel function in KRR paradigm.
Comprehensive experiments demonstrate the effectiveness of our proposed model
in accelerating graph condensation while maintaining high prediction
performance. The source code is available on
https://github.com/WANGLin0126/GCSNTK.Comment: 10 pages, 6 figures, 5 table
Conflicts, Villains, Resolutions: Towards models of Narrative Media Framing
Despite increasing interest in the automatic detection of media frames in
NLP, the problem is typically simplified as single-label classification and
adopts a topic-like view on frames, evading modelling the broader
document-level narrative. In this work, we revisit a widely used
conceptualization of framing from the communication sciences which explicitly
captures elements of narratives, including conflict and its resolution, and
integrate it with the narrative framing of key entities in the story as heroes,
victims or villains. We adapt an effective annotation paradigm that breaks a
complex annotation task into a series of simpler binary questions, and present
an annotated data set of English news articles, and a case study on the framing
of climate change in articles from news outlets across the political spectrum.
Finally, we explore automatic multi-label prediction of our frames with
supervised and semi-supervised approaches, and present a novel retrieval-based
method which is both effective and transparent in its predictions. We conclude
with a discussion of opportunities and challenges for future work on
document-level models of narrative framing.Comment: To appear in ACL 2023 (main conference
Models of preconception care implementation in selected countries.
Globally, maternal and child health faces diverse challenges depending on the status of the development of the country. Some countries have introduced or explored preconception care for various reasons. Falling birth rates and increasing knowledge about risk factors for adverse pregnancy outcomes led to the introduction of preconception care in Hong Kong in 1998, and South Korea in 2004. In Hong Kong, comprehensive preconception care including laboratory tests are provided to over 4000 women each year at a cost of 12) for preconception health care services. These case studies illustrate programmatic feasibility of preconception care services to address maternal and child health and other public health challenges in developed and emerging economies
- …