32 research outputs found
Adaptive Tag Selection for Image Annotation
Not all tags are relevant to an image, and the number of relevant tags is
image-dependent. Although many methods have been proposed for image
auto-annotation, the question of how to determine the number of tags to be
selected per image remains open. The main challenge is that for a large tag
vocabulary, there is often a lack of ground truth data for acquiring optimal
cutoff thresholds per tag. In contrast to previous works that pre-specify the
number of tags to be selected, we propose in this paper adaptive tag selection.
The key insight is to divide the vocabulary into two disjoint subsets, namely a
seen set consisting of tags having ground truth available for optimizing their
thresholds and a novel set consisting of tags without any ground truth. Such a
division allows us to estimate how many tags shall be selected from the novel
set according to the tags that have been selected from the seen set. The
effectiveness of the proposed method is justified by our participation in the
ImageCLEF 2014 image annotation task. On a set of 2,065 test images with ground
truth available for 207 tags, the benchmark evaluation shows that compared to
the popular top- strategy which obtains an F-score of 0.122, adaptive tag
selection achieves a higher F-score of 0.223. Moreover, by treating the
underlying image annotation system as a black box, the new method can be used
as an easy plug-in to boost the performance of existing systems
Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining
In this paper, we develop a reinforcement learning (RL) based system to learn
an effective policy for carpooling that maximizes transportation efficiency so
that fewer cars are required to fulfill the given amount of trip demand. For
this purpose, first, we develop a deep neural network model, called ST-NN
(Spatio-Temporal Neural Network), to predict taxi trip time from the raw GPS
trip data. Secondly, we develop a carpooling simulation environment for RL
training, with the output of ST-NN and using the NYC taxi trip dataset. In
order to maximize transportation efficiency and minimize traffic congestion, we
choose the effective distance covered by the driver on a carpool trip as the
reward. Therefore, the more effective distance a driver achieves over a trip
(i.e. to satisfy more trip demand) the higher the efficiency and the less will
be the traffic congestion. We compared the performance of RL learned policy to
a fixed policy (which always accepts carpool) as a baseline and obtained
promising results that are interpretable and demonstrate the advantage of our
RL approach. We also compare the performance of ST-NN to that of
state-of-the-art travel time estimation methods and observe that ST-NN
significantly improves the prediction performance and is more robust to
outliers.Comment: Accepted at IEEE International Conference on Big Data 2018. arXiv
admin note: text overlap with arXiv:1710.0435
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms
How to optimally dispatch orders to vehicles and how to tradeoff between
immediate and future returns are fundamental questions for a typical
ride-hailing platform. We model ride-hailing as a large-scale parallel ranking
problem and study the joint decision-making task of order dispatching and fleet
management in online ride-hailing platforms. This task brings unique challenges
in the following four aspects. First, to facilitate a huge number of vehicles
to act and learn efficiently and robustly, we treat each region cell as an
agent and build a multi-agent reinforcement learning framework. Second, to
coordinate the agents from different regions to achieve long-term benefits, we
leverage the geographical hierarchy of the region grids to perform hierarchical
reinforcement learning. Third, to deal with the heterogeneous and variant
action space for joint order dispatching and fleet management, we design the
action as the ranking weight vector to rank and select the specific order or
the fleet management destination in a unified formulation. Fourth, to achieve
the multi-scale ride-hailing platform, we conduct the decision-making process
in a hierarchical way where a multi-head attention mechanism is utilized to
incorporate the impacts of neighbor agents and capture the key agent in each
scale. The whole novel framework is named as CoRide. Extensive experiments
based on multiple cities real-world data as well as analytic synthetic data
demonstrate that CoRide provides superior performance in terms of platform
revenue and user experience in the task of city-wide hybrid order dispatching
and fleet management over strong baselines.Comment: CIKM 201
Assessment of variation in immunosuppressive pathway genes reveals TGFBR2 to be associated with prognosis of estrogen receptor-negative breast cancer after chemotherapy
This is an Open Access article distributed under the terms of the Creative Commons
Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.Abstract
Introduction
Tumor lymphocyte infiltration is associated with clinical response to chemotherapy in estrogen receptor (ER) negative breast cancer. To identify variants in immunosuppressive pathway genes associated with prognosis after adjuvant chemotherapy for ER-negative patients, we studied stage I-III invasive breast cancer patients of European ancestry, including 9,334 ER-positive (3,151 treated with chemotherapy) and 2,334 ER-negative patients (1,499 treated with chemotherapy).
Methods
We pooled data from sixteen studies from the Breast Cancer Association Consortium (BCAC), and employed two independent studies for replications. Overall 3,610 single nucleotide polymorphisms (SNPs) in 133 genes were genotyped as part of the Collaborative Oncological Gene-environment Study, in which phenotype and clinical data were collected and harmonized. Multivariable Cox proportional hazard regression was used to assess genetic associations with overall survival (OS) and breast cancer-specific survival (BCSS). Heterogeneity according to chemotherapy or ER status was evaluated with the log-likelihood ratio test.
Results
Three independent SNPs in TGFBR2 and IL12B were associated with OS (P C) (per allele hazard ratio (HR) 1.54 (95% confidence interval (CI) 1.22 to 1.95), P = 3.08 × 10−4) was not found in ER-negative patients without chemotherapy or ER-positive patients with chemotherapy (P for interaction A) with poorer OS (HR 1.50 (95% CI 1.21 to 1.86), P = 1.81 × 10−4), and rs2853694 (A > C) with improved OS (HR 0.73 (95% CI 0.61 to 0.87), P = 3.67 × 10−4). Similar associations were observed with BCSS. Association with TGFBR2 rs1367610 but not IL12B variants replicated using BCAC Asian samples and the independent Prospective Study of Outcomes in Sporadic versus Hereditary Breast Cancer Study and yielded a combined HR of 1.57 ((95% CI 1.28 to 1.94), P = 2.05 × 10−5) without study heterogeneity.
Conclusions
TGFBR2 variants may have prognostic and predictive value in ER-negative breast cancer patients treated with adjuvant chemotherapy. Our findings provide further insights into the development of immunotherapeutic targets for ER-negative breast cancer
Genetic variation in the immunosuppression pathway genes and breast cancer susceptibility : a pooled analysis of 42,510 cases and 40,577 controls from the Breast Cancer Association Consortium
Immunosuppression plays a pivotal role in assisting tumors to evade immune destruction and promoting tumor development. We hypothesized that genetic variation in the immunosuppression pathway genes may be implicated in breast cancer tumorigenesis. We included 42,510 female breast cancer cases and 40,577 controls of European ancestry from 37 studies in the Breast Cancer Association Consortium (2015) with available genotype data for 3595 single nucleotide polymorphisms (SNPs) in 133 candidate genes. Associations between genotyped SNPs and overall breast cancer risk, and secondarily according to estrogen receptor (ER) status, were assessed using multiple logistic regression models. Gene-level associations were assessed based on principal component analysis. Gene expression analyses were conducted using RNA sequencing level 3 data from The Cancer Genome Atlas for 989 breast tumor samples and 113 matched normal tissue samples. SNP rs1905339 (A > G) in the STAT3 region was associated with an increased breast cancer risk (per allele odds ratio 1.05, 95 % confidence interval 1.03-1.08; p value = 1.4 x 10(-6)). The association did not differ significantly by ER status. On the gene level, in addition to TGFBR2 and CCND1, IL5 and GM-CSF showed the strongest associations with overall breast cancer risk (p value = 1.0 x 10(-3) and 7.0 x 10(-3), respectively). Furthermore, STAT3 and IL5 but not GM-CSF were differentially expressed between breast tumor tissue and normal tissue (p value = 2.5 x 10(-3), 4.5 x 10(-4) and 0.63, respectively). Our data provide evidence that the immunosuppression pathway genes STAT3, IL5, and GM-CSF may be novel susceptibility loci for breast cancer in women of European ancestry.Peer reviewe
Assessment of variation in immunosuppressive pathway genes reveals TGFBR2 to be associated with prognosis of estrogen receptor-negative breast cancer after chemotherapy
Abstract
Introduction
Tumor lymphocyte infiltration is associated with clinical response to chemotherapy in estrogen receptor (ER) negative breast cancer. To identify variants in immunosuppressive pathway genes associated with prognosis after adjuvant chemotherapy for ER-negative patients, we studied stage I-III invasive breast cancer patients of European ancestry, including 9,334 ER-positive (3,151 treated with chemotherapy) and 2,334 ER-negative patients (1,499 treated with chemotherapy).
Methods
We pooled data from sixteen studies from the Breast Cancer Association Consortium (BCAC), and employed two independent studies for replications. Overall 3,610 single nucleotide polymorphisms (SNPs) in 133 genes were genotyped as part of the Collaborative Oncological Gene-environment Study, in which phenotype and clinical data were collected and harmonized. Multivariable Cox proportional hazard regression was used to assess genetic associations with overall survival (OS) and breast cancer-specific survival (BCSS). Heterogeneity according to chemotherapy or ER status was evaluated with the log-likelihood ratio test.
Results
Three independent SNPs in TGFBR2 and IL12B were associated with OS (P C) (per allele hazard ratio (HR) 1.54 (95% confidence interval (CI) 1.22 to 1.95), P = 3.08 × 10−4) was not found in ER-negative patients without chemotherapy or ER-positive patients with chemotherapy (P for interaction A) with poorer OS (HR 1.50 (95% CI 1.21 to 1.86), P = 1.81 × 10−4), and rs2853694 (A > C) with improved OS (HR 0.73 (95% CI 0.61 to 0.87), P = 3.67 × 10−4). Similar associations were observed with BCSS. Association with TGFBR2 rs1367610 but not IL12B variants replicated using BCAC Asian samples and the independent Prospective Study of Outcomes in Sporadic versus Hereditary Breast Cancer Study and yielded a combined HR of 1.57 ((95% CI 1.28 to 1.94), P = 2.05 × 10−5) without study heterogeneity.
Conclusions
TGFBR2 variants may have prognostic and predictive value in ER-negative breast cancer patients treated with adjuvant chemotherapy. Our findings provide further insights into the development of immunotherapeutic targets for ER-negative breast cancer