457 research outputs found
Stochastic Primal-Dual Coordinate Method for Regularized Empirical Risk Minimization
We consider a generic convex optimization problem associated with regularized
empirical risk minimization of linear predictors. The problem structure allows
us to reformulate it as a convex-concave saddle point problem. We propose a
stochastic primal-dual coordinate (SPDC) method, which alternates between
maximizing over a randomly chosen dual variable and minimizing over the primal
variable. An extrapolation step on the primal variable is performed to obtain
accelerated convergence rate. We also develop a mini-batch version of the SPDC
method which facilitates parallel computing, and an extension with weighted
sampling probabilities on the dual variables, which has a better complexity
than uniform sampling on unnormalized data. Both theoretically and empirically,
we show that the SPDC method has comparable or better performance than several
state-of-the-art optimization methods
Auto Car Sales Prediction: A Statistical Study Using Functional Data Analysis and Time Series.
Honors (Bachelor's)StatisticsUniversity of Michiganhttp://deepblue.lib.umich.edu/bitstream/2027.42/112123/1/lycumich.pd
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
Fine-tuning pre-trained language models (PTLMs), such as BERT and its better
variant RoBERTa, has been a common practice for advancing performance in
natural language understanding (NLU) tasks. Recent advance in representation
learning shows that isotropic (i.e., unit-variance and uncorrelated) embeddings
can significantly improve performance on downstream tasks with faster
convergence and better generalization. The isotropy of the pre-trained
embeddings in PTLMs, however, is relatively under-explored. In this paper, we
analyze the isotropy of the pre-trained [CLS] embeddings of PTLMs with
straightforward visualization, and point out two major issues: high variance in
their standard deviation, and high correlation between different dimensions. We
also propose a new network regularization method, isotropic batch normalization
(IsoBN) to address the issues, towards learning more isotropic representations
in fine-tuning by dynamically penalizing dominating principal components. This
simple yet effective fine-tuning method yields about 1.0 absolute increment on
the average of seven NLU tasks.Comment: AAAI 202
Automatic Extraction of Commonsense LocatedNear Knowledge
LocatedNear relation is a kind of commonsense knowledge describing two
physical objects that are typically found near each other in real life. In this
paper, we study how to automatically extract such relationship through a
sentence-level relation classifier and aggregating the scores of entity pairs
from a large corpus. Also, we release two benchmark datasets for evaluation and
future research.Comment: Accepted by ACL 2018. A preliminary version is presented on
AKBC@NIPS'1
On Designing of a Low Leakage Patient-Centric Provider Network
When a patient in a provider network seeks services outside of their
community, the community experiences a leakage. Leakage is undesirable as it
typically leads to higher out-of-network cost for patient and increases barrier
for care coordination, which is particularly problematic for Accountable Care
Organization (ACO) as the in-network providers are financially responsible for
patient quality and outcome. We aim to design a data-driven method to identify
naturally occurring provider networks driven by diabetic patient choices, and
understand the relationship among provider composition, patient composition,
and service leakage pattern. We construct a healthcare provider network based
on patients' historical medical insurance claims. A community detection
algorithm is used to identify naturally occurring communities of collaborating
providers. Finally, import-export analysis is conducted to benchmark their
leakage pattern and identify further leakage reduction opportunity. The design
yields six major provider communities with diverse profiles. Some communities
are geographically concentrated, while others tend to draw patients with
certain diabetic co-morbidities. Providers from the same healthcare institution
are likely to be assigned to the same community. While most communities have
high within-community utilization and spending, at 85% and 86% respectively,
leakage still persists. Hence, we utilize a metric from import-export analysis
to detect leakage, gaining insight on how to minimizing leakage. In conclusion,
we identify patient-driven provider organization by surfacing providers who
share a large number of patients. By analyzing the import-export behavior of
each identified community using a novel approach and profiling community
patient and provider composition we understand the key features of having a
balanced number of PCP and specialists and provider heterogeneity
- …