157 research outputs found
Balancing reaction-diffusion network for cell polarization pattern with stability and asymmetry
Cell polarization is a critical process that separates molecules into two
distinct regions in prokaryotic and eukaryotic cells, guiding biological
processes such as cell division and cell differentiation. Although several
underlying antagonistic reaction-diffusion networks capable of setting up cell
polarization have been identified experimentally and theoretically, our
understanding of how to manipulate pattern stability and asymmetry remains
incomplete, especially when only a subset of network components are known. Here
we present numerical results to show that the polarized pattern of an
antagonistic 2-node network collapses into a homogeneous state when subjected
to single-sided self-regulation, single-sided additional regulation, or unequal
system parameters. However, polarity can be restored through a combination of
two modifications that have opposing effects. Additionally, spatially
inhomogeneous parameters favoring respective domains stabilize their interface
at designated locations. To connect our findings to cell polarity studies of
the nematode Caenorhabditis elegans zygote, we reconstituted a 5-node network
where a 4-node circuit with full mutual inhibitions between anterior and
posterior is modified by a mutual activation in the anterior and an additional
mutual inhibition between the anterior and the posterior. Once again, a generic
set of kinetic parameters moves the interface towards either the anterior or
posterior end, yet a polarized pattern can be stabilized through spatial tuning
of one or more parameters coupled to intracellular or extracellular cues. A
user-friendly software, PolarSim, is introduced to facilitate the exploration
of networks with alternative node numbers, parameter values, and regulatory
pathways
InvestLM: A Large Language Model for Investment using Financial Domain Instruction Tuning
We present a new financial domain large language model, InvestLM, tuned on
LLaMA-65B (Touvron et al., 2023), using a carefully curated instruction dataset
related to financial investment. Inspired by less-is-more-for-alignment (Zhou
et al., 2023), we manually curate a small yet diverse instruction dataset,
covering a wide range of financial related topics, from Chartered Financial
Analyst (CFA) exam questions to SEC filings to Stackexchange quantitative
finance discussions. InvestLM shows strong capabilities in understanding
financial text and provides helpful responses to investment related questions.
Financial experts, including hedge fund managers and research analysts, rate
InvestLM's response as comparable to those of state-of-the-art commercial
models (GPT-3.5, GPT-4 and Claude-2). Zero-shot evaluation on a set of
financial NLP benchmarks demonstrates strong generalizability. From a research
perspective, this work suggests that a high-quality domain specific LLM can be
tuned using a small set of carefully curated instructions on a well-trained
foundation model, which is consistent with the Superficial Alignment Hypothesis
(Zhou et al., 2023). From a practical perspective, this work develops a
state-of-the-art financial domain LLM with superior capability in understanding
financial texts and providing helpful investment advice, potentially enhancing
the work efficiency of financial professionals. We release the model parameters
to the research community.Comment: Link: https://github.com/AbaciNLP/InvestL
The Lifecycle and Cascade of WeChat Social Messaging Groups
Social instant messaging services are emerging as a transformative form with
which people connect, communicate with friends in their daily life - they
catalyze the formation of social groups, and they bring people stronger sense
of community and connection. However, research community still knows little
about the formation and evolution of groups in the context of social messaging
- their lifecycles, the change in their underlying structures over time, and
the diffusion processes by which they develop new members. In this paper, we
analyze the daily usage logs from WeChat group messaging platform - the largest
standalone messaging communication service in China - with the goal of
understanding the processes by which social messaging groups come together,
grow new members, and evolve over time. Specifically, we discover a strong
dichotomy among groups in terms of their lifecycle, and develop a separability
model by taking into account a broad range of group-level features, showing
that long-term and short-term groups are inherently distinct. We also found
that the lifecycle of messaging groups is largely dependent on their social
roles and functions in users' daily social experiences and specific purposes.
Given the strong separability between the long-term and short-term groups, we
further address the problem concerning the early prediction of successful
communities. In addition to modeling the growth and evolution from group-level
perspective, we investigate the individual-level attributes of group members
and study the diffusion process by which groups gain new members. By
considering members' historical engagement behavior as well as the local social
network structure that they embedded in, we develop a membership cascade model
and demonstrate the effectiveness by achieving AUC of 95.31% in predicting
inviter, and an AUC of 98.66% in predicting invitee.Comment: 10 pages, 8 figures, to appear in proceedings of the 25th
International World Wide Web Conference (WWW 2016
FinEntity: Entity-level Sentiment Classification for Financial Texts
In the financial domain, conducting entity-level sentiment analysis is
crucial for accurately assessing the sentiment directed toward a specific
financial entity. To our knowledge, no publicly available dataset currently
exists for this purpose. In this work, we introduce an entity-level sentiment
classification dataset, called \textbf{FinEntity}, that annotates financial
entity spans and their sentiment (positive, neutral, and negative) in financial
news. We document the dataset construction process in the paper. Additionally,
we benchmark several pre-trained models (BERT, FinBERT, etc.) and ChatGPT on
entity-level sentiment classification. In a case study, we demonstrate the
practical utility of using FinEntity in monitoring cryptocurrency markets. The
data and code of FinEntity is available at
\url{https://github.com/yixuantt/FinEntity}Comment: EMNLP'23 Main Conference Short Pape
Using Medical Claims Database to Develop a Population Disease Progression Model for Leuprorelin-Treated Subjects with Hormone-Sensitive Prostate Cancer
Androgen deprivation therapy (ADT) is a widely used treatment for patients with hormone-sensitive prostate cancer (PCa). However, duration of treatment response varies, and most patients eventually experience disease progression despite treatment. Leuprorelin is a luteinizing hormone-releasing hormone (LHRH) agonist, a commonly used form of ADT. Prostate-specific antigen (PSA) is a biomarker for monitoring disease progression and predicting treatment response and survival in PCa. However, time-dependent profile of tumor regression and growth in patients with hormone-sensitive PCa on ADT has never been fully characterized. In this analysis, nationwide medical claims database provided by Humana from 2007 to 2011 was used to construct a population-based disease progression model for patients with hormone-sensitive PCa on leuprorelin. Data were analyzed by nonlinear mixed effects modeling utilizing Monte Carlo Parametric Expectation Maximization (MCPEM) method in NONMEM. Covariate selection was performed using a modified Wald’s approximation method with backward elimination (WAM-BE) proposed by our group. 1113 PSA observations from 264 subjects with malignant PCa were used for model development. PSA kinetics were well described by the final covariate model. Model parameters were well estimated, but large between-patient variability was observed. Hemoglobin significantly affected proportion of drug-resistant cells in the original tumor, while baseline PSA and antiandrogen use significantly affected treatment effect on drug-sensitive PCa cells (Ds). Population estimate of Ds was 3.78 x 10−2 day-1. Population estimates of growth rates for drug-sensitive (Gs) and drug-resistant PCa cells (GR) were 1.96 x 10−3 and 6.54 x 10−4 day-1, corresponding to a PSA doubling time of 354 and 1060 days, respectively. Proportion of the original PCa cells inherently resistant to treatment was estimated to be 1.94%. Application of population-based disease progression model to clinical data allowed characterization of tumor resistant patterns and growth/regression rates that enhances our understanding of how PCa responds to ADT
Exploring the Relationship between In-Context Learning and Instruction Tuning
In-Context Learning (ICL) and Instruction Tuning (IT) are two primary
paradigms of adopting Large Language Models (LLMs) to downstream applications.
However, they are significantly different. In ICL, a set of demonstrations are
provided at inference time but the LLM's parameters are not updated. In IT, a
set of demonstrations are used to tune LLM's parameters in training time but no
demonstrations are used at inference time. Although a growing body of
literature has explored ICL and IT, studies on these topics have largely been
conducted in isolation, leading to a disconnect between these two paradigms. In
this work, we explore the relationship between ICL and IT by examining how the
hidden states of LLMs change in these two paradigms. Through carefully designed
experiments conducted with LLaMA-2 (7B and 13B), we find that ICL is implicit
IT. In other words, ICL changes an LLM's hidden states as if the demonstrations
were used to instructionally tune the model. Furthermore, the convergence
between ICL and IT is largely contingent upon several factors related to the
provided demonstrations. Overall, this work offers a unique perspective to
explore the connection between ICL and IT and sheds light on understanding the
behaviors of LLM
On the Perturbation Function of Ranking and Balance for Weighted Online Bipartite Matching
Ranking and Balance are arguably the two most important algorithms in the online matching literature. They achieve the same optimal competitive ratio of 1-1/e for the integral version and fractional version of online bipartite matching by Karp, Vazirani, and Vazirani (STOC 1990) respectively. The two algorithms have been generalized to weighted online bipartite matching problems, including vertex-weighted online bipartite matching and AdWords, by utilizing a perturbation function. The canonical choice of the perturbation function is f(x) = 1-e^{x-1} as it leads to the optimal competitive ratio of 1-1/e in both settings.
We advance the understanding of the weighted generalizations of Ranking and Balance in this paper, with a focus on studying the effect of different perturbation functions. First, we prove that the canonical perturbation function is the unique optimal perturbation function for vertex-weighted online bipartite matching. In stark contrast, all perturbation functions achieve the optimal competitive ratio of 1-1/e in the unweighted setting. Second, we prove that the generalization of Ranking to AdWords with unknown budgets using the canonical perturbation function is at most 0.624 competitive, refuting a conjecture of Vazirani (2021). More generally, as an application of the first result, we prove that no perturbation function leads to the prominent competitive ratio of 1-1/e by establishing an upper bound of 1-1/e-0.0003. Finally, we propose the online budget-additive welfare maximization problem that is intermediate between AdWords and AdWords with unknown budgets, and we design an optimal 1-1/e competitive algorithm by generalizing Balance
- …