Search CORE

5,799 research outputs found

Scalable Population Synthesis with Deep Generative Modeling

Author: Borysov Stanislav S.
Pereira Francisco C.
Rich Jeppe
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Population synthesis is concerned with the generation of synthetic yet realistic representations of populations. It is a fundamental problem in the modeling of transport where the synthetic populations of micro-agents represent a key input to most agent-based models. In this paper, a new methodological framework for how to 'grow' pools of micro-agents is presented. The model framework adopts a deep generative modeling approach from machine learning based on a Variational Autoencoder (VAE). Compared to the previous population synthesis approaches, including Iterative Proportional Fitting (IPF), Gibbs sampling and traditional generative models such as Bayesian Networks or Hidden Markov Models, the proposed method allows fitting the full joint distribution for high dimensions. The proposed methodology is compared with a conventional Gibbs sampler and a Bayesian Network by using a large-scale Danish trip diary. It is shown that, while these two methods outperform the VAE in the low-dimensional case, they both suffer from scalability issues when the number of modeled attributes increases. It is also shown that the Gibbs sampler essentially replicates the agents from the original sample when the required conditional distributions are estimated as frequency tables. In contrast, the VAE allows addressing the problem of sampling zeros by generating agents that are virtually different from those in the original data but have similar statistical properties. The presented approach can support agent-based modeling at all levels by enabling richer synthetic populations with smaller zones and more detailed individual characteristics.Comment: 27 pages, 15 figures, 4 table

arXiv.org e-Print Archive

Online Research Database In Technology

Deep generative modeling for single-cell transcriptomics.

Author: A Regev
A Tanay
A Wagner
A Zeisel
AP Patel
B Wang
BK Tusi
CA Vallejos
CA Vallejos
D DeTomaso
D Grün
D Risso
DM Blei
E Pierson
FA Wolf
G Finak
G Görgün
GXY Zheng
HI Nakaya
J Ding
J Fan
Jeffrey Regier
JT Gaublomme
K Shekhar
L Haghverdi
L Held
M Stoeckius
MD Robinson
MI Love
Michael B. Cole
Michael I. Jordan
Nir Yosef
PV Kharchenko
Q Li
RE Kass
Romain Lopez
S Prabhakaran
S Semrau
U Shaham
WE Johnson
Publication venue: eScholarship, University of California
Publication date: 01/12/2018
Field of study

Single-cell transcriptome measurements can reveal unexplored biological diversity, but they suffer from technical noise and bias that must be modeled to account for the resulting uncertainty in downstream analyses. Here we introduce single-cell variational inference (scVI), a ready-to-use scalable framework for the probabilistic representation and analysis of gene expression in single cells ( https://github.com/YosefLab/scVI ). scVI uses stochastic optimization and deep neural networks to aggregate information across similar cells and genes and to approximate the distributions that underlie observed expression values, while accounting for batch effects and limited sensitivity. We used scVI for a range of fundamental analysis tasks including batch correction, visualization, clustering, and differential expression, and achieved high accuracy for each task

Crossref

eScholarship - University of California

Scalable Text and Link Analysis with Mixed-Topic Link Models

Author: Chang J.
Cohn D.
Cohn D.
Getoor L.
Gopalan P.
Gruber A.
Lu Q.
Lu Q.
Yang T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 28/03/2013
Field of study

Many data sets contain rich information about objects, as well as pairwise relations between them. For instance, in networks of websites, scientific papers, and other documents, each node has content consisting of a collection of words, as well as hyperlinks or citations to other nodes. In order to perform inference on such data sets, and make predictions and recommendations, it is useful to have models that are able to capture the processes which generate the text at each node and the links between them. In this paper, we combine classic ideas in topic modeling with a variant of the mixed-membership block model recently developed in the statistical physics community. The resulting model has the advantage that its parameters, including the mixture of topics of each document and the resulting overlapping communities, can be inferred with a simple and scalable expectation-maximization algorithm. We test our model on three data sets, performing unsupervised topic classification and link prediction. For both tasks, our model outperforms several existing state-of-the-art methods, achieving higher accuracy with significantly less computation, analyzing a data set with 1.3 million words and 44 thousand links in a few minutes.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref