44,305 research outputs found
Probabilistic performance estimators for computational chemistry methods: Systematic Improvement Probability and Ranking Probability Matrix. I. Theory
The comparison of benchmark error sets is an essential tool for the
evaluation of theories in computational chemistry. The standard ranking of
methods by their Mean Unsigned Error is unsatisfactory for several reasons
linked to the non-normality of the error distributions and the presence of
underlying trends. Complementary statistics have recently been proposed to
palliate such deficiencies, such as quantiles of the absolute errors
distribution or the mean prediction uncertainty. We introduce here a new score,
the systematic improvement probability (SIP), based on the direct system-wise
comparison of absolute errors. Independently of the chosen scoring rule, the
uncertainty of the statistics due to the incompleteness of the benchmark data
sets is also generally overlooked. However, this uncertainty is essential to
appreciate the robustness of rankings. In the present article, we develop two
indicators based on robust statistics to address this problem: P_{inv}, the
inversion probability between two values of a statistic, and \mathbf{P}_{r},
the ranking probability matrix. We demonstrate also the essential contribution
of the correlations between error sets in these scores comparisons
Tracking Human Mobility using WiFi signals
We study six months of human mobility data, including WiFi and GPS traces
recorded with high temporal resolution, and find that time series of WiFi scans
contain a strong latent location signal. In fact, due to inherent stability and
low entropy of human mobility, it is possible to assign location to WiFi access
points based on a very small number of GPS samples and then use these access
points as location beacons. Using just one GPS observation per day per person
allows us to estimate the location of, and subsequently use, WiFi access points
to account for 80\% of mobility across a population. These results reveal a
great opportunity for using ubiquitous WiFi routers for high-resolution outdoor
positioning, but also significant privacy implications of such side-channel
location tracking
Information-Theoretic Active Learning for Content-Based Image Retrieval
We propose Information-Theoretic Active Learning (ITAL), a novel batch-mode
active learning method for binary classification, and apply it for acquiring
meaningful user feedback in the context of content-based image retrieval.
Instead of combining different heuristics such as uncertainty, diversity, or
density, our method is based on maximizing the mutual information between the
predicted relevance of the images and the expected user feedback regarding the
selected batch. We propose suitable approximations to this computationally
demanding problem and also integrate an explicit model of user behavior that
accounts for possible incorrect labels and unnameable instances. Furthermore,
our approach does not only take the structure of the data but also the expected
model output change caused by the user feedback into account. In contrast to
other methods, ITAL turns out to be highly flexible and provides
state-of-the-art performance across various datasets, such as MIRFLICKR and
ImageNet.Comment: GCPR 2018 paper (14 pages text + 2 pages references + 6 pages
appendix
Безопасность и качество пищевых продуктов = Practical Food Safety and Food Quality : практикум
Даны описания практических и лабораторных работ, проводимых в рамках курсов «Международные стандарты и безопасность продуктов питания» и «Контроль качества биотехнологических продуктов». Задания сопровождаются подробными комментариями. Используются активные формы обучения, такие как работа в команде, игровые технологии и пр. Для иностранных и российских студентов, обучающихся на английском языке, изучающих пищевую биотехнологию, контроль качества и безопасность пищевых продуктов
Recommended from our members
PATTERNA: transcriptome-wide search for functional RNA elements via structural data signatures.
Establishing a link between RNA structure and function remains a great challenge in RNA biology. The emergence of high-throughput structure profiling experiments is revolutionizing our ability to decipher structure, yet principled approaches for extracting information on structural elements directly from these data sets are lacking. We present PATTERNA, an unsupervised pattern recognition algorithm that rapidly mines RNA structure motifs from profiling data. We demonstrate that PATTERNA detects motifs with an accuracy comparable to commonly used thermodynamic models and highlight its utility in automating data-directed structure modeling from large data sets. PATTERNA is versatile and compatible with diverse profiling techniques and experimental conditions
- …