    maigesPack: A Computational Environment for Microarray Data Analysis

    Microarray technology is still an important way to assess gene expression in molecular biology, mainly because it measures expression profiles for thousands of genes simultaneously, which makes it a good option for studies focused on systems biology. One of its main problems is the complexity of the experimental procedure, which introduces several sources of variability and hinders statistical modeling. So far, there is no standard protocol for the generation and evaluation of microarray data. To streamline the analysis process, this paper presents an R package, named maigesPack, that helps with data organization. It also makes the data analysis process more robust, reliable, and reproducible. In addition, maigesPack aggregates several data analysis procedures reported in the literature, for instance: cluster analysis, differential expression, supervised classifiers, relevance networks, and functional classification of gene groups or gene networks.
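    As a rough illustration of one of the aggregated procedures (cluster analysis of expression profiles), here is a minimal generic sketch; maigesPack itself is an R package, so this Python/SciPy version is illustrative only and not its actual API:

        import numpy as np
        from scipy.cluster.hierarchy import linkage, fcluster

        # Hypothetical expression matrix: 500 genes measured on 12 arrays.
        expr = np.random.randn(500, 12)
        # Average-linkage hierarchical clustering with a correlation distance,
        # a common choice for grouping co-expressed genes.
        Z = linkage(expr, method="average", metric="correlation")
        groups = fcluster(Z, t=5, criterion="maxclust")  # cut into 5 gene clusters
        print(np.bincount(groups)[1:])                   # cluster sizes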

    EPiK: A Workflow for Electron Tomography in Kepler

    Scientific workflows integrate data and computing interfaces as configurable, semi-automatic graphs to solve a scientific problem. Kepler is such a software system for designing, executing, reusing, evolving, archiving, and sharing scientific workflows. Electron tomography (ET) enables high-resolution views of complex cellular structures, such as cytoskeletons, organelles, viruses, and chromosomes. Imaging investigations produce large datasets. For instance, in electron tomography, a 16-fold image tilt series is about 65 gigabytes, with each projection image comprising 4096 by 4096 pixels. When serial sections or montage techniques are used for large-field ET, the datasets are even larger. For higher-resolution images with multiple tilt series, the data size may be in the terabyte range. The demands of mass data processing and complex algorithms require the integration of diverse codes into flexible software structures. This paper describes a workflow for Electron Tomography Programs in Kepler (EPiK). The EPiK workflow embeds the tracking process of IMOD and realizes the main algorithms, including filtered backprojection (FBP) from TxBR and iterative reconstruction methods. We have tested the three-dimensional (3D) reconstruction process using EPiK on ET data. EPiK can be a potential toolkit for biology researchers, with the advantages of logical viewing, easy handling, convenient sharing, and future extensibility.
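    Filtered backprojection is named as one of the main reconstruction algorithms. A minimal 2D sketch of the generic FBP technique (not the TxBR or EPiK implementation) looks like this, assuming a sinogram of shape (n_angles, n_detectors):

        import numpy as np

        def ramp_filter(sinogram):
            # Apply an ideal ramp filter to each projection row in frequency space.
            n = sinogram.shape[1]
            ramp = np.abs(np.fft.fftfreq(n))
            return np.real(np.fft.ifft(np.fft.fft(sinogram, axis=1) * ramp, axis=1))

        def fbp(sinogram, angles_deg):
            # sinogram: (n_angles, n_detectors); returns a square reconstruction.
            n_det = sinogram.shape[1]
            filtered = ramp_filter(sinogram)
            ys, xs = np.mgrid[:n_det, :n_det] - n_det // 2
            recon = np.zeros((n_det, n_det))
            for row, theta in zip(filtered, np.deg2rad(angles_deg)):
                # Detector coordinate hit by each pixel for this viewing angle
                # (nearest-neighbor interpolation for brevity).
                t = np.round(xs * np.cos(theta) + ys * np.sin(theta)).astype(int) + n_det // 2
                recon += row[np.clip(t, 0, n_det - 1)]
            return recon * np.pi / (2 * len(angles_deg))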

    Supporting Data mining of large databases by visual feedback queries

    In this paper, we describe a query system that provides visual relevance feedback in querying large databases. Our goal is to support the process of data mining by representing as many data items as possible on the display. By arranging and coloring the data items as pixels according to their relevance for the query, the user gets a visual impression of the resulting data set. Using an interactive query interface, the user may change the query dynamically and receives immediate feedback through the visual representation of the resulting data set. Furthermore, by using multiple windows for different parts of a complex query, the user gets visual feedback for each part of the query and can therefore more easily understand the overall result. Our system can represent the largest amount of data that can be visualized on current display technology, provides valuable feedback in querying the database, and allows the user to find results that would otherwise remain hidden in the database.
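    The core idea, one pixel per data item colored by query relevance, can be sketched as below. This uses a simple row-major layout and an off-the-shelf colormap; the paper's actual pixel arrangement and coloring scheme may differ:

        import numpy as np
        import matplotlib.pyplot as plt

        def relevance_image(relevances, width=256):
            # One pixel per data item, most relevant items first (row-major order).
            vals = np.sort(np.asarray(relevances))[::-1]
            height = -(-len(vals) // width)          # ceiling division
            img = np.zeros(height * width)
            img[:len(vals)] = vals
            return img.reshape(height, width)

        scores = np.random.rand(100_000)             # hypothetical per-item relevance
        plt.imshow(relevance_image(scores), cmap="viridis", aspect="auto")
        plt.axis("off")
        plt.show()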

    The Luminosity Function of Galaxies in SDSS Commissioning Data

    During commissioning observations, the Sloan Digital Sky Survey (SDSS) has produced one of the largest existing galaxy redshift samples selected from CCD images. Using 11,275 galaxies complete to r^* = 17.6 over 140 square degrees, we compute the luminosity function of galaxies in the r^* band over a range -23 < M < -16 (for h=1). The result is well described by a Schechter function with parameters phi_* = 0.0146 +/- 0.0012 h^3 Mpc^{-3}, M_* = -20.83 +/- 0.03, and alpha = -1.20 +/- 0.03. The implied luminosity density in r^* is j = (2.6 +/- 0.3) x 10^8 h L_sun Mpc^{-3}. The surface brightness selection threshold has a negligible impact for M < -18. We measure the luminosity function in the u^*, g^*, i^*, and z^* bands as well; the slope at low luminosities ranges from alpha = -1.35 to alpha = -1.2. We measure the bivariate distribution of r^* luminosity with half-light surface brightness, intrinsic color, and morphology. High surface brightness, red, highly concentrated galaxies are on average more luminous than low surface brightness, blue, less concentrated galaxies. If we synthesize results for the R band or b_j band using the Petrosian magnitudes with which the SDSS measures galaxy fluxes, we obtain luminosity densities 2.0 times that found by the Las Campanas Redshift Survey in R and 1.4 times that found by the Two-degree Field Galaxy Redshift Survey in b_j. We are able to reproduce the luminosity functions obtained by these surveys if we also mimic their isophotal limits for defining galaxy magnitudes, which are shallower and more redshift dependent than the Petrosian magnitudes used by the SDSS. (Abridged)
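    For reference, the quoted parameters refer to the standard magnitude form of the Schechter function, with the luminosity density following from its first moment (these are the standard textbook expressions, not transcribed from the paper):

        \phi(M)\,dM = 0.4 \ln 10\; \phi_*
            \left[10^{0.4(M_*-M)}\right]^{\alpha+1}
            \exp\!\left[-10^{0.4(M_*-M)}\right] dM

        j = \int_0^\infty L\,\phi(L)\,dL = \phi_*\, L_*\, \Gamma(\alpha+2)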

    Speech Processing in Computer Vision Applications

    Deep learning has recently proven to be a viable asset in determining features in the field of speech analysis. Deep learning methods like convolutional neural networks facilitate the extraction of specific feature information from waveforms, allowing networks to create more feature-dense representations of data. Our work attempts to address the problems of re-creating a face from a speaker's voice and of speaker identification using deep learning methods. In this work, we first review the fundamental background in speech processing and its related applications. Then we introduce novel deep learning-based methods for speech feature analysis. Finally, we present our deep learning approaches to speaker identification and speech-to-face synthesis. The presented method can convert a speaker's audio sample into an image of their predicted face. The framework is composed of several chained networks, each performing an essential step in the conversion process: audio embedding, encoding, and face generation, respectively. Our experiments show that certain audio features can be mapped to the face, that DNNs can create a predicted face from a speaker's voice, and that a GUI can be used in conjunction to display a speaker recognition network's output.
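    A minimal sketch of such a chained structure, an audio embedder feeding a face generator, is shown below; all dimensions and layer choices here are hypothetical, since the abstract does not specify the architecture:

        import torch
        import torch.nn as nn

        class AudioEmbedder(nn.Module):
            def __init__(self, n_mels=80, embed_dim=256):
                super().__init__()
                self.net = nn.Sequential(
                    nn.Conv1d(n_mels, 128, kernel_size=5, padding=2), nn.ReLU(),
                    nn.AdaptiveAvgPool1d(1),               # pool over time
                )
                self.proj = nn.Linear(128, embed_dim)

            def forward(self, mel):                        # mel: (batch, n_mels, frames)
                return self.proj(self.net(mel).squeeze(-1))

        class FaceGenerator(nn.Module):
            def __init__(self, embed_dim=256):
                super().__init__()
                self.fc = nn.Linear(embed_dim, 128 * 8 * 8)
                self.deconv = nn.Sequential(
                    nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
                    nn.ConvTranspose2d(64, 3, 4, stride=2, padding=1), nn.Sigmoid(),
                )

            def forward(self, z):
                return self.deconv(self.fc(z).view(-1, 128, 8, 8))

        mel = torch.randn(4, 80, 300)      # a batch of 4 mel spectrograms
        faces = FaceGenerator()(AudioEmbedder()(mel))
        print(faces.shape)                 # torch.Size([4, 3, 32, 32])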

    A review of the literature on citation impact indicators

    Citation impact indicators nowadays play an important role in research evaluation, and consequently these indicators have received a lot of attention in the bibliometric and scientometric literature. This paper provides an in-depth review of the literature on citation impact indicators. First, an overview is given of the literature on bibliographic databases that can be used to calculate citation impact indicators (Web of Science, Scopus, and Google Scholar). Next, selected topics in the literature on citation impact indicators are reviewed in detail. The first topic is the selection of publications and citations to be included in the calculation of citation impact indicators. The second topic is the normalization of citation impact indicators, in particular normalization for field differences. Counting methods for dealing with co-authored publications are the third topic, and citation impact indicators for journals are the last topic. The paper concludes by offering some recommendations for future research.
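    As one concrete example of the normalization topic, a widely used field-normalized indicator in this literature, the mean normalized citation score, divides each publication's citation count c_i by the expected count e_i for its field, publication year, and document type, and averages over a unit's n publications:

        \mathrm{MNCS} = \frac{1}{n} \sum_{i=1}^{n} \frac{c_i}{e_i}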

    MIDAS, prototype Multivariate Interactive Digital Analysis System for large area earth resources surveys. Volume 1: System description

    A third-generation, fast, low-cost multispectral recognition system (MIDAS), able to keep pace with the large quantity and high rates of data acquisition from large regions with present and projected sensors, is described. The program can process a complete ERTS frame in forty seconds and provide a color map of sixteen constituent categories in a few minutes. A principal objective of the MIDAS program is to provide a system well interfaced with the human operator, thereby obtaining large overall reductions in turnaround time and significant gains in throughput. The hardware and software generated in the overall program are described. The system contains a midi-computer to control the various high-speed processing elements in the data path, a preprocessor to condition data, and a classifier that implements an all-digital prototype multivariate-Gaussian maximum-likelihood or Bayesian decision algorithm. Sufficient software was developed to perform signature extraction, control the preprocessor, compute classifier coefficients, control the classifier operation, operate the color display and printer, and diagnose operation.
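    The decision rule named here, multivariate-Gaussian maximum likelihood, can be sketched generically in a few lines; this is an illustration of the algorithm, not MIDAS's hardware implementation:

        import numpy as np

        def train(samples_by_class):
            # samples_by_class: list of (n_i, d) arrays of training pixels per category.
            stats = []
            for X in samples_by_class:
                mu = X.mean(axis=0)
                cov = np.cov(X, rowvar=False)
                stats.append((mu, np.linalg.inv(cov), np.linalg.slogdet(cov)[1]))
            return stats

        def classify(pixels, stats):
            # pixels: (n, d) multispectral vectors; returns the maximum-likelihood class.
            scores = []
            for mu, icov, logdet in stats:
                diff = pixels - mu
                maha = np.einsum("ij,jk,ik->i", diff, icov, diff)
                scores.append(-0.5 * (logdet + maha))  # log-likelihood up to a constant
            return np.argmax(np.stack(scores), axis=0)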