586 research outputs found

Boosting accuracy of automated classification of fluorescence microscope images for location proteomics

Author: Huang Kai
Murphy Robert F
Publication venue: BioMed Central
Publication date: 01/01/2004

BACKGROUND: Detailed knowledge of the subcellular location of each expressed protein is critical to a full understanding of its function. Fluorescence microscopy, in combination with methods for fluorescent tagging, is the most suitable current method for proteome-wide determination of subcellular location. Previous work has shown that neural network classifiers can distinguish all major protein subcellular location patterns in both 2D and 3D fluorescence microscope images. Building on these results, we evaluate here new classifiers and features to improve the recognition of protein subcellular location patterns in both 2D and 3D fluorescence microscope images. RESULTS: We report here a thorough comparison of the performance on this problem of eight different state-of-the-art classification methods, including neural networks, support vector machines with linear, polynomial, radial basis, and exponential radial basis kernel functions, and ensemble methods such as AdaBoost, Bagging, and Mixtures-of-Experts. Ten-fold cross validation was used to evaluate each classifier with various parameters on different Subcellular Location Feature sets representing both 2D and 3D fluorescence microscope images, including new feature sets incorporating features derived from Gabor and Daubechies wavelet transforms. After optimal parameters were chosen for each of the eight classifiers, optimal majority-voting ensemble classifiers were formed for each feature set. Comparison of results for each image for all eight classifiers permits estimation of the lower bound classification error rate for each subcellular pattern, which we interpret to reflect the fraction of cells whose patterns are distorted by mitosis, cell death or acquisition errors. 
Overall, we obtained statistically significant improvements in classification accuracy over the best previously published results, with the overall error rate being reduced by one-third to one-half and with the average accuracy for single 2D images being higher than 90% for the first time. In particular, the classification accuracy for the easily confused endomembrane compartments (endoplasmic reticulum, Golgi, endosomes, lysosomes) was improved by 5–15%. We achieved further improvements when classification was conducted on image sets rather than on individual cell images. CONCLUSIONS: The availability of accurate, fast, automated classification systems for protein location patterns in conjunction with high throughput fluorescence microscope imaging techniques enables a new subfield of proteomics, location proteomics. The accuracy and sensitivity of this approach represent an important alternative to low-resolution assignments by curation or sequence-based prediction.
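The majority-voting step and the lower-bound error estimate described in this abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the tie-breaking rule, and the reading of the "lower bound" as the fraction of images that every classifier gets wrong are assumptions made for illustration.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine the labels that several classifiers assign to one image.

    Ties go to the label of the earliest classifier in the list
    (an arbitrary rule assumed for this sketch).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def ensemble_error_rate(per_classifier_predictions, true_labels):
    """Error rate of the majority-vote ensemble over a set of images."""
    errors = 0
    for i, truth in enumerate(true_labels):
        votes = [preds[i] for preds in per_classifier_predictions]
        if majority_vote(votes) != truth:
            errors += 1
    return errors / len(true_labels)


def lower_bound_error_rate(per_classifier_predictions, true_labels):
    """Fraction of images that no classifier labels correctly (assumed reading)."""
    missed_by_all = 0
    for i, truth in enumerate(true_labels):
        if all(preds[i] != truth for preds in per_classifier_predictions):
            missed_by_all += 1
    return missed_by_all / len(true_labels)


# Hypothetical predictions from three classifiers on four images.
predictions = [
    ["ER", "Golgi", "ER", "Lysosome"],
    ["ER", "Golgi", "Golgi", "Endosome"],
    ["Golgi", "Golgi", "ER", "Endosome"],
]
truth = ["ER", "Golgi", "ER", "Lysosome"]
print(ensemble_error_rate(predictions, truth))
print(lower_bound_error_rate(predictions, truth))
```

In this toy data the ensemble misses the fourth image even though one classifier labels it correctly, which is why the ensemble error and the lower-bound estimate can differ.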

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

An incremental approach to automated protein localisation

Author: Marko Tscherepanow
Nickels Jensen
Franz Kummert
Publication venue: BioMed Central
Publication date: 01/01/2008

Tscherepanow M, Jensen N, Kummert F. An incremental approach to automated protein localisation. BMC Bioinformatics. 2008;9(1):445. Background: The subcellular localisation of proteins in intact living cells is an important means for gaining information about protein functions. Even dynamic processes can be captured, which can barely be predicted based on amino acid sequences. Besides increasing our knowledge about intracellular processes, this information facilitates the development of innovative therapies and new diagnostic methods. In order to perform such a localisation, the proteins under analysis are usually fused with a fluorescent protein, so that they can be observed by means of a fluorescence microscope and analysed. In recent years, several automated methods have been proposed for performing such analyses. Here, two different types of approaches can be distinguished: techniques which enable the recognition of a fixed set of protein locations and methods that identify new ones. To our knowledge, a combination of both approaches – i.e. a technique that enables supervised learning using a known set of protein locations and is able to identify and incorporate new protein locations afterwards – has not been presented yet. Furthermore, associated problems, e.g. the recognition of cells to be analysed, have usually been neglected. Results: We introduce a novel approach to automated protein localisation in living cells. In contrast to well-known techniques, the protein localisation technique presented in this article aims at combining the two types of approaches described above: after an automatic identification of unknown protein locations, a potential user is enabled to incorporate them into the pre-trained system. An incremental neural network allows the classification of a fixed set of protein locations as well as the detection, clustering and incorporation of additional patterns that occur during an experiment. 
Here, the proposed technique achieves promising results with respect to both tasks. In addition, the protein localisation procedure has been adapted to an existing cell recognition approach. Therefore, it is especially well-suited for high-throughput investigations where user interactions have to be avoided. Conclusion: We have shown that several aspects required for developing an automatic protein localisation technique – namely the recognition of cells, the classification of protein distribution patterns into a set of learnt protein locations, and the detection and learning of new locations – can be combined successfully. So, the proposed method constitutes a crucial step to render image-based protein localisation techniques amenable to large-scale experiments.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Publications at Bielefeld University

Mining Images in Biomedical Publications: Detection and Analysis of Gel Diagrams

Author: Krauthammer Michael
Kuhn Tobias
Luong ThaiBinh
Nagy Mate Levente
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

Authors of biomedical publications use gel images to report experimental results such as protein-protein interactions or protein expressions under different conditions. Gel images offer a concise way to communicate such findings, not all of which need to be explicitly discussed in the article text. This fact together with the abundance of gel images and their shared common patterns makes them prime candidates for automated image mining and parsing. We introduce an approach for the detection of gel images, and present a workflow to analyze them. We are able to detect gel segments and panels at high accuracy, and present preliminary results for the identification of gene names in these images. While we cannot provide a complete solution at this point, we present evidence that this kind of image mining is feasible. Comment: arXiv admin note: substantial text overlap with arXiv:1209.148

arXiv.org e-Print Archive

Repository for Publications and Research Data

Springer - Publisher Connector

PubMed Central

A graphical model approach to automated classification of protein subcellular location patterns in multi-cell images

Author: Chen Shann-Ching
Murphy Robert F
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Knowledge of the subcellular location of a protein is critical to understanding how that protein works in a cell. This location is frequently determined by the interpretation of fluorescence microscope images. In recent years, automated systems have been developed for consistent and objective interpretation of such images so that the protein pattern in a single cell can be assigned to a known location category. While these systems perform with nearly perfect accuracy for single cell images of all major subcellular structures, their ability to distinguish subpatterns of an organelle (such as two Golgi proteins) is not perfect. Our goal in the work described here was to improve the ability of an automated system to decide which of two similar patterns is present in a field of cells by considering more than one cell at a time. Since cells displaying the same location pattern are often clustered together, considering multiple cells may be expected to improve discrimination between similar patterns. RESULTS: We describe how to take advantage of information on experimental conditions to construct a graphical representation for multiple cells in a field. Assuming that a field is composed of a small number of classes, the classification accuracy can be improved by allowing the computed probability of each pattern for each cell to be influenced by the probabilities of its neighboring cells in the model. We describe a novel way to allow this influence to occur, in which we adjust the prior probabilities of each class to reflect the patterns that are present. When this graphical model approach is used on synthetic multi-cell images in which the true class of each cell is known, we observe that the ability to distinguish similar classes is improved without suffering any degradation in ability to distinguish dissimilar classes. 
The computational complexity of the method is sufficiently low that improved assignments of classes can be obtained for fields of twelve cells in under 0.04 second on a 1600 megahertz processor. CONCLUSION: We demonstrate that graphical models can be used to improve the accuracy of classification of subcellular patterns in multi-cell fluorescence microscope images. We also describe a novel algorithm for inferring classes from a graphical model. The performance and speed suggest that the method will be particularly valuable for analysis of images from high-throughput microscopy. We also anticipate that it will be useful for analyzing the mixtures of cell types typically present in images of tissues. Lastly, we anticipate that the method can be generalized to other problems.
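The idea of letting each cell's class probabilities be influenced by its neighbors through adjusted priors can be sketched as follows. This is a simplified illustration, not the algorithm from the paper: the update rule (prior set to the mean posterior of the neighbors), the `smoothing` term, and all names are assumptions.

```python
def normalize(values):
    """Scale a list of non-negative numbers so that it sums to 1."""
    total = sum(values)
    return [v / total for v in values]


def classify_field(likelihoods, neighbors, iterations=10, smoothing=0.1):
    """Assign a class index to every cell in a field.

    likelihoods[i][c] is the per-cell evidence for class c (assumed positive).
    neighbors[i] lists the indices of the cells connected to cell i.
    Each round, the prior of a cell becomes the mean posterior of its
    neighbors, mixed with a uniform prior to keep every class possible.
    """
    n_cells = len(likelihoods)
    n_classes = len(likelihoods[0])
    uniform = 1.0 / n_classes
    priors = [[uniform] * n_classes for _ in range(n_cells)]
    posteriors = [normalize(cell) for cell in likelihoods]

    for _ in range(iterations):
        # Posterior is proportional to likelihood times the current prior.
        posteriors = [
            normalize([l * p for l, p in zip(likelihoods[i], priors[i])])
            for i in range(n_cells)
        ]
        new_priors = []
        for i in range(n_cells):
            if not neighbors[i]:
                new_priors.append(priors[i])  # isolated cell keeps its prior
                continue
            mean = [
                sum(posteriors[j][c] for j in neighbors[i]) / len(neighbors[i])
                for c in range(n_classes)
            ]
            new_priors.append(
                [(1 - smoothing) * m + smoothing * uniform for m in mean]
            )
        priors = new_priors

    return [max(range(n_classes), key=lambda c: post[c]) for post in posteriors]


# Three mutually neighboring cells; the third is ambiguous on its own.
evidence = [[0.9, 0.1], [0.8, 0.2], [0.45, 0.55]]
graph = [[1, 2], [0, 2], [0, 1]]
print(classify_field(evidence, graph, iterations=0))  # each cell on its own
print(classify_field(evidence, graph))                # with neighbor influence
```

With no iterations each cell is classified independently, so the ambiguous third cell is assigned to the second class; with neighbor influence it is pulled toward the class of its two confident neighbors.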

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Random subwindows and extremely randomized trees for image classification in cell biology

Author: Raphaël Marée
Pierre Geurts
Louis Wehenkel
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: With the improvements in biosensors and high-throughput image acquisition technologies, life science laboratories are able to perform an increasing number of experiments that involve the generation of a large number of images at different imaging modalities/scales. This stresses the need for computer vision methods that automate image classification tasks. Results: We illustrate the potential of our image classification method in cell biology by evaluating it on four datasets of images related to protein distributions or subcellular localizations, and red-blood cell shapes. Accuracy results are quite good without any specific pre-processing or incorporation of domain knowledge. The method is implemented in Java and available upon request for evaluation and research purposes. Conclusion: Our method is directly applicable to any image classification problem. We foresee the use of this automatic approach as a baseline method and first try on various biological image classification problems.

Crossref

Springer - Publisher Connector

PubMed Central

Open Repository and Bibliography - Liège

Phenotype Recognition with Combined Features and Random Subspace Classifier Ensemble

Author: Bailing Zhang
Tuan D Pham
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Automated, image-based high-content screening is a fundamental tool for discovery in biological science. Modern robotic fluorescence microscopes are able to capture thousands of images from massively parallel experiments such as RNA interference (RNAi) or small-molecule screens. As such, efficient computational methods are required for automatic cellular phenotype identification capable of dealing with large image data sets. In this paper we investigated an efficient method for the extraction of quantitative features from images by combining second-order statistics, or Haralick features, with the curvelet transform. A random subspace based classifier ensemble with a multilayer perceptron (MLP) as the base classifier was then exploited for classification. Haralick features estimate image properties related to second-order statistics based on the grey level co-occurrence matrix (GLCM), which has been extensively used for various image processing applications. The curvelet transform has a sparser representation of the image than the wavelet transform, thus offering a description with higher time-frequency resolution and a high degree of directionality and anisotropy, which is particularly appropriate for many images rich in edges and curves. A combined feature description from Haralick features and the curvelet transform can further increase the accuracy of classification by exploiting their complementary information. We then investigate the applicability of the random subspace (RS) ensemble method for phenotype classification based on microscopy images. A base classifier is trained with an RS-sampled subset of the original feature set and the ensemble assigns a class label by majority voting. Results: Experimental results on phenotype recognition from three benchmarking image sets, HeLa, CHO and RNAi, show the effectiveness of the proposed approach. The combined feature is better than any individual one in classification accuracy. 
The ensemble model produces better classification performance than the component neural networks. For the three image sets HeLa, CHO and RNAi, the random subspace ensemble achieves classification rates of 91.20%, 98.86% and 91.03%, respectively, which contrast sharply with the published results of 84%, 93% and 82% from the multi-purpose image classifier WND-CHARM, which applied wavelet transforms and other feature extraction methods. We investigated the problem of estimating the ensemble parameters and found that a satisfactory performance improvement could be obtained with feature subsets of relatively moderate dimensionality and a small ensemble size. Conclusions: The multiscale and multidirectional characteristics of the curvelet transform suit the description of microscopy images very well. It is empirically demonstrated that the curvelet-based feature is clearly preferred to the wavelet-based feature for bioimage descriptions. The random subspace ensemble of MLPs is much better than a number of commonly applied multi-class classifiers in the investigated application of phenotype recognition.
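To make the grey level co-occurrence matrix (GLCM) concrete, here is a minimal pure-Python sketch that builds a normalized GLCM for one pixel offset and derives two Haralick-style statistics from it (contrast and energy, also called angular second moment). The function names, the single non-symmetric offset, and the choice of these two statistics are assumptions for illustration; the paper's actual feature set is not reproduced here.

```python
def glcm(image, levels, dx=1, dy=0):
    """Normalized co-occurrence matrix for the pixel offset (dx, dy).

    image is a list of rows of integer grey levels in range(levels).
    Entry [i][j] is the fraction of pixel pairs whose first pixel has
    level i and whose offset partner has level j (non-symmetric count).
    """
    rows, cols = len(image), len(image[0])
    counts = [[0] * levels for _ in range(levels)]
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
                total += 1
    return [[v / total for v in row] for row in counts]


def contrast(p):
    """Sum of p[i][j] * (i - j)^2: large when neighboring levels differ."""
    n = len(p)
    return sum(p[i][j] * (i - j) ** 2 for i in range(n) for j in range(n))


def energy(p):
    """Sum of squared entries: large when few level pairs dominate."""
    return sum(v * v for row in p for v in row)


# Small 4-level test image with horizontal neighbor pairs.
sample = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
matrix = glcm(sample, levels=4)
print(contrast(matrix), energy(matrix))
```

A perfectly uniform image gives a contrast of 0 and an energy of 1, which is a quick sanity check for any GLCM implementation.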

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A multiresolution approach to automated classification of protein subcellular location images

Author: Amina Chebira
Yann Barbotin
Charles Jackson
Thomas Merryman
Gowri Srinivasa
Robert F Murphy
Jelena Kovačević
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: Fluorescence microscopy is widely used to determine the subcellular location of proteins. Efforts to determine location on a proteome-wide basis create a need for automated methods to analyze the resulting images. Over the past ten years, the feasibility of using machine learning methods to recognize all major subcellular location patterns has been convincingly demonstrated, using diverse feature sets and classifiers. On a well-studied data set of 2D HeLa single-cell images, the best performance to date, 91.5%, was obtained by including a set of multiresolution features. This demonstrates the value of multiresolution approaches to this important problem. Results: We report here a novel approach for the classification of subcellular location patterns by classifying in multiresolution subspaces. Our system is able to work with any feature set and any classifier. It consists of multiresolution (MR) decomposition, followed by feature computation and classification in each MR subspace, yielding local decisions that are then combined into a global decision. With 26 texture features alone and a neural network classifier, we obtained an increase in accuracy on the 2D HeLa data set to 95.3%. Conclusion: We demonstrate that the space-frequency localized information in the multiresolution subspaces adds significantly to the discriminative power of the system. Moreover, we show that a vastly reduced set of features is sufficient, consisting of our novel modified Haralick texture features. Our proposed system is general, allowing for any combinations of sets of features and any combination of classifiers.
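The pipeline described here (decompose into multiresolution subspaces, classify in each, then combine local decisions into a global one) can be sketched in a few lines of Python. The sketch assumes a single level of a Haar-style averaging and differencing decomposition and a weighted sum of per-subspace class probabilities; neither choice is taken from the paper, and all names are hypothetical.

```python
def haar_level(image):
    """Split an image into four half-resolution subbands over 2x2 blocks.

    Uses plain averaging and differencing (a Haar-style filter bank with
    averaging normalization). Odd trailing rows or columns are ignored.
    """
    bands = {"approx": [], "row_diff": [], "col_diff": [], "diag_diff": []}
    for r in range(0, len(image) - 1, 2):
        out = {name: [] for name in bands}
        for c in range(0, len(image[0]) - 1, 2):
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            out["approx"].append((a + b + d + e) / 4)     # local mean
            out["row_diff"].append((a + b - d - e) / 4)   # top minus bottom
            out["col_diff"].append((a - b + d - e) / 4)   # left minus right
            out["diag_diff"].append((a - b - d + e) / 4)  # diagonal detail
        for name in bands:
            bands[name].append(out[name])
    return bands


def combine_decisions(local_probs, weights=None):
    """Fuse per-subspace class probabilities into one global class index."""
    n = len(local_probs)
    if weights is None:
        weights = [1.0 / n] * n  # equal trust in every subspace
    n_classes = len(local_probs[0])
    scores = [
        sum(w * probs[c] for w, probs in zip(weights, local_probs))
        for c in range(n_classes)
    ]
    return max(range(n_classes), key=lambda c: scores[c])


subbands = haar_level([[1, 3], [5, 7]])
print(subbands)

# Hypothetical outputs of three per-subspace classifiers for two classes.
local = [[0.6, 0.4], [0.2, 0.8], [0.3, 0.7]]
print(combine_decisions(local))
print(combine_decisions(local, weights=[0.8, 0.1, 0.1]))
```

The weights passed to `combine_decisions` stand in for whatever scheme decides how much each subspace is trusted; the second call shows how a heavily weighted subspace can overturn the unweighted decision.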

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Chemical address tags of fluorescent bioimaging probes

Author: Abraham
Boland
Carpenter
Cervantes
Chen
Chen
Giuliano
Giuliano
Giuliano
Giuliano
Hansch
Hansch
Hu
Huang
Huang
Jones
Lee
Lee
Li
Ljosa
Loo
Mitchison
Murphy
Murphy
Perlman
Rosania
Shedden
Shedden
Shedden
Slack
Taylor
Wagner
Publication venue: Wiley
Publication date: 01/01/2010

Chemical address tags can be defined as specific structural features shared by a set of bioimaging probes having a predictable influence on cell-associated visual signals obtained from these probes. Here, using a large image dataset acquired with a high content screening instrument, machine vision and cheminformatics analysis have been applied to reveal chemical address tags. With a combinatorial library of fluorescent molecules, fluorescence signal intensity, spectral, and spatial features characterizing each one of the probes' visual signals were extracted from images acquired with the three different excitation and emission channels of the imaging instrument. With multivariate regression, the additive contribution from each one of the different building blocks of the bioimaging probes toward each measured, cell-associated image-based feature was calculated. In this manner, variations in the chemical features of the molecules were associated with the resulting staining patterns, facilitating quantitative, objective analysis of chemical address tags. Hierarchical clustering and paired image-cheminformatics analysis revealed key structure–property relationships amongst many building blocks of the fluorescent molecules. The results point to different chemical modifications of the bioimaging probes that can exert similar (or different) effects on the probes' visual signals. Inspection of the clustered structures suggests intramolecular charge migration or partial charge distribution as potential mechanistic determinants of chemical address tag behavior. © 2010 International Society for Advancement of Cytometry. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/71379/1/20847_ftp.pd

Crossref

PubMed Central

Deep Blue Documents at the University of Michigan