95 research outputs found

    Predicting Panel Ratings for Semantic Characteristics of Lung Nodules

    Get PDF
    In reading CT scans with potentially malignant lung nodules, radiologists make use of high level information (semantic characteristics) in their analysis. CAD systems can assist radiologists by offering a “second opinion” - predicting these semantic characteristics for lung nodules. In our previous work, we developed such a CAD system, training and testing it on the publicly available Lung Image Database Consortium (LIDC) dataset, which includes semantic annotations by up to four human radiologists for every nodule. However, due to the lack of ground truth and the uncertainty in the dataset, each nodule was viewed as four distinct instances when training the classifier. In this work, we propose a way of predicting the distribution of opinions of the four radiologists using a multiple-label classification algorithm based on belief decision trees. We evaluate our results using a distance-threshold curve and, measuring the area under this curve, obtain 69% accuracy on the testing subset. We conclude that multiple-label classification algorithms are an appropriate method of representing the diagnoses of multiple radiologists on lung CT scans when a single ground truth is not available

    A Run-Length Encoding Approach for Path Analysis of C. elegans

    Get PDF
    The nematode Caenorhabditis elegans explores the environment using a combination of different movement patterns, which include straight movement, reversal, and turns. We propose to quantify C. elegans movement behavior using a computer vision approach based on run-length encoding of step-length data. In this approach, the path of C. elegans is encoded as a string of characters, where each character represents a path segment of a specific type of movement. With these encoded string data, we perform k-means cluster analysis to distinguish movement behaviors resulting from different genotypes and food availability. We found that shallow and sharp turns are the most critical factors in distinguishing the differences among the movement behaviors. To validate our approach, we examined the movement behavior of tph-1 mutants that lack an enzyme responsible for serotonin biosynthesis. A k-means cluster analysis with the path string-encoded data showed that tph-1 movement behavior on food is similar to that of wild-type animals off food. We suggest that this run-length encoding approach is applicable to trajectory data in animal or human mobility data

    Presynaptic BK channel localization is dependent on the hierarchical organization of alpha-catulin and dystrobrevin and fine-tuned by CaV2 calcium channels

    Get PDF
    BACKGROUND: Large conductance, calcium-activated BK channels regulate many important physiological processes, including smooth muscle excitation, hormone release and synaptic transmission. The biological roles of these channels hinge on their unique ability to respond synergistically to both voltage and cytosolic calcium elevations. Because calcium influx is meticulously regulated both spatially and temporally, the localization of BK channels near calcium channels is critical for their proper function. However, the mechanism underlying BK channel localization near calcium channels is not fully understood. RESULTS: We show here that in C. elegans the localization of SLO-1/BK channels to presynaptic terminals, where UNC-2/CaV2 calcium channels regulate neurotransmitter release, is controlled by the hierarchical organization of CTN-1/alpha-catulin and DYB-1/dystrobrevin, two proteins that interact with cortical cytoskeletal proteins. CTN-1 organizes a macromolecular SLO-1 channel complex at presynaptic terminals by direct physical interaction. DYB-1 contributes to the maintenance or stabilization of the complex at presynaptic terminals by interacting with CTN-1. We also show that SLO-1 channels are functionally coupled with UNC-2 calcium channels, and that normal localization of SLO-1 to presynaptic terminals requires UNC-2. In the absence of UNC-2, SLO-1 clusters lose the localization specificity, thus accumulating inside and outside of presynaptic terminals. Moreover, CTN-1 is also similarly localized in unc-2 mutants, consistent with the direct interaction between CTN-1 and SLO-1. However, localization of UNC-2 at the presynaptic terminals is not dependent on either CTN-1 or SLO-1. Taken together, our data strongly suggest that the absence of UNC-2 indirectly influences SLO-1 localization via the reorganization of cytoskeletal proteins. CONCLUSION: CTN-1 and DYB-1, which interact with cortical cytoskeletal proteins, are required for the presynaptic punctate localization of SLO-1 in a hierarchical manner. In addition, UNC-2 calcium channels indirectly control the fidelity of SLO-1 puncta localization at presynaptic terminals. We suggest that the absence of UNC-2 leads to the reorganization of the cytoskeletal structure that includes CTN-1, which in turn influences SLO-1 puncta localization

    Biomedical heterogeneous data categorization and schema mapping toward data integration

    Get PDF
    Data integration is a well-motivated problem in the clinical data science domain. Availability of patient data, reference clinical cases, and datasets for research have the potential to advance the healthcare industry. However, the unstructured (text, audio, or video data) and heterogeneous nature of the data, the variety of data standards and formats, and patient privacy constraint make data interoperability and integration a challenge. The clinical text is further categorized into different semantic groups and may be stored in different files and formats. Even the same organization may store cases in different data structures, making data integration more challenging. With such inherent complexity, domain experts and domain knowledge are often necessary to perform data integration. However, expert human labor is time and cost prohibitive. To overcome the variability in the structure, format, and content of the different data sources, we map the text into common categories and compute similarity within those. In this paper, we present a method to categorize and merge clinical data by considering the underlying semantics behind the cases and use reference information about the cases to perform data integration. Evaluation shows that we were able to merge 88% of clinical data from five different sources

    BRISC—An Open Source Pulmonary Nodule Image Retrieval Framework

    Get PDF
    We have created a content-based image retrieval framework for computed tomography images of pulmonary nodules. When presented with a nodule image, the system retrieves images of similar nodules from a collection prepared by the Lung Image Database Consortium (LIDC). The system (1) extracts images of individual nodules from the LIDC collection based on LIDC expert annotations, (2) stores the extracted data in a flat XML database, (3) calculates a set of quantitative descriptors for each nodule that provide a high-level characterization of its texture, and (4) uses various measures to determine the similarity of two nodules and perform queries on a selected query nodule. Using our framework, we compared three feature extraction methods: Haralick co-occurrence, Gabor filters, and Markov random fields. Gabor and Markov descriptors perform better at retrieving similar nodules than do Haralick co-occurrence techniques, with best retrieval precisions in excess of 88%. Because the software we have developed and the reference images are both open source and publicly available they may be incorporated into both commercial and academic imaging workstations and extended by others in their research
    • …
    corecore