65 research outputs found
Evaluation of machine-learning methods for ligand-based virtual screening
Machine-learning methods can be used for virtual screening by analysing the structural characteristics of molecules of known (in)activity, and we here discuss the use of kernel discrimination and naive Bayesian classifier (NBC) methods for this purpose. We report a kernel method that allows the processing of molecules represented by binary, integer and real-valued descriptors, and show that it is little different in screening performance from a previously described kernel that had been developed specifically for the analysis of binary fingerprint representations of molecular structure. We then evaluate the performance of an NBC when the training-set contains only a very few active molecules. In such cases, a simpler approach based on group fusion would appear to provide superior screening performance, especially when structurally heterogeneous datasets are to be processed
Characterization of digital medical images utilizing support vector machines
BACKGROUND: In this paper we discuss an efficient methodology for the image analysis and characterization of digital images containing skin lesions using Support Vector Machines and present the results of a preliminary study. METHODS: The methodology is based on the support vector machines algorithm for data classification and it has been applied to the problem of the recognition of malignant melanoma versus dysplastic naevus. Border and colour based features were extracted from digital images of skin lesions acquired under reproducible conditions, using basic image processing techniques. Two alternative classification methods, the statistical discriminant analysis and the application of neural networks were also applied to the same problem and the results are compared. RESULTS: The SVM (Support Vector Machines) algorithm performed quite well achieving 94.1% correct classification, which is better than the performance of the other two classification methodologies. The method of discriminant analysis classified correctly 88% of cases (71% of Malignant Melanoma and 100% of Dysplastic Naevi), while the neural networks performed approximately the same. CONCLUSION: The use of a computer-based system, like the one described in this paper, is intended to avoid human subjectivity and to perform specific tasks according to a number of criteria. However the presence of an expert dermatologist is considered necessary for the overall visual assessment of the skin lesion and the final diagnosis
Host specificity and experimental assessment of the early establishment of the mistletoe Phoradendron crassifolium (Pohl ex DC.) Eichler (Santalaceae) in a fragment of Atlantic Forest in southeast Brazil
Remoção dos frutos de Miconia albicans (sw.) Triana (Melastomataceae) por formigas na borda e no interior de um fragmento de Cerrado, Curvelo, MG
Frugivory and seed dispersal of Miconia theaezans (Bonpl.) Cogniaux (Melastomataceae) by birds in a transition palm swamp: gallery forest in Central Brazil
Gene-flow through space and time: dispersal, dormancy and adaptation to changing environments
Vascular flora of the cerrado of Bauru-SP
Information on the cerrado vascular flora of the municipality of Bauru has been provided in lists of floristic surveys carried out in fragments of this vegetation type at different times, applying different criteria, and conforming to current taxonomic classifications. We organized this information according to APG III and revised synonymies, aiming at producing a single floristic list of species occurring in cerrado sensu lato or ecotonal areas (transitions between cerrado and seasonal forest) in municipality of Bauru to inform conservation proposals. For this purpose, we referred to all floristic lists of vascular plants found in cerrado fragments in Bauru and to botanic material collected and deposited in the herbaria of the Department of Biological Sciences, School of Sciences, Bauru Campus, UNESP (UNBA), and of the Bauru Botanical Garden (JBMB). We recorded 371 species from 78 families. Fabaceae was the richest in species. We also indicated each mentioned species’ habit and the vegetation types where plants occur in the municipality
BioTIME 2.0 : expanding and improving a database of biodiversity time series
Funding: H2020 European Research Council (Grant Number(s): GA 101044975, GA 101098020).Motivation: Here, we make available a second version of the BioTIME database, which compiles records of abundance estimates for species in sample events of ecological assemblages through time. The updated version expands version 1.0 of the database by doubling the number of studies and includes substantial additional curation to the taxonomic accuracy of the records, as well as the metadata. Moreover, we now provide an R package (BioTIMEr) to facilitate use of the database. Main Types of Variables: Included The database is composed of one main data table containing the abundance records and 11 metadata tables. The data are organised in a hierarchy of scales where 11,989,233 records are nested in 1,603,067 sample events, from 553,253 sampling locations, which are nested in 708 studies. A study is defined as a sampling methodology applied to an assemblage for a minimum of 2 years. Spatial Location and Grain: Sampling locations in BioTIME are distributed across the planet, including marine, terrestrial and freshwater realms. Spatial grain size and extent vary across studies depending on sampling methodology. We recommend gridding of sampling locations into areas of consistent size. Time Period and Grain: The earliest time series in BioTIME start in 1874, and the most recent records are from 2023. Temporal grain and duration vary across studies. We recommend doing sample-level rarefaction to ensure consistent sampling effort through time before calculating any diversity metric. Major Taxa and Level of Measurement: The database includes any eukaryotic taxa, with a combined total of 56,400 taxa. Software Format: csv and. SQL.Peer reviewe
- …
