
422 research outputs found

Fell bundles associated to groupoid morphisms

Authors: Deaconu Valentin, Kumjian Alex, Ramazan Birant
Publication date: 01/01/2006

Given a continuous open surjective morphism $\pi : G \to H$ of étale groupoids with amenable kernel, we construct a Fell bundle $E$ over $H$ and prove that its C*-algebra $C^*_r(E)$ is isomorphic to $C^*_r(G)$. This is related to results of Fell concerning C*-algebraic bundles over groups. The case $H = X$, a locally compact space, was treated earlier by Ramazan. We conclude that $C^*_r(G)$ is strongly Morita equivalent to a crossed product, the C*-algebra of a Fell bundle arising from an action of the groupoid $H$ on a C*-bundle over $H^0$. We apply the theory to groupoid morphisms obtained from extensions of dynamical systems and from morphisms of directed graphs with the path lifting property. We also prove a structure theorem for abelian Fell bundles. Comment: 12 pages, revised version, references added; to appear in Mathematica Scandinavica.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Tidsskrift.dk (Det Kongelige Bibliotek)

Data Mining Using RFM Analysis

Author: Derya Birant
Publication venue: 'IntechOpen'
Publication date: 21/01/2011

IntechOpen

Service-Oriented Data Mining

Author: Derya Birant
Publication venue: 'IntechOpen'
Publication date: 21/01/2011

IntechOpen

Cluster analysis for physical oceanographic data and oceanographic surveys in Turkish seas

Authors: Birant Derya, Kut Alp
Publication venue: EliScholar – A Digital Platform for Scholarly Publishing at Yale
Publication date: 01/01/2006

Cluster analysis is a useful data mining method to obtain detailed information on the physical state of the ocean. The primary objective of this study is the development of a new spatio-temporal density-based algorithm for clustering physical oceanographic data. This study extends the regular spatial cluster analysis to deal with spatial data at different epochs. It also presents the sensitivity of the new algorithm to different parameter settings. The purpose of the sensitivity analysis presented in this paper is to identify the response of the algorithm to variations in input parameter values and boundary conditions. To demonstrate the usage of the new algorithm, this paper presents two oceanographic applications that cluster the sea-surface temperature (SST) and the sea-surface height residual (SSH) data, which record satellite observations of the Turkish seas. It also evaluates and justifies the clustering results by using a cluster validation technique.
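The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of density-based clustering over space and time, not the authors' method. It assumes each observation is an `(x, y, t)` tuple and that two hypothetical radii, `eps_space` and `eps_time`, together with a `min_pts` threshold, define what counts as a dense neighborhood.

```python
from math import hypot

NOISE = -1  # label for points that belong to no cluster


def st_neighbors(points, i, eps_space, eps_time):
    """Indices of points close to point i in both space and time (includes i)."""
    xi, yi, ti = points[i]
    return [
        j for j, (x, y, t) in enumerate(points)
        if hypot(x - xi, y - yi) <= eps_space and abs(t - ti) <= eps_time
    ]


def st_dbscan(points, eps_space, eps_time, min_pts):
    """Assign a cluster id (0, 1, ...) or NOISE to every (x, y, t) point."""
    labels = [None] * len(points)
    cluster = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue  # already visited
        neighbors = st_neighbors(points, i, eps_space, eps_time)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = st_neighbors(points, j, eps_space, eps_time)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # j is a core point, keep growing
        cluster += 1
    return labels


# Two spatially overlapping groups separated in time, plus one isolated point.
points = [(0, 0, 0), (0.1, 0, 0), (0, 0.1, 1),
          (0, 0, 10), (0.1, 0, 10), (0, 0.1, 11),
          (5, 5, 5)]
labels = st_dbscan(points, eps_space=0.5, eps_time=2, min_pts=3)
```

With a small `eps_time` the two groups are kept apart even though they occupy the same location; widening `eps_time` merges them, which is the kind of parameter sensitivity the abstract refers to.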

Yale University

An Impossibility Result for High Dimensional Supervised Learning

Authors: Ishwar Prakash, Karl William C., Orten Birant, Rohban Mohammad Hossein, Saligrama Venkatesh
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013

We study high-dimensional asymptotic performance limits of binary supervised classification problems where the class conditional densities are Gaussian with unknown means and covariances and the number of signal dimensions scales faster than the number of labeled training samples. We show that the Bayes error, namely the minimum attainable error probability with complete distributional knowledge and equally likely classes, can be arbitrarily close to zero and yet the limiting minimax error probability of every supervised learning algorithm is no better than a random coin toss. In contrast to related studies where the classification difficulty (Bayes error) is made to vanish, we hold it constant when taking high-dimensional limits. In contrast to VC-dimension based minimax lower bounds that consider the worst case error probability over all distributions that have a fixed Bayes error, our worst case is over the family of Gaussian distributions with constant Bayes error. We also show that a nontrivial asymptotic minimax error probability can only be attained for parametric subsets of zero measure (in a suitable measure space). These results expose the fundamental importance of prior knowledge and suggest that unless we impose strong structural constraints, such as sparsity, on the parametric space, supervised learning may be ineffective in high-dimensional, small-sample settings. Comment: This paper was submitted to the IEEE Information Theory Workshop (ITW) 2013 on April 23, 201
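To make the notion of Bayes error concrete, here is a small illustrative sketch for one special case only: two equally likely Gaussian classes that share an identity covariance, for which the Bayes error has the closed form $\Phi(-\lVert \mu_1 - \mu_0 \rVert / 2)$. The shared-covariance assumption and the function name `bayes_error_identity_cov` are introduced here for illustration; the paper itself treats unknown means and covariances.

```python
from math import erfc, sqrt


def bayes_error_identity_cov(mu0, mu1):
    """Bayes error for two equiprobable Gaussians N(mu0, I) and N(mu1, I).

    The optimal rule thresholds the projection onto mu1 - mu0, giving
    error Phi(-d / 2), where d is the Euclidean distance between the means.
    """
    d = sqrt(sum((a - b) ** 2 for a, b in zip(mu0, mu1)))
    # Standard normal CDF at -d/2, written with the complementary error function.
    return 0.5 * erfc(d / (2 * sqrt(2)))


identical = bayes_error_identity_cov([0.0, 0.0], [0.0, 0.0])  # classes coincide
separated = bayes_error_identity_cov([0.0, 0.0], [2.0, 0.0])  # means 2 apart
```

When the means coincide the error is that of a coin toss, and it shrinks as the means separate. The abstract's point is that a learner without knowledge of the parameters may be unable to approach this value when the dimension outgrows the sample size.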

arXiv.org e-Print Archive

CiteSeerX

Crossref

Ensemble Methods in Environmental Data Mining

Authors: Birant Derya, Pala Aysegul, Tuysuzoglu Goksu
Publication venue: 'IntechOpen'
Publication date: 17/02/2018

Environmental data mining is the nontrivial process of identifying valid, novel, and potentially useful patterns in data from environmental sciences. This chapter proposes ensemble methods in environmental data mining that combine the outputs from multiple classification models to obtain better results than could be obtained by an individual model. The study presented in this chapter focuses on several ensemble strategies in addition to the standard single classifiers commonly used in the literature, such as decision tree, naive Bayes, support vector machine, and k-nearest neighbor (KNN). This is the first study that compares four ensemble strategies for environmental data mining: (i) bagging, (ii) bagging combined with random feature subset selection (the random forest algorithm), (iii) boosting (the AdaBoost algorithm), and (iv) voting of different algorithms. In the experimental studies, ensemble methods are tested on different real-world environmental datasets in various subjects such as air, ecology, rainfall, and soil.
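Of the four strategies listed, voting is the simplest to illustrate. The sketch below assumes each model has already produced one label per sample and combines them by plurality vote; the function name `majority_vote` and the example labels are hypothetical and not taken from the chapter.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Combine per-model label lists into one label per sample.

    predictions_per_model: list of equal-length label lists, one per model.
    Ties are resolved in favor of the label seen first among the votes.
    """
    combined = []
    for votes in zip(*predictions_per_model):  # votes for a single sample
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


# Three hypothetical classifiers predicting a label for three samples.
model_outputs = [
    ["rain", "dry", "rain"],
    ["rain", "rain", "dry"],
    ["dry", "dry", "rain"],
]
ensemble = majority_vote(model_outputs)
```

Bagging and random forests apply the same combination step, but to models trained on resampled data rather than to different algorithms.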

IntechOpen

Crossref

Hierarchical Clustering Ensemble with Different Linkage Methods (FARKLI BAĞLANTI YÖNTEMLERİ İLE HİYERARŞİK KÜMELEME TOPLULUĞU)

Author: BİRANT Derya
Publication venue: Konya Teknik Üniversitesi Mühendislik ve Doğa Bilimleri Fakültesi
Publication date: 28/02/2019

Clustering ensembles have become a preferred technique in recent years because they provide high clustering performance. This study proposes a new approach named Linkage-based Hierarchical Clustering Ensemble (BHKT, from the Turkish "Bağlantı-tabanlı Hiyerarşik Kümeleme Topluluğu"). In the proposed approach, the ensemble members perform hierarchical clustering using different linkage methods and then produce a joint decision by majority voting. The linkage methods used in the study are single linkage, complete linkage, average linkage, centroid linkage, Ward's method, the neighbor-joining method, and adjusted complete linkage. In addition, hierarchical clustering ensembles of different sizes are examined and compared with one another. In the experimental studies, the hierarchical clustering ensembles were applied to 8 different datasets and achieved better results than a single clustering algorithm.

Selcuk University Journal of Engineering, Science and Technology

Data Mining in Banking Sector Using Weighted Decision Jungle Method

Author: Birant Derya
Publication venue: 'IntechOpen'
Publication date: 20/04/2020

Classification, as one of the most popular data mining techniques, has been used in the banking sector for different purposes, for example, for bank customer churn prediction, credit approval, fraud detection, bank failure estimation, and bank telemarketing prediction. However, traditional classification algorithms do not take the class distribution into account, which results in poor performance on imbalanced banking data. To solve this problem, this paper proposes an approach that improves the decision jungle (DJ) method with a class-based weighting mechanism. The experiments conducted on 17 real-world bank datasets show that the proposed approach outperforms the decision jungle method when handling imbalanced banking data.
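The abstract does not say how the class-based weights are computed. As one common possibility, and purely as an assumption, the sketch below uses inverse-frequency weights, `n_samples / (n_classes * class_count)`, the same heuristic scikit-learn uses for its "balanced" class weights. The labels `"stay"` and `"churn"` are made-up example data.

```python
from collections import Counter


def class_weights(labels):
    """Inverse-frequency weight per class: n_samples / (n_classes * class_count).

    Rare classes receive weights above 1 and frequent classes below 1,
    so errors on the minority class count for more during training.
    """
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {c: n_samples / (n_classes * count) for c, count in counts.items()}


# Hypothetical imbalanced churn data: 9 customers stay, 1 churns.
labels = ["stay"] * 9 + ["churn"]
weights = class_weights(labels)
```

Here the single churn example is weighted nine times as heavily as each non-churn example, which offsets the 9:1 imbalance.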

IntechOpen

Crossref

Multidisciplinary Management of Benign Jaw Tumors in Children

Authors: Akay Mehmet Cemal, Aras Işıl, Zeytinoğlu Mert, Şimşek Birant
Publication venue: 'IntechOpen'
Publication date: 22/04/2015

IntechOpen