Search CORE

37 research outputs found

Interactivity, Fairness and Explanations in Recommendations

Author: Giannopoulos Giorgos
Papastefanatos George
Sacharidis Dimitris
Stefanidis Kostas
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/06/2021
Field of study

More and more aspects of our everyday lives are influenced by automated decisions made by systems that statistically analyze traces of our activities. It is thus natural to question whether such systems are trustworthy, particularly given the opaqueness and complexity of their internal workings. In this paper, we present our ongoing work towards a framework that aims to increase trust in machine-generated recommendations by combining ideas from three separate recent research directions, namely explainability, fairness and user interactive visualization. The goal is to enable different stakeholders, with potentially varying levels of background and diverse needs, to query, understand, and fix sources of distrust.acceptedVersionPeer reviewe

Trepo - Institutional Repository of Tampere University

Fairness Aware Counterfactuals for Subgroups

Author: Emiris Ioannis
Fotakis Dimitris
Giannopoulos Giorgos
Kavouras Loukas
Psaroudaki Eleni
Rontogiannis Dimitrios
Sacharidis Dimitris
Theologitis Nikolaos
Tsopelas Konstantinos
Publication venue
Publication date: 26/06/2023
Field of study

In this work, we present Fairness Aware Counterfactuals for Subgroups (FACTS), a framework for auditing subgroup fairness through counterfactual explanations. We start with revisiting (and generalizing) existing notions and introducing new, more refined notions of subgroup fairness. We aim to (a) formulate different aspects of the difficulty of individuals in certain subgroups to achieve recourse, i.e. receive the desired outcome, either at the micro level, considering members of the subgroup individually, or at the macro level, considering the subgroup as a whole, and (b) introduce notions of subgroup fairness that are robust, if not totally oblivious, to the cost of achieving recourse. We accompany these notions with an efficient, model-agnostic, highly parameterizable, and explainable framework for evaluating subgroup fairness. We demonstrate the advantages, the wide applicability, and the efficiency of our approach through a thorough experimental evaluation of different benchmark datasets

arXiv.org e-Print Archive

Review of solar energetic particle models

Author: A.Young Matthew
Abunin Artem A.
Abunina Maria A.
Alberti Tommaso
Ali Aatiya
Allison Clayton
Aminalragia-Giamini Sigiava
Anastasiadis Anastasios
Angryk Rafal
Aran Angels
Aydin Berkay
Balch Christopher C.
Barnett Ben
Barzilla Janet
Belov Anatoly
Benella Simone
Boubrahimi Soukaina F.
Bruno Alessandro
Caplan Ronald M.
Chatterjee Subhamoy
Chen Yang
Cliver Edward W.
Crosby Norma
Dalla Silvia
Dayeh Maher A.
Dechev Momchil
Dierckxsens Mark
Downs Cooper
Effenberger Frederic
Egeland Ricky
Engell Alec
Eroshenko Eugenia A.
F.Smart Don
Falconer David
Farmer Harold
Fereira Sheldon H.
Fernandes Ian
G.Ling Alan
G.Luhmann Janet
G.Richardson Ian
Giannopoulos Giorgos
Gombosi Tamas
Gorby Matthew
Grey Nathan
Heber Bernd
Hosseinzadeh Pouya
Hu Junxiang
Huang Zhenguang
Illarionov Egor
Jacobs Carla
Ji Anli
Jiang Yucheng
Jiggens Piers
Jin Meng
K.Georgoulis Manolis
Kadadi Yash
Karavolos Michalis
Kasapis Spiridon
Kempton Dustin
Kerber Thuha
Kitiashvili Irina
Kosovichev Alexander G.
Kouloumvakos Athanasios
Kozarev Kamen
Kühl Patrick
Labrenz Johannes
Lario David
Laurenza Monica
Lavasa Eleni
Li Gang
Linker Jon A.
Lionello Roberto
Loomis Jeremy
M.Baini Hazel
M.O’Keefe Patrick
Malandraki Olga
Martens Petrus
Mays M. Leila
Maze Brianna
Miteva Rositsa
Moreland Kimberly D.
Muñoz-Jaramillo Andrés
Nedal Mohamed
Nita Gelu M.
Núñez Marlon
O.Lee Christina
Oria Vincent
P.Zank Gary
Paassilta Miikka
Palmerio Erika
Pandey Chetraj
Paouris Evangelos
Papaioannou Athanasios
Poedts Stefaan
Quinn Philip
Roussev Ilia I.
S.Marsh Michael
Sadykov Viacheslav
Sanahuja Blai
Sandberg Ingmar
Schwadron Nathan A.
Semones Edward
Shea Margaret A.
Sokolov Igor V.
Steyn Ruhann
Strauss Du Toit
Stumpo Mirko
Swalwell Bill
T.Lee Kerry
Tadesse Tilaye
Taktakishvili Aleksandre
Tenishev Valeriy
Thompson Barbara J.
Titov Viacheslav
Török Tibor
Vainio Rami
van den Berg Jabus
W.Kahler Stephen
Wang Xiantong
Waterfall Charlotte
White Stephen M.
Whitman Kathryn
Wijsen Nicolas
Zhang Ming
Zhao Lulu
Zucca Pietro
Publication venue: 'Elsevier BV'
Publication date: 29/11/2022
Field of study

Solar Energetic Particle (SEP) events are interesting from a scientific perspective as they are the product of a broad set of physical processes from the corona out through the extent of the heliosphere, and provide insight into processes of particle acceleration and transport that are widely applicable in astrophysics. From the operations perspective, SEP events pose a radiation hazard for aviation, electronics in space, and human space exploration, in particular for missions outside of the Earth’s protective magnetosphere including to the Moon and Mars. Thus, it is critical to improve the scientific understanding of SEP events and use this understanding to develop and improve SEP forecasting capabilities to support operations. Many SEP models exist or are in development using a wide variety of approaches and with differing goals. These include computationally intensive physics-based models, fast and light empirical models, machine learning-based models, and mixed-model approaches. The aim of this paper is to summarize all of the SEP models currently developed in the scientific community, including a description of model approach, inputs and outputs, free parameters, and any published validations or comparisons with data.</p

UTUPub

Methods on retrieving sources and data from the Web for supporting scientific innovation

Author: Giannopoulos Giorgos P.
Γιαννόπουλος Γεώργιος Π.
Publication venue
Publication date: 26/05/2014
Field of study

162 σ.Η διατριβή πραγματεύεται ζητήματα αναταξινόμησης (εξατομίκευσης, διαφοροποίησης, συνδυασμού) αποτελεσμάτων αναζήτησης στον ιστό. Συγκεκριμένα, μελετώνται και προτείνονται μέθοδοι για αναταξινόμηση των αποτελεσμάτων μηχανών αναζήτησης, ώστε να ανταποκρίνονται στις εκάστοτε ανάγκες αναζήτησης πληροφορίας ενός χρήστη ή ομάδας χρηστών. Ως βάση χρησιμοποιούνται προσεγγίσεις που στηρίζονται στην εκπαίδευση συναρτήσεων αναταξινόμησης αποτελεσμάτων, χρησιμοποιώντας πληροφορία που εξάγεται από το ιστορικό αναζήτησης του χρήστη (ερωτήματα αναζήτησης, αποτελέσματα και επιλεγμένα αποτελέσματα). Επιπλέον, προτείνονται μέθοδοι για ημι-αυτόματη σημασιολογική επισημείωση εγγράφων με χρήση οντολογιών, για υβριδική αναζήτηση εγγράφων (με λέξεις κλειδιά και με έννοιες οντολογίας) και για εξατομίκευση αναζήτησης με λέξεις κλειδιά σε σημασιολογικά δεδομένα. Επίσης, εφαρμόζονται ευριστικές και ορίζονται κριτήρια για διαφοροποίηση σχολίων χρηστών σε κοινωνικά δίκτυα, καθώς και σημασιολογικών, δομημένων δεδομένων για αναζήτηση με λέξεις κλειδιά. Τέλος, εξετάζεται το πρόβλημα της αναταξινόμησης αποτελεσμάτων αναζήτησης σε οντότητες με αλλαγές στην ονοματολογία τους (βιολογικές οντότητες). Στα πλαίσια της διατριβής μελετήθηκαν και υλοποιήθηκαν μέθοδοι για την αποτελεσματικότερη και αποδοτικότερη χρησιμοποίηση του ιστορικού αναζήτησης, μέσω της εκπαίδευσης εξειδικευμένων συναρτήσεων ταξινόμησης. Συγκεκριμένα, σε πρώτη φάση υλοποιήθηκε μία μέθοδος εμπλουτισμού της εξαγόμενης πληροφορίας από το ιστορικό του χρήστη, για ταχύτερη εκπαίδευση των συναρτήσεων. Στη συνέχεια, αναπτύχθηκαν μέθοδοι εκπαίδευσης πολλαπλών συναρτήσεων με βάση, είτε το περιεχόμενο αναζήτησης, είτε τη συμπεριφορά αναζήτησης του χρήστη. Η καινοτομία των μεθόδων έγκειται στη συγκέντρωση συνεργατικής πληροφορίας από το ιστορικό του συνόλου των χρηστών και στο διαχωρισμό αυτής της πληροφορίας σε συστάδες που αντιπροσωπεύουν διαφορετικό περιεχόμενο ή συμπεριφορά αναζήτησης. Η τελική αναταξινόμηση επιτυγχάνεται με το συνδυασμό των αποτελεσμάτων από τις συναρτήσεις που έχουν εκπαιδευτεί χρησιμοποιώντας τις παραπάνω συστάδες. Επιπλέον, στα πλαίσια της διατριβής μελετήθηκε η προσαρμογή μεθόδων διαφοροποίησης αποτελεσμάτων αναζήτησης, στο σενάριο διαφοροποίησης σχολίων χρηστών σε κοινωνικά δίκτυα. Ορίστηκαν εξειδικευμένα κριτήρια διαφοροποίησης και εφαρμόστηκαν διαφορετικοί ευριστικοί αλγόριθμοι διαφοροποίησης. Για να καταδειχθεί η αποτελεσματικότητα των προτεινόμενων προσεγγίσεων, ορίστηκαν ειδικές μετρικές αξιολόγησης της ετερογένειας συνόλων σχολίων χρηστών. Πέρα από το σενάριο διαφοροποίησης σχολίων, έγινε μία πρώτη προεργασία για τη διαφοροποίηση αναζήτησης με λέξεις κλειδιά σε σημασιολογικά δεδομένα, δηλαδή δομημένα δεδομένα που ακολουθούν ορισμένο σχήμα και διασυνδέονται μέσω ιδιοτήτων. Επιπρόσθετα, προτάθηκαν μέθοδοι για τη βελτίωση της αναζήτησης εγγράφων μέσω σημασιολογικής επισημείωσής τους και μετέπειτα υβριδικής (με λέξεις κλειδιά και σημασιολογικής) αναζήτησής τους, καθώς και εξατομικευμένης αναζήτησης σημασιολογικών δεδομένων. Τέλος, εξετάστηκαν σχήματα ευρετηρίασης και αλγόριθμοι βαθμολόγησης οντοτήτων των οποίων η ονοματολογία μεταβάλλεται με το χρόνο, όπως συμβαίνει, για παράδειγμα, σε ορισμένες βιολογικές οντότητες. Οι παραπάνω εργασίες αξιολογήθηκαν σε διαφορετικά σενάρια αναζήτησης, καθώς και σε ετερογενή σύνολα δεδομένων, όπως έγγραφα-ιστοσελίδες, σχόλια χρηστών, σημασιολογικές επισημειώσεις κειμένων και βιολογικές οντότητες. Επέφεραν δε αποτελέσματα που βελτίωναν τις προϋπάρχουσες βασικές μεθόδους στο κάθε πρόβλημα και οδήγησαν σε περισσότερες από δέκα δημοσιεύσεις σε διεθνή συνέδρια, workshops και περιοδικά. Επιπλέον, μέσω των παραπάνω εργασιών, προέκυψαν περαιτέρω ερευνητικά προβλήματα, τα οποία έχουν περιγραφεί στις δημοσιευμένες εργασίες και θα μπορούσαν να αποτελέσουν αντικείμενο μελλοντικής δουλειάς.The thesis handles re-ranking problems, including personalization, diversification, and hybrid search of entities on the web. Specifically, we studied and proposed novel methods for re-ranking web search results by capturing information needs of users or groups of users. We base our methods on ranking function training models, utilizing information extractedfrom user's search history (clickstream data - queries, results and clicked results). Further, we propose methods for semi-automatic semantic annotation of documents using ontology classes, for hybrid document search (using keywords and ontology classes) and for personalization of keyword search on semantic (RDF) data. Moreover, we evaluate/propose heuristics and introduce criteria for diversification of user comments on social networks, as well as for diversification of keyword search on semantic, structured data. Finally, we propose a first cut approach on re-ranking search results on name changing biological entities. Next, we discuss each of the above methods in more detail. Through the presented research, we implemented methods for more effective utilization of users' search histories, through ranking function training. Specifically, first, we proposed a method for enriching the extracted information from user's clickstream data (search history), for faster ranking function training. Next, we proposed and implemented methods for training multiple ranking functions, based either on search content or on user behavior. The novelty of the methods lies on gathering collaborative information from all users and grouping this information into clusters that represent diverse content or diverse search behavior. The final ranking of the results is achieved by combining rankings produced by models trained on different clusters. Moreover, we studied the adaptation of the problem of search result diversification into the scenario of diversifying user comments on news articles. We defined problem specific diversification criteria and applied several heuristic diversification algorithms. In order to assess the effectiveness of the proposed methods, we defined problem specific evaluation measures. Beyond that, we proposed a first cut approach for diversifying keyword search results on semantic (RDF) data, utilizing the schema and structure characterizing the data and the properties interconnecting the data. Finally, we examined indexing schemes and ranking algorithms for entities whose naming changes through time, as it stands for certain categories of biological entities. The aforementioned works were evaluated in several search scenarios, as well as on diverse datasets, such as documents-web pages, user comments, semantic annotations on documents and biological entities. The evaluation results showed that the above methods improved the effectiveness of baseline methods in the specific research problems, leading to the publication of more than ten articles in international conferences, workshops and journals. Further, through the work done on the specific areas, new, interesting problems arised, that are described in the individual publications and can be handled in future works.Γεώργιος Π. Γιαννόπουλο

DSpace at NTUA

GoNTogle: a tool for semantic annotation and search

Author: Bikakis Nikos
Dalamagas Theodore
Giannopoulos Giorgos
Sellis Timos
Publication venue: Springer Berlin Heidelberg
Publication date: 01/01/2010
Field of study

This paper presents GoNTogle, a tool which provides advanced document annotation and search facilities. GoNTogle allows users to annotate several document formats, using ontology concepts. It also produces automatic annotation suggestions based on textual similarity and previous document annotations. Finally, GoNTogle combines keyword and semantic-based search, offering advanced ontology query facilities

Crossref

DSpace at NTUA

Swinburne Research Bank

Auditing for Spatial Fairness

Author: Giannopoulos Giorgos
Papastefanatos George
Sacharidis Dimitris
Stefanidis Kostas
Publication venue: OpenProceedings.org
Publication date: 20/03/2023
Field of study

This paper studies algorithmic fairness when the protected attribute is location. To handle protected attributes that are continuous, such as age or income, the standard approach is to discretize the domain into predefined groups, and compare algorithmic outcomes across groups. However, applying this idea to location raises concerns of gerrymandering and may introduce statistical bias. Prior work addresses these concerns but only for regularly spaced locations, while raising other issues, most notably its inability to discern regions that are likely to exhibit spatial unfairness. Similar to established notions of algorithmic fairness, we define spatial fairness as the statistical independence of outcomes from location. This translates into requiring that for each region of space, the distribution of outcomes is identical inside and outside the region. To allow for localized discrepancies in the distribution of outcomes, we compare how well two competing hypotheses explain the observed outcomes. The null hypothesis assumes spatial fairness, while the alternate allows different distributions inside and outside regions. Their goodness of fit is then assessed by a likelihood ratio test. If there is no significant difference in how well the two hypotheses explain the observed outcomes, we conclude that the algorithm is spatially fair.Peer reviewe

Trepo - Institutional Repository of Tampere University