117 research outputs found

    Cross-systems Personalisierung

    Get PDF
    The World Wide Web provides access to a wealth of information and services to a huge and heterogeneous user population on a global scale. One important and successful design mechanism in dealing with this diversity of users is to personalize Web sites and services, i.e. to customize system content, characteristics, or appearance with respect to a specific user. Each system independently builds up user proïŹles and uses this information to personalize the service offering. Such isolated approaches have two major drawbacks: firstly, investments of users in personalizing a system either through explicit provision of information or through long and regular use are not transferable to other systems. Secondly, users have little or no control over the information that defines their profile, since user data are deeply buried in personalization engines running on the server side. Cross system personalization (CSP) (Mehta, Niederee, & Stewart, 2005) allows for sharing information across different information systems in a user-centric way and can overcome the aforementioned problems. Information about users, which is originally scattered across multiple systems, is combined to obtain maximum leverage and reuse of information. Our initial approaches to cross system personalization relied on each user having a unified profile which different systems can understand. The unified profile contains facets modeling aspects of a multidimensional user which is stored inside a "Context Passport" that the user carries along in his/her journey across information space. The user’s Context Passport is presented to a system, which can then understand the context in which the user wants to use the system. The basis of ’understanding’ in this approach is of a semantic nature, i.e. the semantics of the facets and dimensions of the uniïŹed proïŹle are known, so that the latter can be aligned with the proïŹles maintained internally at a specific site. The results of the personalization process are then transfered back to the user’s Context Passport via a protocol understood by both parties. The main challenge in this approach is to establish some common and globally accepted vocabulary and to create a standard every system will comply with. Machine Learning techniques provide an alternative approach to enable CSP without the need of accepted semantic standards or ontologies. The key idea is that one can try to learn dependencies between proïŹles maintained within one system and profiles maintained within a second system based on data provided by users who use both systems and who are willing to share their proïŹles across systems – which we assume is in the interest of the user. Here, instead of requiring a common semantic framework, it is only required that a sufficient number of users cross between systems and that there is enough regularity among users that one can learn within a user population, a fact that is commonly exploited in collaborative filtering. In this thesis, we aim to provide a principled approach towards achieving cross system personalization. We describe both semantic and learning approaches, with a stronger emphasis on the learning approach. We also investigate the privacy and scalability aspects of CSP and provide solutions to these problems. Finally, we also explore in detail the aspect of robustness in recommender systems. We motivate several approaches for robustifying collaborative filtering and provide the best performing algorithm for detecting malicious attacks reported so far.Die Personalisierung von Software Systemen ist von stetig zunehmender Bedeutung, insbesondere im Zusammenhang mit Web-Applikationen wie Suchmaschinen, Community-Portalen oder Electronic Commerce Sites, die große, stark diversifizierte Nutzergruppen ansprechen. Da explizite Personalisierung typischerweise mit einem erheblichen zeitlichem Aufwand fĂŒr den Nutzer verbunden ist, greift man in vielen Applikationen auf implizite Techniken zur automatischen Personalisierung zurĂŒck, insbesondere auf Empfehlungssysteme (Recommender Systems), die typischerweise Methoden wie das Collaborative oder Social Filtering verwenden. WĂ€hrend diese Verfahren keine explizite Erzeugung von Benutzerprofilen mittels Beantwortung von Fragen und explizitem Feedback erfordern, ist die QualitĂ€t der impliziten Personalisierung jedoch stark vom verfĂŒgbaren Datenvolumen, etwa Transaktions-, Query- oder Click-Logs, abhĂ€ngig. Ist in diesem Sinne von einem Nutzer wenig bekannt, so können auch keine zuverlĂ€ssigen persönlichen Anpassungen oder Empfehlungen vorgenommen werden. Die vorgelegte Dissertation behandelt die Frage, wie Personalisierung ĂŒber Systemgrenzen hinweg („cross system“) ermöglicht und unterstĂŒtzt werden kann, wobei hauptsĂ€chlich implizite Personalisierungstechniken, aber eingeschrĂ€nkt auch explizite Methodiken wie der semantische Context Passport diskutiert werden. Damit behandelt die Dissertation eine wichtige Forschungs-frage von hoher praktischer Relevanz, die in der neueren wissenschaftlichen Literatur zu diesem Thema nur recht unvollstĂ€ndig und unbefriedigend gelöst wurde. Automatische Empfehlungssysteme unter Verwendung von Techniken des Social Filtering sind etwas seit Mitte der 90er Jahre mit dem Aufkommen der ersten E-Commerce Welle popularisiert orden, insbesondere durch Projekte wie Information Tapistery, Grouplens und Firefly. In den spĂ€ten 90er Jahren und Anfang dieses Jahrzehnts lag der Hauptfokus der Forschungsliteratur dann auf verbesserten statistischen Verfahren und fortgeschrittenen Inferenz-Methodiken, mit deren Hilfe die impliziten Beobachtungen auf konkrete Anpassungs- oder Empfehlungsaktionen abgebildet werden können. In den letzten Jahren sind vor allem Fragen in den Vordergrund gerĂŒckt, wie Personalisierungssysteme besser auf die praktischen Anforderungen bestimmter Applikationen angepasst werden können, wobei es insbesondere um eine geeignete Anpassung und Erweiterung existierender Techniken geht. In diesem Rahmen stellt sich die vorgelegte Arbeit

    Multi-view Latent Factor Models for Recommender Systems

    Get PDF

    Semantic discovery and reuse of business process patterns

    Get PDF
    Patterns currently play an important role in modern information systems (IS) development and their use has mainly been restricted to the design and implementation phases of the development lifecycle. Given the increasing significance of business modelling in IS development, patterns have the potential of providing a viable solution for promoting reusability of recurrent generalized models in the very early stages of development. As a statement of research-in-progress this paper focuses on business process patterns and proposes an initial methodological framework for the discovery and reuse of business process patterns within the IS development lifecycle. The framework borrows ideas from the domain engineering literature and proposes the use of semantics to drive both the discovery of patterns as well as their reuse

    Neural recommender models for sparse and skewed behavioral data

    Get PDF
    Modern online platforms offer recommendations and personalized search and services to a large and diverse user base while still aiming to acquaint users with the broader community on the platform. Prior work backed by large volumes of user data has shown that user retention is reliant on catering to their specific eccentric tastes, in addition to providing them popular services or content on the platform. Long-tailed distributions are a fundamental characteristic of human activity, owing to the bursty nature of human attention. As a result, we often observe skew in data facets that involve human interaction. While there are superficial similarities to Zipf's law in textual data and other domains, the challenges with user data extend further. Individual words may have skewed frequencies in the corpus, but the long-tail words by themselves do not significantly impact downstream text-mining tasks. On the contrary, while sparse users (a majority on most online platforms) contribute little to the training data, they are equally crucial at inference time. Perhaps more so, since they are likely to churn. In this thesis, we study platforms and applications that elicit user participation in rich social settings incorporating user-generated content, user-user interaction, and other modalities of user participation and data generation. For instance, users on the Yelp review platform participate in a follower-followee network and also create and interact with review text (two modalities of user data). Similarly, community question-answer (CQA) platforms incorporate user interaction and collaboratively authored content over diverse domains and discussion threads. Since user participation is multimodal, we develop generalizable abstractions beyond any single data modality. Specifically, we aim to address the distributional mismatch that occurs with user data independent of dataset specifics; While a minority of the users generates most training samples, it is insufficient only to learn the preferences of this subset of users. As a result, the data's overall skew and individual users' sparsity are closely interlinked: sparse users with uncommon preferences are under-represented. Thus, we propose to treat these problems jointly with a skew-aware grouping mechanism that iteratively sharpens the identification of preference groups within the user population. As a result, we improve user characterization; content recommendation and activity prediction (+6-22% AUC, +6-43% AUC, +12-25% RMSE over state-of-the-art baselines), primarily for users with sparse activity. The size of the item or content inventories compounds the skew problem. Recommendation models can achieve very high aggregate performance while recommending only a tiny proportion of the inventory (as little as 5%) to users. We propose a data-driven solution guided by the aggregate co-occurrence information across items in the dataset. We specifically note that different co-occurrences are not equally significant; For example, some co-occurring items are easily substituted while others are not. We develop a self-supervised learning framework where the aggregate co-occurrences guide the recommendation problem while providing room to learn these variations among the item associations. As a result, we improve coverage to ~100% (up from 5%) of the inventory and increase long-tail item recall up to 25%. We also note that the skew and sparsity problems repeat across data modalities. For instance, social interactions and review content both exhibit aggregate skew, although individual users who actively generate reviews may not participate socially and vice-versa. It is necessary to differentially weight and merge different data sources for each user towards inference tasks in such cases. We show that the problem is inherently adversarial since the user participation modalities compete to describe a user accurately. We develop a framework to unify these representations while algorithmically tackling mode collapse, a well-known pitfall with adversarial models. A more challenging but important instantiation of sparsity is the few-shot setting or cross-domain setting. We may only have a single or a few interactions for users or items in the sparse domains or partitions. We show that contextualizing user-item interactions helps us infer behavioral invariants in the dense domain, allowing us to correlate sparse participants to their active counterparts (resulting in 3x faster training, ~19% recall gains in multi-domain settings). Finally, we consider the multi-task setting, where the platform incorporates multiple distinct recommendations and prediction tasks for each user. A single-user representation is insufficient for users who exhibit different preferences along each dimension. At the same time, it is counter-productive to handle correlated prediction or inference tasks in isolation. We develop a multi-faceted representation approach grounded on residual learning with heterogeneous knowledge graph representations, which provides us an expressive data representation for specialized domains and applications with multimodal user data. We achieve knowledge sharing by unifying task-independent and task-specific representations of each entity with a unified knowledge graph framework. In each chapter, we also discuss and demonstrate how the proposed frameworks directly incorporate a wide range of gradient-optimizable recommendation and behavior models, maximizing their applicability and pertinence to user-centered inference tasks and platforms

    Analysis and design of individual information systems to support health behavior change

    Get PDF
    As a wide-ranging socio-technical transformation, the digitalization has significantly influenced the world, bringing opportunities and challenges to our lives. Despite numerous benefits like the possibility to stay connected with people around the world, the increasing dispersion and use of digital technologies and media (DTM) pose risks to individuals’ well-being and health. Rising demands emerging from the digital world have been linked to digital stress, that is, stress directly or indirectly resulting from DTM (Ayyagari et al. 2011; Ragu-Nathan et al. 2008; Tarafdar et al. 2019; Weil and Rosen 1997), potentially intensifying individuals’ overall exposure to stress. Individuals experiencing this adverse consequence of digitalization are at elevated risk of developing severe mental health impairments (Alhassan et al. 2018; Haidt and Allen 2020; Scott et al. 2017), which is why various scholars emphasize that research should place a stronger focus on analyzing and shaping the role of the individual in a digital world, pursuing instrumental as well as humanistic objectives (Ameen et al. 2021; Baskerville 2011b). Information Systems (IS) research has long placed emphasis on the use of information and communication technology (ICT) in organizations, viewing an information system as the socio-technical system that emerges from individuals’ interaction with DTM in organizations. However, socio-technical information systems, as the essence of the IS discipline (Lee 2004; Sarker et al. 2019), are also present in different social contexts from private life. Acknowledging the increasing private use of DTM, such as smartphones and social networks, IS scholars have recently intensified their efforts to understand the human factor of IS (Avison and Fitzgerald 1991; Turel et al. 2021). A framework recently proposed by Matt et al. (2019) suggests three research angles: analyzing individuals’ behavior associated with their DTM use, analyzing what consequences arise from their DTM use behavior, and designing new technologies that promote positive or mitigate negative effects of individuals’ DTM use. Various recent studies suggest that individuals’ behavior seems to be an important lever influencing the outcomes of their DTM use (Salo et al. 2017; Salo et al. 2020; Weinstein et al. 2016). Therefore, this dissertation aims to contribute to IS research targeting the facilitation of a healthy DTM use behavior. It explores the use behavior, consequences, and design of DTM for individuals' use with the objective to deliver humanistic value by increasing individuals' health through supporting a behavior change related to their DTM use. The dissertation combines behavioral science and design science perspectives and applies pluralistic methodological approaches from qualitative (e.g., interviews, prototyping) and quantitative research (e.g., survey research, field studies), including mixed-methods approaches mixing both. Following the framework from Matt et al. (2019), the dissertation takes three perspectives therein: analyzing individuals’ behavior, analyzing individuals’ responses to consequences of DTM use, and designing information systems assisting DTM users. First, the dissertation presents new descriptive knowledge on individuals’ behavior related to their use of DTM. Specifically, it investigates how individuals behave when interacting with DTM, why they behave the way they do, and how their behavior can be influenced. Today, a variety of digital workplace technologies offer employees different ways of pursuing their goals or performing their tasks (Köffer 2015). As a result, individuals exhibit different behaviors when interacting with these technologies. The dissertation analyzes what interactional roles DTM users can take at the digital workplace and what may influence their behavior. It uses a mixed-methods approach and combines a quantitative study building on trace data from a popular digital workplace suite and qualitative interviews with users of this digital workplace suite. The empirical analysis yields eight user roles that advance the understanding of users’ behavior at the digital workplace and first insights into what factors may influence this behavior. A second study adds another perspective and investigates how habitual behavior can be changed by means of DTM design elements. Real-time feedback has been discussed as a promising way to do so (Schibuola et al. 2016; Weinmann et al. 2016). In a field experiment, employees working at the digital workplace are provided with an external display that presents real-time feedback on their office’s indoor environmental quality. The experiment examines if and to what extent the feedback influences their ventilation behavior to understand the effect of feedback as a means of influencing individuals’ behavior. The results suggest that real-time feedback can effectively alter individuals’ behavior, yet the feedback’s effectiveness reduces over time, possibly as a result of habituation to the feedback. Second, the dissertation presents new descriptive and prescriptive knowledge on individuals’ ways to mitigate adverse consequences arising from the digitalization of individuals. A frequently discussed consequence that digitalization has on individuals is digital stress. Although research efforts strive to determine what measures individuals can take to effectively cope with digital stress (Salo et al. 2017; Salo et al. 2020; Weinert 2018), further understanding of individuals’ coping behavior is needed (Weinert 2018). A group at high risk of suffering from the adverse effects of digital stress is adolescents because they grow up using DTM daily and are still developing their identity, acquiring mental strength, and adopting essential social skills. To facilitate a healthy DTM use, the dissertation explores what strategies adolescents use to cope with the demands of their DTM use. Combining a qualitative and a quantitative study, it presents 30 coping responses used by adolescents, develops five factors underlying adolescents’ activation of coping responses, and identifies gender- and age-related differences in their coping behavior. Third, the dissertation presents new prescriptive knowledge on the design of individual information systems supporting individuals in understanding and mitigating their perceived stress. Facilitated by the sensing capabilities of modern mobile devices, it explores the design and development of mobile systems that assess stress and support individuals in coping with stress by initiating a change of stress-related behavior. Since there is currently limited understanding of how to develop such systems, this dissertation explores various facets of their design and development. As a first step, it presents the development of a prototype aiming for life-integrated stress assessment, that is, the mobile sensor-based assessment of an individual’s stress without interfering with their daily routines. Data collected with the prototype yields a stress model relating sensor data to individuals’ perception of stress. To deliver a more generalized perspective on mobile stress assessment, the dissertation further presents a literature- and experience-based design theory comprising a design blueprint, design requirements, design principles, design features, and a discussion of potentially required trade-offs. Mobile stress assessment may be used for the development of mobile coping assistants. Aiming to assist individuals in effectively coping with stress and preventing future stress, a mobile coping assistant should recommend adequate coping strategies to the stressed individual in real-time or execute targeted actions within a defined scope of action automatically. While the implementation of a mobile coping assistant is yet up to future research, the dissertation presents an abstract design and algorithm for selecting appropriate coping strategies. To sum up, this dissertation contributes new knowledge on the digitalization of individuals to the IS knowledge bases, expanding both descriptive and prescriptive knowledge. Through the combination of diverse methodological approaches, it delivers knowledge on individuals’ behavior when using DTM, on the mitigation of consequences that may arise from individuals’ use of DTM, and on the design of individual information systems with the goal of facilitating a behavior change, specifically, regarding individuals’ coping with stress. Overall, the research contained in this dissertation may promote the development of digital assistants that support individuals’ in adopting a healthy DTM use behavior and thereby contribute to shaping a socio-technical environment that creates more benefit than harm for all individuals

    Towards Robust Deep Learning for Medical Image Analysis

    Get PDF
    Multi-dimensional medical data are rapidly collected to enhance healthcare. With the recent advance in artificial intelligence, deep learning techniques have been widely applied to medical images, constituting a significant proportion of medical data. The techniques of automated medical image analysis have the potential to benefit general clinical procedures, e.g., disease screening, malignancy diagnosis, patient risk prediction, and surgical planning. Although preliminary success takes place, the robustness of these approaches requires to be cautiously validated and sufficiently guaranteed before their application to real-world clinical problems. In this thesis, we propose different approaches to improve the robustness of deep learning algorithms for automated medical image analysis. (i) In terms of network architecture, we leverage the advantages of both 2D and 3D networks, and propose an alternative 2.5D approach for 3D organ segmentation. (ii) To improve data efficiency and utilize large-scale unlabeled medical data, we propose a unified framework for semi-supervised medical image segmentation and domain adaptation. (iii) For the safety-critical applications, we design a unified approach for failure detection and anomaly segmentation. (iv) We study the problem of Federated Learning, which enables collaborative learning and preserves data privacy, and improve the robustness of the algorithm in the non-i.i.d setting. (v) We incorporate multi-phase information for more accurate pancreatic tumor detection. (vi) Finally, we show our discovery for potential pancreatic cancer screening on non-contrast CT scans which outperform expert radiologists

    Multimodal analysis for object classification and event detection

    Get PDF
    • 

    corecore