756 research outputs found

    Automatic Synchronization of Multi-User Photo Galleries

    Full text link
    In this paper we address the issue of photo galleries synchronization, where pictures related to the same event are collected by different users. Existing solutions to address the problem are usually based on unrealistic assumptions, like time consistency across photo galleries, and often heavily rely on heuristics, limiting therefore the applicability to real-world scenarios. We propose a solution that achieves better generalization performance for the synchronization task compared to the available literature. The method is characterized by three stages: at first, deep convolutional neural network features are used to assess the visual similarity among the photos; then, pairs of similar photos are detected across different galleries and used to construct a graph; eventually, a probabilistic graphical model is used to estimate the temporal offset of each pair of galleries, by traversing the minimum spanning tree extracted from this graph. The experimental evaluation is conducted on four publicly available datasets covering different types of events, demonstrating the strength of our proposed method. A thorough discussion of the obtained results is provided for a critical assessment of the quality in synchronization.Comment: ACCEPTED to IEEE Transactions on Multimedi

    Smartphone picture organization: a hierarchical approach

    Get PDF
    We live in a society where the large majority of the population has a camera-equipped smartphone. In addition, hard drives and cloud storage are getting cheaper and cheaper, leading to a tremendous growth in stored personal photos. Unlike photo collections captured by a digital camera, which typically are pre-processed by the user who organizes them into event-related folders, smartphone pictures are automatically stored in the cloud. As a consequence, photo collections captured by a smartphone are highly unstructured and because smartphones are ubiquitous, they present a larger variability compared to pictures captured by a digital camera. To solve the need of organizing large smartphone photo collections automatically, we propose here a new methodology for hierarchical photo organization into topics and topic-related categories. Our approach successfully estimates latent topics in the pictures by applying probabilistic Latent Semantic Analysis, and automatically assigns a name to each topic by relying on a lexical database. Topic-related categories are then estimated by using a set of topic-specific Convolutional Neuronal Networks. To validate our approach, we ensemble and make public a large dataset of more than 8,000 smartphone pictures from 40 persons. Experimental results demonstrate major user satisfaction with respect to state of the art solutions in terms of organization.Peer ReviewedPreprin

    Event Based Media Indexing

    Get PDF
    Multimedia data, being multidimensional by its nature, requires appropriate approaches for its organizing and sorting. The growing number of sensors for capturing the environmental conditions in the moment of media creation enriches data with context-awareness. This unveils enormous potential for eventcentred multimedia processing paradigm. The essence of this paradigm lies in using events as the primary means for multimedia integration, indexing and management. Events have the ability to semantically encode relationships of different informational modalities. These modalities can include, but are not limited to: time, space, involved agents and objects. As a consequence, media processing based on events facilitates information perception by humans. This, in turn, decreases the individualโ€™s effort for annotation and organization processes. Moreover events can be used for reconstruction of missing data and for information enrichment. The spatio-temporal component of events is a key to contextual analysis. A variety of techniques have recently been presented to leverage contextual information for event-based analysis in multimedia. The content-based approach has demonstrated its weakness in the field of event analysis, especially for the event detection task. However content-based media analysis is important for object detection and recognition and can therefore play a role which is complementary to that of event-driven context recognition. The main contribution of the thesis lies in the investigation of a new eventbased paradigm for multimedia integration, indexing and management. In this dissertation we propose i) a novel model for event based multimedia representation, ii) a robust approach for mining events from multimedia and iii) exploitation of detected events for data reconstruction and knowledge enrichment

    Digital life stories: Semi-automatic (auto)biographies within lifelog collections

    Get PDF
    Our life stories enable us to reflect upon and share our personal histories. Through emerging digital technologies the possibility of collecting life experiences digitally is increasingly feasible; consequently so is the potential to create a digital counterpart to our personal narratives. In this work, lifelogging tools are used to collect digital artifacts continuously and passively throughout our day. These include images, documents, emails and webpages accessed; texts messages and mobile activity. This range of data when brought together is known as a lifelog. Given the complexity, volume and multimodal nature of such collections, it is clear that there are significant challenges to be addressed in order to achieve coherent and meaningful digital narratives of our events from our life histories. This work investigates the construction of personal digital narratives from lifelog collections. It examines the underlying questions, issues and challenges relating to construction of personal digital narratives from lifelogs. Fundamentally, it addresses how to organize and transform data sampled from an individualโ€™s day-to-day activities into a coherent narrative account. This enquiry is enabled by three 20-month long-term lifelogs collected by participants and produces a narrative system which enables the semi-automatic construction of digital stories from lifelog content. Inspired by probative studies conducted into current practices of curation, from which a set of fundamental requirements are established, this solution employs a 2-dimensional spatial framework for storytelling. It delivers integrated support for the structuring of lifelog content and its distillation into storyform through information retrieval approaches. We describe and contribute flexible algorithmic approaches to achieve both. Finally, this research inquiry yields qualitative and quantitative insights into such digital narratives and their generation, composition and construction. The opportunities for such personal narrative accounts to enable recollection, reminiscence and reflection with the collection owners are established and its benefit in sharing past personal experience experiences is outlined. Finally, in a novel investigation with motivated third parties we demonstrate the opportunities such narrative accounts may have beyond the scope of the collection owner in: personal, societal and cultural explorations, artistic endeavours and as a generational heirloom

    Discriminative Probabilistic Pattern Mining using Graph for Electronic Health Records

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€,2019. 8. ๊น€์„ .์ „์ž์˜๋ฃŒ๊ธฐ๋ก(Electronic Health Records)์˜ ์ž„์ƒ ๋…ธํŠธ์—๋Š” ํ™˜์ž์˜ ๋ณ‘๋ ฅ์— ๋Œ€ํ•œ ์œ ์šฉํ•œ ์ •๋ณด๊ฐ€ ๋งŽ์ด ํฌํ•จ๋˜์–ด ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์ž„์ƒ ๋…ธํŠธ๋Š” ์ฒด๊ณ„ํ™”๋˜์ง€ ์•Š์€ ๋ฐ์ดํ„ฐ์ด๋ฉฐ ๊ทธ ์–‘์€ ๋‚˜๋‚ ์ด ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ์ž„์ƒ ๋…ธํŠธ๋ฅผ ๊ทธ๋ฃนํ™”ํ•˜๊ณ  ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•œ ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹ ๊ธฐ์ˆ ์ด ํ•„์š”ํ•˜๋‹ค. ๊ธฐ์กด์˜ ๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹ ๊ธฐ์ˆ ์€ ํ‚ค์›Œ๋“œ์˜ ๋นˆ๋„๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ƒ์„ฑ๋œ ๋นˆ๋ฐœ ํŒจํ„ด(frequent patterns)์„ ์ด์šฉํ•˜์—ฌ ๊ทธ๋ฃน ๋ถ„๋ฅ˜ ์ž‘์—…(classification)์„ ์ˆ˜ํ–‰ํ•œ๋‹ค. ํ•˜์ง€๋งŒ ์ด๋Ÿฌํ•œ ๋นˆ๋ฐœ ํŒจํ„ด์€ ์ „์ž์˜๋ฃŒ๊ธฐ๋ก์˜ ์ž„์ƒ ๋…ธํŠธ์™€ ๊ฐ™์ด ๋ณต์žกํ•œ ๋ฐ์ดํ„ฐ์˜ ๋ถ„๋ฅ˜๋ฅผ ์œ„ํ•ด ํ•„์š”ํ•œ ์ถฉ๋ถ„ํžˆ ๊ฐ•๋ ฅํ•˜๊ณ  ๋ช…ํ™•ํ•˜๊ฒŒ ๊ตฌ๋ณ„๋˜๋Š” ํŠน์ง•์„ ๊ฐ–๊ณ  ์žˆ์ง€ ์•Š๋‹ค. ๋˜ํ•œ ๋นˆ๋ฐœ ํŒจํ„ด ๊ธฐ๋ฐ˜ ๊ธฐ์ˆ ์€ ๋Œ€๊ทœ๋ชจ ์ „์ž์˜๋ฃŒ๊ธฐ๋ก ๋ฐ์ดํ„ฐ์— ์ ์šฉ๋  ๋•Œ ํ™•์žฅ์„ฑ๊ณผ ๊ณ„์‚ฐ ๋น„์šฉ์˜ ๋ฌธ์ œ์— ์ง๋ฉดํ•œ๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ํ™•๋ฅ ์  ํŒ๋ณ„ ํŒจํ„ด ๋งˆ์ด๋‹(discriminative probabilistic pattern mining) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์†Œ๊ฐœํ•œ๋‹ค. ํ™•๋ฅ ์  ํŒ๋ณ„ ํŒจํ„ด ๋งˆ์ด๋‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜์—์„œ๋Š” ์ „์ž์˜๋ฃŒ๊ธฐ๋ก์˜ ์ž„์ƒ ๋…ธํŠธ๋ฅผ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•ด ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐ๋ฅผ ๋„์ž…ํ•˜์—ฌ ๋นˆ๋ฐœ ํŒจํ„ด์˜ ๋ถ€๋ถ„ ๊ทธ๋ž˜ํ”„๋ฅผ ์ƒ์„ฑํ•˜๊ฒŒ ๋œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํŒ๋ณ„๋ ฅ์„ ๋†’์ด๊ธฐ ์œ„ํ•ด ๊ฐœ๋ณ„ ํ‚ค์›Œ๋“œ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋Œ€์‹  ์ด์ง„ ํŠน์„ฑ ์กฐํ•ฉ์—์„œ์˜ ๋™์‹œ ์ถœํ˜„(co-occurrence)์„ ์‚ฌ์šฉํ•˜์—ฌ ์ž„์ƒ ๋…ธํŠธ ๋ถ„๋ฅ˜๋ฅผ ์œ„ํ•œ ๋นˆ๋ฐœ ํŒจํ„ด ๊ทธ๋ž˜ํ”„๋ฅผ ๊ตฌ์„ฑํ•œ๋‹ค. ๊ฐ๊ฐ์˜ ๋™์‹œ ์ถœํ˜„์€ ํŒ๋ณ„๋ ฅ(discriminative power)์— ๋”ฐ๋ฅธ log-odds ๊ฐ’์œผ๋กœ ๊ทธ ๊ฐ€์ค‘์น˜๋ฅผ ๊ฐ–๋Š”๋‹ค. ์ž„์ƒ ๋…ธํŠธ์˜ ๋ณธ์งˆ์„ ๋ฐ˜์˜ํ•˜๋Š” ๊ทธ๋ž˜ํ”„๋ฅผ ์ฐพ๊ธฐ ์œ„ํ•ด ํ™•๋ฅ ์  ํŒ๋ณ„ ๋ถ€๋ถ„ ๊ทธ๋ž˜ํ”„ ๊ฒ€์ƒ‰์„ ์ˆ˜ํ–‰ํ•˜๋ฉฐ ๊ทธ๋ž˜ํ”„์˜ ํ—ˆ๋ธŒ(hub) ๋…ธ๋“œ์—์„œ ์‹œ์ž‘ํ•˜์—ฌ ๋™์  ํ”„๋กœ๊ทธ๋ž˜๋ฐ(dynamic programming)์„ ์‚ฌ์šฉํ•˜์—ฌ ๊ฒฝ๋กœ๋ฅผ ์ฐพ๋Š”๋‹ค. ์ด๋Ÿฌํ•œ ๋ฐฉ๋ฒ•์œผ๋กœ ๊ฒ€์ƒ‰ํ•œ ๋นˆ๋ฐœ ๋ถ€๋ถ„ ๊ทธ๋ž˜ํ”„๋ฅผ ์ด์šฉํ•˜์—ฌ ์ „์ž์˜๋ฃŒ๊ธฐ๋ก์˜ ์ž„์ƒ ๋…ธํŠธ์— ๋Œ€ํ•œ ๋ถ„๋ฅ˜ ์ž‘์—…์„ ์ˆ˜ํ–‰ํ•˜๊ฒŒ ๋œ๋‹ค.Electronic Health Records (EHR) contains plenty of useful information about patients medical history. However, EHR is highly unstructured data and amount of it is growing continuously, that is why there is a need in a reliable data mining technique to group and categorize clinical notes. Although, many existing data mining techniques for group classification use frequent patterns generated based on frequencies of keywords, these patterns do not possess strong enough distinguishing characteristics to show the difference between datasets to classify complex data such as clinical notes in EHR. Also, these techniques encounter scalability and computational cost problems when used on large EHR dataset. To address these issues, we introduce discriminative probabilistic pattern mining algorithm that uses a graph (DPPMG) to generate the subgraphs of frequent patterns for classification in electronic health records. We use co-occurrence, a combination of binary features, which is more discriminative than individual keywords to construct discriminative probabilistic frequent patterns graph for clinical notes classification. Each co-occurrence has a weight of log-odds score that is associated with its discriminative power. The graph, which reflects the essence of clinical notes is searched to find discriminative probabilistic frequent subgraphs. To discover the discriminative frequent subgraphs, we start from a hub node in the graph and use dynamic programming to find a path. The discriminative probabilistic frequent subgraphs discovered by this approach are later used to classify clinical notes of electronic health records.Chapter 1 Introduction and Motivation 1 Chapter 2 Background 4 2.1 Frequent Pattern Based Classification 4 2.2 Discriminative Pattern Mining 5 2.3 Electronic Health Records 6 Chapter 3 Related Work 8 Chapter 4 Overview and Design 10 Chapter 5 Implementation 12 5.1 Dataset 12 5.2 Keyword Extraction and Filtering 15 5.3 Co-occurrence Generation and Graph Construction 16 5.4 Dynamic Programming to Discover Optimal Path 17 Chapter 6 Results and Evaluation 20 6.1 Choosing Starting Hub Node 20 6.2 Qualitative Analysis 22 6.3 Discriminative Power of the Probabilistic Frequent Patterns 24 Chapter 7 Conclusion 26 Bibliography 28 ์š”์•ฝ 33Maste

    REAL TIME ASSISTANCE IN PHOTOGRAPHY USING SOCIAL MEDIA

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH
    • โ€ฆ
    corecore