44 research outputs found

    Dissecting the Butterfly: Representation of Disciplines Publishing at the Web Science Conference Series

    Web Science is an interdisciplinary arena. Motivated by the unforeseen scale and impact of the web, it addresses web-related research questions in a holistic manner, incorporating perspectives from a broad set of disciplines. There has been ongoing discussion about which disciplines are more or less present in the community, and about defining Web Science itself; there is, however, a dearth of empirical work in this area. This research note presents an early analysis of the presence of different disciplines in the Web Science community. To gain insight into this area, we applied Natural Language Processing and topic extraction to Web Science papers from 2009 to 2011. We compare the results to two current representations of Web Science: the 'Web Science butterfly' diagram and the Web Science Subject Categorization. We discuss the benefits of such an exploratory analysis, our early results, and steps for producing more robust results.

    Facilitating the Exploitation of Linked Open Statistical Data: JSON-QB API Requirements and Design Criteria

    Recently, many organizations have opened up their data for others to reuse. A major part of these data concerns statistics such as demographic and social indicators. Linked Data is a promising paradigm for opening data because it facilitates data integration on the Web. A growing number of organizations have adopted the Linked Data paradigm and provided Linked Open Statistical Data (LOSD). These data can be exploited to create added-value services and applications that require integrated data from multiple sources. In this paper, we suggest that, in order to unleash the full potential of LOSD, we need to facilitate interaction with LOSD and hide most of its complexity. Moreover, we describe the requirements and design criteria of a JSON-QB API that (i) facilitates the development of LOSD tools through a style of interaction familiar to web developers and (ii) offers a uniform way to access LOSD. A proof-of-concept implementation of the JSON-QB API demonstrates part of the proposed functionality.

    Interoperability Conflicts in Linked Open Statistical Data

    An important part of Open Data is of a statistical nature and describes economic and social indicators monitoring population size, inflation, trade, and employment. Combining and analyzing Open Data from multiple datasets and sources enables advanced data analytics scenarios that could result in valuable services and data products. However, it is still difficult to discover and combine Open Statistical Data that reside in different data portals. Although Linked Open Statistical Data (LOSD) provide standards and approaches to facilitate combining statistics on the Web, various interoperability challenges still exist. In this paper, we propose an Interoperability Framework for LOSD, comprising definitions of LOSD interoperability conflicts as well as modelling practices currently used by six official open government data portals. Towards this end, we combine a top-down approach that studies interoperability conflicts in the literature with a bottom-up approach that studies the modelling practices of data portals. We define two types of LOSD schema-level conflicts, namely naming conflicts and structural conflicts. Naming conflicts result from using different URIs. Structural conflicts result from different practices of modelling the structure of data cubes. Only two of the 19 conflicts are currently resolved, and 11 can be resolved according to the literature.
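
    As an illustration of the naming conflicts described above, the following minimal sketch flags concepts that two portals both publish but identify with different URIs. The URIs, the concept labels, and the `naming_conflicts` helper are all hypothetical; they are not taken from the paper or from any real portal.

```python
# Hypothetical dimension URIs published by two portals (not real portal data).
portal_a = {
    "http://example.org/a/dimension/refArea",
    "http://example.org/shared/dimension/refPeriod",
}
portal_b = {
    "http://example.org/b/dim/area",
    "http://example.org/shared/dimension/refPeriod",
}

# Hypothetical alignment from each URI to the concept it denotes.
alignment = {
    "http://example.org/a/dimension/refArea": "area",
    "http://example.org/b/dim/area": "area",
    "http://example.org/shared/dimension/refPeriod": "period",
}


def naming_conflicts(dims_a, dims_b, concept_of):
    """Return {concept: (uri_a, uri_b)} where both portals model the
    same concept but identify it with different URIs."""
    by_concept_a = {concept_of[uri]: uri for uri in dims_a}
    by_concept_b = {concept_of[uri]: uri for uri in dims_b}
    shared = by_concept_a.keys() & by_concept_b.keys()
    return {
        concept: (by_concept_a[concept], by_concept_b[concept])
        for concept in shared
        if by_concept_a[concept] != by_concept_b[concept]
    }


conflicts = naming_conflicts(portal_a, portal_b, alignment)
```

    Here only the "area" concept is reported, because both portals reuse the same URI for the period dimension.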

    Development of a Method for the Quantitative Determination of Total trans-Resveratrol in Wines Using NMR Spectroscopy

    Trans-resveratrol is a phenolic compound of the stilbene family that is found in various parts of the grape, including the berry skin. Resveratrol belongs to a broader class of compounds known as phytoalexins, which are produced under biotic and abiotic stress, such as attack by microorganisms or tissue damage from ultraviolet radiation, and which are transferred to the must and the wine during vinification. Resveratrol has strong antioxidant activity: it neutralizes oxygen radicals and oxidizing compounds that are produced in tissues or circulate in blood plasma, and it prevents the oxidation of metals in the body. In this respect, resveratrol has cardioprotective properties. According to more recent studies, resveratrol also appears to protect against the development of malignant tumors, to inhibit the onset and growth of skin cancer, and to have anti-inflammatory activity. In this study we developed a novel and rapid method for quantifying total trans-resveratrol (more precisely, total trans-resveratrol derivatives) in wines using NMR spectroscopy. We then analyzed 10 samples of industrially produced wines from the indigenous PDO (Protected Designation of Origin) red varieties Agiorgitiko, Xinomavro, Fokianos, Liatiko, Kotsifali, Mavrodaphni, Limnio and Moschato Tyrnavou and determined their total content of resveratrol derivatives. The measurements showed that Kotsifali, Xinomavro and Agiorgitiko had the highest resveratrol content, a finding consistent with earlier scientific publications.

    Linked Open Government Data Analytics

    The public sector produces, collects, maintains, and disseminates a wealth of data. The potential of exploiting these government data to boost, among others, economic activity, innovation and public administration transparency is widely recognised.
    In 2009, responding to the call of Sir Tim Berners-Lee, inventor of the World Wide Web, governments worldwide started to make data available online on a massive scale, under open licenses and in technical formats that facilitate reuse. They launched Open Government Data (OGD) portals that operate as single points of access for government data.
    The focus of this thesis is the OGD movement and its contribution to realising the potential of government data. Towards this end, we study OGD with a holistic approach, taking into account the viewpoints of both providers and consumers.
    Initiatives that provide OGD are part of the public sector and as such inherit deficiencies stemming from its decentralised organisational structure, which comprises multiple administrative levels and functional areas. Moreover, the technological formats and the structure of the data provided through OGD portals affect data exploitation. Linked Data was proposed early on as the most advanced technological paradigm for opening up data because it facilitates data integration across the Web. Moreover, aggregated statistics (e.g. economic and social indicators) structured as multi-dimensional data cubes constitute a major part of OGD.
    On the other hand, consumers perceive OGD as a small fraction of the massive amounts of data that are produced daily and made available on the Web by various sources such as social media, research institutions, and news media. These data are provided in different technological formats and sometimes with diverse access constraints. In this emerging reality, the integration of OGD with other Web data is of vital importance for addressing the needs of consumers. Moreover, we consider that OGD exploitation has to capitalise on the paradigm of data analytics, which has already enabled organisations to successfully exploit their own data in various problem areas such as business intelligence.
    Within this problem formulation, in this thesis we explore (a) provision, (b) integration, and (c) exploitation in data analytics of OGD, and we propose specific solutions, including conceptual models, architectures, and software tools, that contribute towards realising the full potential of government data. The proposed solutions are evaluated in scenarios that involve real-world datasets from OGD portals, social media, clinical trials, etc. Since the OGD movement emerged only recently, we ground our analysis in traditional conceptual models of electronic government.
    The contribution of this thesis can be summarised as follows:
    Provision
    • An OGD classification scheme that provides an understanding of the domain.
    • An OGD stage model that can be used as a roadmap for future endeavours.
    • A process model that describes the lifecycle of multi-dimensional OGD.
    Integration
    • Architectures and implementations for integrating OGD and social media data on the Linked Data Web.
    • A theoretical framework for integrating multi-dimensional OGD.
    • An analysis of the challenges of integrating multi-dimensional OGD on the Linked Data Web.
    Exploitation in Data Analytics
    • A set of software tools that enable online analytical processing (OLAP) on top of multiple datasets across the Linked Data Web.
    • A study of exploratory analytics on top of integrated data for understanding elections.
    • A process model that enables exploiting social media data in predictive analytics. The model was used to evaluate the predictive power of social media and to design a case for predicting the winner of the 2010 UK elections using integrated Twitter and linked open data.
    Access
    • An access control framework that enables combining open and private data on the Linked Data Web.

    Open Government Data: A Stage Model

    Part 3: Governance, Openness and Institutions
    Public sector information constitutes a valuable primary material for added-value services and products, which, however, remains unexploited. Recently, Open Government Data (OGD) initiatives have emerged worldwide, aiming to make public data freely available to everyone without limiting restrictions. Despite this potential, however, there is currently a lack of roadmaps, guidelines and benchmarking frameworks to drive and measure OGD progress. This is particularly true because the stage models proposed for measuring eGovernment progress focus on services and do not sufficiently consider data. In this paper, we capitalize on the literature on eGovernment stage models and OGD initiatives to propose a stage model for OGD. The proposed model has two main dimensions, namely organizational and technological complexity, and added value for data consumers. We anticipate that the proposed model will open up a scientific discussion on OGD stage models and will be used by practitioners for constructing roadmaps and for benchmarking, just as the European Union stage model is currently used for measuring the online sophistication of public services.

    Processing Linked Open Data Cubes

    Part 2: Open and Smart Government
    A significant part of the open data provided by governments and international organizations concerns statistics such as demographics and economic indicators. The real value of open statistical data, however, will be unveiled by performing analytics on top of combined datasets from disparate sources. Linked data provide the most promising technological paradigm to enable such analytics across the Web. Currently, however, relevant processes and tools do not fully exploit the distinctive characteristics of statistical data. The aim of this paper is to present a process that enables publishing raw statistical data as linked data, combining statistics from multiple sources, and exploiting them in data analytics and visualizations. Moreover, the capability of existing software tools to support the vision of linked statistical data analytics is evaluated. We anticipate that the proposed process will contribute to the development of a roadmap for future research and development in the area.

    Greek Government Vehicles

    A dataset that contains aggregated information about government vehicles in Greece. The dataset is serialized in RDF and uses the RDF Data Cube Vocabulary (a W3C Recommendation). It contains measures such as the count of vehicles; the average, minimum and maximum age; the average, minimum and maximum engine displacement; and the average, minimum and maximum seating capacity. All measures are broken down by vehicle type, fuel type and the governmental agency that owns the vehicle.
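
    The shape of such an aggregated dataset can be sketched as follows: each combination of vehicle type, fuel type and agency is one observation carrying several measures. This is only an illustration under stated assumptions; the sample records, the field names and the `build_cube` helper are hypothetical, only the age measures are shown, and no RDF is produced.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical raw records: (vehicle_type, fuel_type, agency, age_in_years).
records = [
    ("car", "petrol", "Agency A", 12),
    ("car", "petrol", "Agency A", 8),
    ("truck", "diesel", "Agency B", 20),
]


def build_cube(rows):
    """Group raw records by the three dimensions and compute the
    count and the average/minimum/maximum age for each group."""
    groups = defaultdict(list)
    for vehicle_type, fuel_type, agency, age in rows:
        groups[(vehicle_type, fuel_type, agency)].append(age)
    return {
        key: {
            "count": len(ages),
            "avg_age": mean(ages),
            "min_age": min(ages),
            "max_age": max(ages),
        }
        for key, ages in groups.items()
    }


cube = build_cube(records)
```

    Each key of `cube` plays the role of one observation's dimension values, and the inner dictionary holds its measures; engine displacement and seating capacity would be aggregated the same way.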

    Open Statistics: The Rise of a New Era for Open Data?

    Part 1: E-Government Foundations
    A large part of open data concerns statistics, such as demographic, economic and social data (henceforth referred to as Open Statistical Data, OSD). In this paper we start by introducing open data fragmentation as a major obstacle to OSD reuse. We proceed by outlining the data cube as a logical model for structuring OSD. We then introduce Open Statistics as a new area that aims to study OSD systematically. Open Statistics reuses and extends methods from diverse fields such as Open Data, Statistics, Data Warehouses and the Semantic Web. In this paper, we focus on the benefits and challenges of Open Statistics. The results suggest that Open Statistics provides benefits not present in any of these fields alone. We conclude that in certain cases OSD can realise the potential of open data.

    Applying Brand Equity Theory to Understand Consumer Opinion in Social Media

    Billions of people use Social Media (SM), such as Facebook and Twitter, every day to express their opinions about and experiences with brands. Companies are highly interested in understanding such brand-related SM content. Consequently, many studies have been conducted and many applications have been developed to analyse this content. For analysis purposes, the main SM metrics used include volume and sentiment. Interestingly, however, brand equity theory proposes different metrics for assessing brand reputation. These include brand image, brand satisfaction and purchase intention (henceforth referred to as marketing metrics). The objective of this paper is to explore the feasibility of applying marketing metrics to brand-related Twitter content. For this purpose, we collect, study and analyse tweets that mention two brands, namely IKEA and Gatorade. The manual analysis suggests that a significant share of brand tweets is related to brand image, brand satisfaction and purchase intention. We thereafter design an algorithm that classifies tweets into the relevant categories to enable automatic computation of marketing metrics. We implement the algorithm using statistical learning approaches and show that its classification accuracy is good. We anticipate that this article will motivate other studies, as well as application designers, to adopt marketing theories when evaluating brand reputation through SM content.