158 research outputs found
Recommended from our members
Online Maintenance of Minimal Directed Hypergraphs
In this paper we deal with directed hypergraphs, a generalization of directed graphs previously introduced in the literature. In particular, we investigate the problem of maintaining efficiently information about minimal hyperpaths while new hyperarcs are inserted. We consider several definitions of minimal hyperpath and we prove that accordingly to some of such definitions the problem of finding the minimal hyperpath is NP-complete, while hyperpaths with minimal GAP and minimal RANK can be found in polynomial time. We deal with this problem in an online fashion, by allowing insertions of hyperarcs in the hypergraphs. We present data structures and algorithms which allow to return a hyperpath with minimal GAP or RANK between an arbitrarily given pair of nodes in a time which is linear in its size. The total time required to maintain the data structure during the insertion of new hyperarcs is (9(m n2) for min-GAP and (9(m n2 log n) for min-RANK (where m is the total size of the description of the hyperarcs and n is the number of nodes). These results are useful in applications where directed hypergraphs are known to be a suitable model (e.g. transition networks, rewriting systems, database schemes, logic programming and problem solving)
Weighted-distance sliding windows and cooccurrence graphs for supporting entity-relationship discovery in unstructured text
The problem of Entity relation discovery in structured data, a well covered topic in literature, consists in searching within unstructured sources (typically, text) in order to find connections among entities. These can be a whole dictionary, or a specific collection of named items. In many cases machine learning and/or text mining techniques are used for this goal. These approaches might be unfeasible in computationally challenging problems, such as processing massive data streams. A faster approach consists in collecting the cooccurrences of any two words (entities) in order to create a graph of relations - a cooccurrence graph. Indeed each cooccurrence highlights some grade of semantic correlation between the words because it is more common to have related words close each other than having them in the opposite sides of the text. Some authors have used sliding windows for such problem: they count all the occurrences within a sliding windows running over the whole text. In this paper we generalise such technique, coming up to a Weighted-Distance Sliding Window, where each occurrence of two named items within the window is accounted with a weight depending on the distance between items: a closer distance implies a stronger evidence of a relationship. We develop an experiment in order to support this intuition, by applying this technique to a data set consisting in the text of the Bible, split into verses
Oii-web: An interactive online programming contest training system
In this paper we report our experience, related to the online training for the
Italian and International Olympiads in Informatics. We developed an interactive online
system, based on CMS, the grading system used in several major programming contests
including the International Olympiads in Informatics (IOI), and used it in three distinct
context: training students for the Italian Olympiads in Informatics (OII), training teachers
in order to be able to assist students for the OII, and training the Italian team for the
IOI. The system, that is freely available, proved to be a game changer for the whole italian
olympiads in informatics ecosystem: in one year, we almost doubled the participation to
OII, from 13k to 21k secondary school students.
The system is developed basing on the Contest Management System (CMS, http://cms-
dev.github.io/), so it is highly available to extensions supporting, for instance, the pro-
duction of feedback on problems solutions submitted by trainees. The system is also freely
available, with the idea of allowing for support to alternative necessities and developmen
Recommended from our members
Dynamic Data Structures for Series Parallel Digraphs
We consider the problem of dynamically maintaining general series parallel directed acyclic graphs (GSP dags), two-terminal series parallel directed acyclic graphs (TTSP dags) and looped series parallel directed graphs (looped SP digraphs). We present data structures for updating (by both inserting and deleting either a group of edges or vertices) GSP dags, TTSP clags and looped SP digraphs of m edges and n vertices in O( log n) worst-case time. The time required to check whether there is a path between two given vertices is O(log n), while a path of length k can be traced out in O(k + log n) time. For GSP and TTSP dags, our data structures are able to report a regular expression describing all the paths between two vertices x and y in O(h + log n), where h ≤ n is the total number of vertices which are contained in paths from x to y. Although GSP dags can have as many as O(n2) edges, we use an implicit representation which requires only O(n) space. Motivations for studying dynamic graphs arise in several areas, such as communication networks, Incremental compilation environments and the design of very high level languages, while the dynamic maintenance of series parallel graphs is also relevant in reducible flow diagrams
Risk Assessment for Venous Thromboembolism in Chemotherapy-Treated Ambulatory Cancer Patients: A Machine Learning Approach
OBJECTIVE: To design a precision medicine approach aimed at exploiting significant patterns in data, in order to produce venous thromboembolism (VTE) risk predictors for cancer outpatients that might be of advantage over the currently recommended model (Khorana score).
DESIGN: Multiple kernel learning (MKL) based on support vector machines and random optimization (RO) models were used to produce VTE risk predictors (referred to as machine learning [ML]-RO) yielding the best classification performance over a training (3-fold cross-validation) and testing set.
RESULTS: Attributes of the patient data set ( n = 1179) were clustered into 9 groups according to clinical significance. Our analysis produced 6 ML-RO models in the training set, which yielded better likelihood ratios (LRs) than baseline models. Of interest, the most significant LRs were observed in 2 ML-RO approaches not including the Khorana score (ML-RO-2: positive likelihood ratio [+LR] = 1.68, negative likelihood ratio [-LR] = 0.24; ML-RO-3: +LR = 1.64, -LR = 0.37). The enhanced performance of ML-RO approaches over the Khorana score was further confirmed by the analysis of the areas under the Precision-Recall curve (AUCPR), and the approaches were superior in the ML-RO approaches (best performances: ML-RO-2: AUCPR = 0.212; ML-RO-3-K: AUCPR = 0.146) compared with the Khorana score (AUCPR = 0.096). Of interest, the best-fitting model was ML-RO-2, in which blood lipids and body mass index/performance status retained the strongest weights, with a weaker association with tumor site/stage and drugs.
CONCLUSIONS: Although the monocentric validation of the presented predictors might represent a limitation, these results demonstrate that a model based on MKL and RO may represent a novel methodological approach to derive VTE risk classifiers. Moreover, this study highlights the advantages of optimizing the relative importance of groups of clinical attributes in the selection of VTE risk predictors
Ensuring sample quality for biomarker discovery studies - Use of ict tools to trace biosample life-cycle
The growing demand of personalized medicine marked the transition from an empirical medicine to a molecular one, aimed at predicting safer and more effective medical treatment for every patient, while minimizing adverse effects. This passage has emphasized the importance of biomarker discovery studies, and has led sample availability to assume a crucial role in biomedical research. Accordingly, a great interest in Biological Bank science has grown concomitantly. In biobanks, biological material and its accompanying data are collected, handled and stored in accordance with standard operating procedures (SOPs) and existing legislation. Sample quality is ensured by adherence to SOPs and sample whole life-cycle can be recorded by innovative tracking systems employing information technology (IT) tools for monitoring storage conditions and characterization of vast amount of data. All the above will ensure proper sample exchangeability among research facilities and will represent the starting point of all future personalized medicine-based clinical trials
The effect of social media and infodemic on mental health during the COVID-19 pandemic: results from the COMET multicentric trial
On January 30, 2020, the World Health Organization (WHO) declared the status of pandemic due to the COVID-19 infection. The initial phases of the pandemic were characterized by uncertainty and public fears. In order to cope with such unexpected conditions, people adopted different coping strategies, including search for information, accessing Internet, and using social media. The present study based on the COMET collaborative research network aims to: (1) assess use of Internet and of social media among the Italian general population; (2) explore differences in web usage between people with pre-existing mental disorders and the general population; (3) identify changes over time in social media usage along the phase 1 of the pandemic; (4) identify the clinical, socio-demographic and contextual predictors of excessive use of social media. A significant increase in time spent on Internet, with an average time of 4.8 ± 0.02 h per day, was found in the global sample of 20,720 participants. Compared with the general population, Internet use was significantly higher in people with pre-existing mental disorders (5.2 ± 0.1 h vs. 4.9 ± 0.02; p < 0.005). According to the multivariate logistic regression model, the risk of excessive use of social media and Internet was significantly higher in people with moderate levels of depressive symptoms (OR: 1.26, CI 95%: 0.99 to 1.59, p < 0.0.005); while protective factors were being students (OR: 0.72, CI 95%: 0.53 to 0.96, p < 0.0029) and living in central Italy (OR: 0.46, CI 95%: 0.23 to 0.90, p < 0.002). The evaluation of social media and Internet use by the general population represents a first step for developing specific protective and supportive interventions for the general population, including practical suggestions on how to safely use Internet and social media
COVID-19-Related Social Isolation Predispose to Problematic Internet and Online Video Gaming Use in Italy
COVID-19 pandemic and its related containment measures have been associated with increased levels of stress, anxiety and depression in the general population. While the use of digital media has been greatly promoted by national governments and international authorities to maintain social contacts and healthy lifestyle behaviors, its increased access may also bear the risk of inappropriate or excessive use of internet-related resources. The present study, part of the COVID Mental hEalth Trial (COMET) study, aims at investigating the possible relationship between social isolation, the use of digital resources and the development of their problematic use. A cross sectional survey was carried out to explore the prevalence of internet addiction, excessive use of social media, problematic video gaming and binge watching, during Italian phase II (May-June 2020) and III (June-September 2020) of the pandemic in 1385 individuals (62.5% female, mean age 32.5 +/- 12.9) mainly living in Central Italy (52.4%). Data were stratified according to phase II/III and three groups of Italian regions (northern, central and southern). Compared to the larger COMET study, most participants exhibited significant higher levels of severe-to-extremely-severe depressive symptoms (46.3% vs. 12.4%; p < 0.01) and extremely severe anxiety symptoms (77.8% vs. 7.5%; p < 0.01). We also observed a rise in problematic internet use and excessive gaming over time. Mediation analyses revealed that COVID-19-related general psychopathology, stress, anxiety, depression and social isolation play a significant role in the emergence of problematic internet use, social media addiction and problematic video gaming. Professional gamers and younger subjects emerged as sub-populations particularly at risk of developing digital addictions. If confirmed in larger and more homogenous samples, our findings may help in shedding light on possible preventive and treatment strategies for digital addictions
The effect of social media and infodemic on mental health during the COVID-19 pandemic: results from the COMET multicentric trial
On January 30, 2020, the World Health Organization (WHO) declared the status of pandemic due to the COVID-19 infection. The initial phases of the pandemic were characterized by uncertainty and public fears. In order to cope with such unexpected conditions, people adopted different coping strategies, including search for information, accessing Internet, and using social media. The present study based on the COMET collaborative research network aims to: (1) assess use of Internet and of social media among the Italian general population; (2) explore differences in web usage between people with pre-existing mental disorders and the general population; (3) identify changes over time in social media usage along the phase 1 of the pandemic; (4) identify the clinical, socio-demographic and contextual predictors of excessive use of social media. A significant increase in time spent on Internet, with an average time of 4.8  ±  0.02 h per day, was found in the global sample of 20,720 participants. Compared with the general population, Internet use was significantly higher in people with pre-existing mental disorders (5.2  ± 0.1 h vs. 4.9  ±  0.02; p < 0.005). According to the multivariate logistic regression model, the risk of excessive use of social media and Internet was significantly higher in people with moderate levels of depressive symptoms (OR: 1.26, CI 95%: 0.99 to 1.59, p < 0.0.005); while protective factors were being students (OR: 0.72, CI 95%: 0.53 to 0.96, p < 0.0029) and living in central Italy (OR: 0.46, CI 95%: 0.23 to 0.90, p < 0.002). The evaluation of social media and Internet use by the general population represents a first step for developing specific protective and supportive interventions for the general population, including practical suggestions on how to safely use Internet and social media
- …