281 research outputs found

    Fair Near Neighbor Search: Independent Range Sampling in High Dimensions. PODS

    Get PDF
    Similarity search is a fundamental algorithmic primitive, widely used in many computer science disciplines. There are several variants of the similarity search problem, and one of the most relevant is the rr-near neighbor (rr-NN) problem: given a radius r>0r>0 and a set of points SS, construct a data structure that, for any given query point qq, returns a point pp within distance at most rr from qq. In this paper, we study the rr-NN problem in the light of fairness. We consider fairness in the sense of equal opportunity: all points that are within distance rr from the query should have the same probability to be returned. In the low-dimensional case, this problem was first studied by Hu, Qiao, and Tao (PODS 2014). Locality sensitive hashing (LSH), the theoretically strongest approach to similarity search in high dimensions, does not provide such a fairness guarantee. To address this, we propose efficient data structures for rr-NN where all points in SS that are near qq have the same probability to be selected and returned by the query. Specifically, we first propose a black-box approach that, given any LSH scheme, constructs a data structure for uniformly sampling points in the neighborhood of a query. Then, we develop a data structure for fair similarity search under inner product that requires nearly-linear space and exploits locality sensitive filters. The paper concludes with an experimental evaluation that highlights (un)fairness in a recommendation setting on real-world datasets and discusses the inherent unfairness introduced by solving other variants of the problem.Comment: Proceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS), Pages 191-204, June 202

    Predictive models for Alzheimer's disease diagnosis and MCI identification: The use of cognitive scores and artificial intelligence algorithms

    Get PDF
    The paper presents a comprehensive study on predictive models for Alzheimer's disease (AD) and mild cognitive impairment (MCI) diagnosis, implementing a combination of cognitive scores and artificial intelligence algorithms. The research includes detailed analyses of clinical and demographic variables such as age, education, and various cognitive and functional scores, investigating their distributions and correlations with AD and MCI. The study utilizes several machine-learning classifiers, comparing their performance through metrics like accuracy, precision, and area under the ROC curve (AUC). Key findings include the influence of gender on AD prevalence, the potential protective effect of education, and the significance of functional decline and cognitive performance scores in the models. The results demonstrate the effectiveness of ensemble methods and the robustness of the models across different data subsets, highlighting the potential of artificial intelligence in enhancing diagnostic accuracy for Alzheimer's disease and mild cognitive impairment

    Distance-Sensitive Hashing

    Get PDF
    Locality-sensitive hashing (LSH) is an important tool for managing high-dimensional noisy or uncertain data, for example in connection with data cleaning (similarity join) and noise-robust search (similarity search). However, for a number of problems the LSH framework is not known to yield good solutions, and instead ad hoc solutions have been designed for particular similarity and distance measures. For example, this is true for output-sensitive similarity search/join, and for indexes supporting annulus queries that aim to report a point close to a certain given distance from the query point. In this paper we initiate the study of distance-sensitive hashing (DSH), a generalization of LSH that seeks a family of hash functions such that the probability of two points having the same hash value is a given function of the distance between them. More precisely, given a distance space (X,dist)(X, \text{dist}) and a "collision probability function" (CPF) f ⁣:R[0,1]f\colon \mathbb{R}\rightarrow [0,1] we seek a distribution over pairs of functions (h,g)(h,g) such that for every pair of points x,yXx, y \in X the collision probability is Pr[h(x)=g(y)]=f(dist(x,y))\Pr[h(x)=g(y)] = f(\text{dist}(x,y)). Locality-sensitive hashing is the study of how fast a CPF can decrease as the distance grows. For many spaces, ff can be made exponentially decreasing even if we restrict attention to the symmetric case where g=hg=h. We show that the asymmetry achieved by having a pair of functions makes it possible to achieve CPFs that are, for example, increasing or unimodal, and show how this leads to principled solutions to problems not addressed by the LSH framework. This includes a novel application to privacy-preserving distance estimation. We believe that the DSH framework will find further applications in high-dimensional data management.Comment: Accepted at PODS'18. Abstract shortened due to character limi

    Markov Properties of Electrical Discharge Current Fluctuations in Plasma

    Full text link
    Using the Markovian method, we study the stochastic nature of electrical discharge current fluctuations in the Helium plasma. Sinusoidal trends are extracted from the data set by the Fourier-Detrended Fluctuation analysis and consequently cleaned data is retrieved. We determine the Markov time scale of the detrended data set by using likelihood analysis. We also estimate the Kramers-Moyal's coefficients of the discharge current fluctuations and derive the corresponding Fokker-Planck equation. In addition, the obtained Langevin equation enables us to reconstruct discharge time series with similar statistical properties compared with the observed in the experiment. We also provide an exact decomposition of temporal correlation function by using Kramers-Moyal's coefficients. We show that for the stationary time series, the two point temporal correlation function has an exponential decaying behavior with a characteristic correlation time scale. Our results confirm that, there is no definite relation between correlation and Markov time scales. However both of them behave as monotonic increasing function of discharge current intensity. Finally to complete our analysis, the multifractal behavior of reconstructed time series using its Keramers-Moyal's coefficients and original data set are investigated. Extended self similarity analysis demonstrates that fluctuations in our experimental setup deviates from Kolmogorov (K41) theory for fully developed turbulence regime.Comment: 25 pages, 9 figures and 4 tables. V3: Added comments, references, figures and major correction

    Divergent paths for the selection of immunodominant epitopes from distinct antigenic sources

    Get PDF
    Immunodominant epitopes are few selected epitopes from complex antigens that initiate T cell responses. Here, to provide further insights into this process, we use a reductionist cell-free antigen processing system composed of defined components. We use the system to characterize steps in antigen processing of pathogen-derived proteins or autoantigens and we find distinct paths for peptide processing and selection. Autoantigen-derived immunodominant epitopes are resistant to digestion by cathepsins, whereas pathogen-derived epitopes are sensitive. Sensitivity to cathepsins enforces capture of pathogen-derived epitopes by Major Histocompatibility Complex class II (MHC class II) prior to processing, and resistance to HLA-DM-mediated-dissociation preserves the longevity of those epitopes. We show that immunodominance is established by higher relative abundance of the selected epitopes, which survive cathepsin digestion either by binding to MHC class II and resisting DM-mediated-dissociation, or being chemically resistant to cathepsins degradation. Non-dominant epitopes are sensitive to both DM and cathepsins and are destroyed

    Model for the Peptide-Free Conformation of Class II MHC Proteins

    Get PDF
    Background: Major histocompatibility complex proteins are believed to undergo significant conformational changes concomitant with peptide binding, but structural characterization of these changes has remained elusive. Methodology/Principal Findings: Here we use molecular dynamics simulations and experimental probes of protein conformation to investigate the peptide-free state of class II MHC proteins. Upon computational removal of the bound peptide from HLA-DR1-peptide complex, the a50-59 region folded into the P1-P4 region of the peptide binding site, adopting the same conformation as a bound peptide. Strikingly, the structure of the hydrophobic P1 pocket is maintained by engagement of the side chain of Phe a54. In addition, conserved hydrogen bonds observed in crystal structures between the peptide backbone and numerous MHC side chains are maintained between the a51-55 region and the rest of the molecule. The model for the peptide-free conformation was evaluated using conformationally-sensitive antibody and superantigen probes predicted to show no change, moderate change, or dramatic changes in their interaction with peptide-free DR1 and peptide-loaded DR1. The binding observed for these probes is in agreement with the movements predicted by the model. Conclusion/Significance: This work presents a molecular model for peptide-free class II MHC proteins that can help to interpret the conformational changes known to occur within the protein during peptide binding and release, and ca

    Synaptic proximity enables NMDAR signalling to promote brain metastasis.

    Get PDF
    Metastasis-the disseminated growth of tumours in distant organs-underlies cancer mortality. Breast-to-brain metastasis (B2BM) is a common and disruptive form of cancer and is prevalent in the aggressive basal-like subtype, but is also found at varying frequencies in all cancer subtypes. Previous studies revealed parameters of breast cancer metastasis to the brain, but its preference for this site remains an enigma. Here we show that B2BM cells co-opt a neuronal signalling pathway that was recently implicated in invasive tumour growth, involving activation by glutamate ligands of N-methyl-D-aspartate receptors (NMDARs), which is key in model systems for metastatic colonization of the brain and is associated with poor prognosis. Whereas NMDAR activation is autocrine in some primary tumour types, human and mouse B2BM cells express receptors but secrete insufficient glutamate to induce signalling, which is instead achieved by the formation of pseudo-tripartite synapses between cancer cells and glutamatergic neurons, presenting a rationale for brain metastasis.This work was principally supported by grants from the Swiss National Science Foundation and the European Research Council, and by a gift from the Biltema Foundation that was administered by the ISREC Foundation, Lausanne, Switzerland

    Three dimensional structure directs T-cell epitope dominance associated with allergy

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>CD4+ T-cell epitope immunodominance is not adequately explained by peptide selectivity in class II major histocompatibility proteins, but it has been correlated with adjacent segments of conformational flexibility in several antigens.</p> <p>Methods</p> <p>The published T-cell responses to two venom allergens and two aeroallergens were used to construct profiles of epitope dominance, which were correlated with the distribution of conformational flexibility, as measured by crystallographic B factors, solvent-accessible surface, COREX residue stability, and sequence entropy.</p> <p>Results</p> <p>Epitopes associated with allergy tended to be excluded from and lie adjacent to flexible segments of the allergen.</p> <p>Conclusion</p> <p>During the initiation of allergy, the N- and/or C-terminal ends of proteolytic processing intermediates were preferentially loaded into antigen presenting proteins for the priming of CD4+ T cells.</p

    Burden of obesity in the Eastern Mediterranean Region: findings from the Global Burden of Disease 2015 study

    Get PDF
    Mokdad AH, El Bcheraoui C, Afshin A, et al. Burden of obesity in the Eastern Mediterranean Region: findings from the Global Burden of Disease 2015 study. INTERNATIONAL JOURNAL OF PUBLIC HEALTH. 2018;63(Suppl. 1):165-176.We used the Global Burden of Disease (GBD) 2015 study results to explore the burden of high body mass index (BMI) in the Eastern Mediterranean Region (EMR). We estimated the prevalence of overweight and obesity among children (2-19 years) and adults (20 years) in 1980 and 2015. The burden of disease related to high BMI was calculated using the GBD comparative risk assessment approach. The prevalence of obesity increased for adults from 15.1% (95% UI 13.4-16.9) in 1980 to 20.7% (95% UI 18.8-22.8) in 2015. It increased from 4.1% (95% UI 2.9-5.5) to 4.9% (95% UI 3.6-6.4) for the same period among children. In 2015, there were 417,115 deaths and 14,448,548 disability-adjusted life years (DALYs) attributable to high BMI in EMR, which constitute about 10 and 6.3% of total deaths and DALYs, respectively, for all ages. This is the first study to estimate trends in obesity burden for the EMR from 1980 to 2015. We call for EMR countries to invest more resources in prevention and health promotion efforts to reduce this burden
    corecore