534 research outputs found

    Testing probability distributions underlying aggregated data

    Full text link
    In this paper, we analyze and study a hybrid model for testing and learning probability distributions. Here, in addition to samples, the testing algorithm is provided with one of two different types of oracles to the unknown distribution DD over [n][n]. More precisely, we define both the dual and cumulative dual access models, in which the algorithm AA can both sample from DD and respectively, for any i[n]i\in[n], - query the probability mass D(i)D(i) (query access); or - get the total mass of {1,,i}\{1,\dots,i\}, i.e. j=1iD(j)\sum_{j=1}^i D(j) (cumulative access) These two models, by generalizing the previously studied sampling and query oracle models, allow us to bypass the strong lower bounds established for a number of problems in these settings, while capturing several interesting aspects of these problems -- and providing new insight on the limitations of the models. Finally, we show that while the testing algorithms can be in most cases strictly more efficient, some tasks remain hard even with this additional power

    A Hypergraph Dictatorship Test with Perfect Completeness

    Full text link
    A hypergraph dictatorship test is first introduced by Samorodnitsky and Trevisan and serves as a key component in their unique games based \PCP construction. Such a test has oracle access to a collection of functions and determines whether all the functions are the same dictatorship, or all their low degree influences are o(1).o(1). Their test makes q3q\geq3 queries and has amortized query complexity 1+O(logqq)1+O(\frac{\log q}{q}) but has an inherent loss of perfect completeness. In this paper we give an adaptive hypergraph dictatorship test that achieves both perfect completeness and amortized query complexity 1+O(logqq)1+O(\frac{\log q}{q}).Comment: Some minor correction

    Cube Testers and Key Recovery Attacks On Reduced-Round MD6 and Trivium

    Get PDF
    CRYPTO 2008 saw the introduction of the hash function MD6 and of cube attacks, a type of algebraic attack applicable to cryptographic functions having a low-degree algebraic normal form over GF(2). This paper applies cube attacks to reduced round MD6, finding the full 128-bit key of a 14-round MD6 with complexity 2^22 (which takes less than a minute on a single PC). This is the best key recovery attack announced so far for MD6. We then introduce a new class of attacks called cube testers, based on efficient property-testing algorithms, and apply them to MD6 and to the stream cipher Trivium. Unlike the standard cube attacks, cube testers detect nonrandom behavior rather than performing key extraction, but they can also attack cryptographic schemes described by nonrandom polynomials of relatively high degree. Applied to MD6, cube testers detect nonrandomness over 18 rounds in 2^17 complexity; applied to a slightly modified version of the MD6 compression function, they can distinguish 66 rounds from random in 2^24 complexity. Cube testers give distinguishers on Trivium reduced to 790 rounds from random with 2^30 complexity and detect nonrandomness over 885 rounds in 2^27, improving on the original 767-round cube attack

    Cadherin–catenin expression in primary colorectal cancer: a survival analysis

    Get PDF
    Both cell adhesion and cell signalling events are mediated by components of the cadherin-catenin complex. Loss of expression of the components of this complex have been shown to correlate with invasive behaviour in many tumour types although their exact role in colorectal cancer remains unclear. Immunohistochemical analysis of the expression of components of the cadherin-catenin complex in colorectal cancers from 60 patients was undertaken. Loss of memberanous expression of E-cadherin, alpha-catenin and beta-catenin was demonstrated in 52%, 85% and 40% of tumours respectively. Focal nuclear expression of beta-catenin ( 75% of tumour cells per section) was seen in 11 (18%) tumours. Loss of membranous alpha-catenin expression significantly correlated with tumour de-differentiation (P = 0.009). There was a trend towards an association between advanced tumour stage and loss of membranous expression of alpha-catenin or beta-catenin, although these associations were not statistically significant. Univariate analysis revealed that advanced Dukes' stage, tumour de-differentiation, loss of membranous beta-catenin expression, cytoplasmic beta-catenin expression and widespread nuclear expression of beta-catenin all correlated with short survival following apparently curative resection of the primary tumour. However, only Dukes' stage (P = 0.002), tumour grade (P = 0.02) and widespread nuclear expression of beta-catenin (P = 0.002) were independent predictors of short survival. Disturbed growth signalling events in colorectal tumours are thought to result in nuclear accumulation of beta-catenin. Consequently, tumours with widespread nuclear expression of beta-catenin are likely to have severely abnormal growth characteristics, and which therefore might be predictive of short survival in these patients

    Sublinear time algorithms for earth mover's distance

    Get PDF
    We study the problem of estimating the Earth Mover’s Distance (EMD) between probability distributions when given access only to samples of the distributions. We give closeness testers and additive-error estimators over domains in [0, 1][superscript d], with sample complexities independent of domain size – permitting the testability even of continuous distributions over infinite domains. Instead, our algorithms depend on other parameters, such as the diameter of the domain space, which may be significantly smaller. We also prove lower bounds showing the dependencies on these parameters to be essentially optimal. Additionally, we consider whether natural classes of distributions exist for which there are algorithms with better dependence on the dimension, and show that for highly clusterable data, this is indeed the case. Lastly, we consider a variant of the EMD, defined over tree metrics instead of the usual l 1 metric, and give tight upper and lower bounds

    Mutations of the β- and γ-catenin genes are uncommon in human lung, breast, kidney, cervical and ovarian carcinomas

    Get PDF
    β-catenin forms complexes with Tcf and Lef-1 and functions as a transcriptional activator in the Wnt signalling pathway. Although recent investigations have been focused on the role of the adenomatous polyposis coli (APC)/ β-catenin/Tcf pathway in human tumorigenesis, there have been very few reports on mutations of the β-catenin gene in a variety of tumour types. Using PCR and single-strand conformational polymorphism analysis, we examined 93 lung, 9 breast, 6 kidney, 19 cervical and 7 ovarian carcinoma cell lines for mutations in exon 3 of the β-catenin gene. In addition, we tested these same samples for mutations in the NH2-terminal regulatory region of the γ-catenin gene. Mutational analysis for the entire coding region of β-catenin cDNA was also undertaken in 20 lung, 9 breast, 5 kidney and 6 cervical carcinoma cell lines. Deletion of most β-catenin coding exons was confirmed in line NCI-H28 (lung mesothelioma) and a silent mutation at codon 214 in exon 5 was found in HeLa (cervical adenocarcinoma). A missense mutation at codon 19 and a silent mutation at codon 28 in the NH2-terminal regulatory region of the γ-catenin gene were found in H1726 (squamous cell lung carcinoma) and H1048 (small cell lung carcinoma), respectively. Neither deletions nor mutations of these genes were detected in the other cell lines examined. These results suggest that β- and γ-catenins are infrequent mutational targets during development of human lung, breast, kidney, cervical and ovarian carcinomas. © 2001 Cancer Research Campaign http://www.bjcancer.co

    Sublinear-Time Language Recognition and Decision by One-Dimensional Cellular Automata

    Full text link
    After an apparent hiatus of roughly 30 years, we revisit a seemingly neglected subject in the theory of (one-dimensional) cellular automata: sublinear-time computation. The model considered is that of ACAs, which are language acceptors whose acceptance condition depends on the states of all cells in the automaton. We prove a time hierarchy theorem for sublinear-time ACA classes, analyze their intersection with the regular languages, and, finally, establish strict inclusions in the parallel computation classes SC\mathsf{SC} and (uniform) AC\mathsf{AC}. As an addendum, we introduce and investigate the concept of a decider ACA (DACA) as a candidate for a decider counterpart to (acceptor) ACAs. We show the class of languages decidable in constant time by DACAs equals the locally testable languages, and we also determine Ω(n)\Omega(\sqrt{n}) as the (tight) time complexity threshold for DACAs up to which no advantage compared to constant time is possible.Comment: 16 pages, 2 figures, to appear at DLT 202

    A study of patent thickets

    Get PDF
    Report analysing whether entry of UK enterprises into patenting in a technology area is affected by patent thickets in the technology area
    corecore