Search CORE

77 research outputs found

What Can We Learn Privately?

Author: Adam Smith
K. Lee
Kasiviswanathan Homin
Kobbi Nissim
Shiva Prasad
Sofya Raskhodnikova
Publication venue
Publication date: 01/01/2010
Field of study

Learning problems form an important category of computational tasks that generalizes many of the computations researchers apply to large real-life data sets. We ask: what concept classes can be learned privately, namely, by an algorithm whose output does not depend too heavily on any one input or specific training example? More precisely, we investigate learning algorithms that satisfy differential privacy, a notion that provides strong confidentiality guarantees in contexts where aggregate information is released about a database containing sensitive information about individuals. We demonstrate that, ignoring computational constraints, it is possible to privately agnostically learn any concept class using a sample size approximately logarithmic in the cardinality of the concept class. Therefore, almost anything learnable is learnable privately: specifically, if a concept class is learnable by a (non-private) algorithm with polynomial sample complexity and output size, then it can be learned privately using a polynomial number of samples. We also present a computationally efficient private PAC learner for the class of parity functions. Local (or randomized response) algorithms are a practical class of private algorithms that have received extensive investigation. We provide a precise characterization of local private learning algorithms. We show that a concept class is learnable by a local algorithm if and only if it is learnable in the statistical query (SQ) model. Finally, we present a separation between the power of interactive and noninteractive local learning algorithms.Comment: 35 pages, 2 figure

arXiv.org e-Print Archive

CiteSeerX

Recommended from our members

On the Learnability of Monotone Functions

Author: Lee Homin K.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2009
Field of study

A longstanding lacuna in the field of computational learning theory is the learnability of succinctly representable monotone Boolean functions, i.e., functions that preserve the given order of the input. This thesis makes significant progress towards understanding both the possibilities and the limitations of learning various classes of monotone functions by carefully considering the complexity measures used to evaluate them. We show that Boolean functions computed by polynomial-size monotone circuits are hard to learn assuming the existence of one-way functions. Having shown the hardness of learning general polynomial-size monotone circuits, we show that the class of Boolean functions computed by polynomial-size depth-3 monotone circuits are hard to learn using statistical queries. As a counterpoint, we give a statistical query learning algorithm that can learn random polynomial-size depth-2 monotone circuits (i.e., monotone DNF formulas). As a preliminary step towards a fully polynomial-time, proper learning algorithm for learning polynomial-size monotone decision trees, we also show the relationship between the average depth of a monotone decision tree, its average sensitivity, and its variance. Finally, we return to monotone DNF formulas, and we show that they are teachable (a different model of learning) in the average case. We also show that non-monotone DNF formulas, juntas, and sparse GF2 formulas are teachable in the average case

Columbia University Academic Commons

Recommended from our members

ErmineJ: Tool for functional analysis of gene expression data sets

Author: Braynen William
Keshav Kiran
Lee Homin K
Pavlidis Paul
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: It is common for the results of a microarray study to be analyzed in the context of biologically-motivated groups of genes such as pathways or Gene Ontology categories. The most common method for such analysis uses the hypergeometric distribution (or a related technique) to look for "over-representation" of groups among genes selected as being differentially expressed or otherwise of interest based on a gene-by-gene analysis. However, this method suffers from some limitations, and biologist-friendly tools that implement alternatives have not been reported. RESULTS: We introduce ErmineJ, a multiplatform user-friendly stand-alone software tool for the analysis of functionally-relevant sets of genes in the context of microarray gene expression data. ErmineJ implements multiple algorithms for gene set analysis, including over-representation and resampling-based methods that focus on gene scores or correlation of gene expression profiles. In addition to a graphical user interface, ErmineJ has a command line interface and an application programming interface that can be used to automate analyses. The graphical user interface includes tools for creating and modifying gene sets, visualizing the Gene Ontology as a table or tree, and visualizing gene expression data. ErmineJ comes with a complete user manual, and is open-source software licensed under the Gnu Public License. CONCLUSION: The availability of multiple analysis algorithms, together with a rich feature set and simple graphical interface, should make ErmineJ a useful addition to the biologist's informatics toolbox. ErmineJ is available from

Columbia University Academic Commons

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

Design of PC beam-column joint applied X-braced bars in the segmented structural system

Author: Homin Chun
Kappyo Hong
Seongsoo Lee
Sijun Kim
Publication venue: 'Vilnius Gediminas Technical University'
Publication date: 01/05/2016
Field of study

This study suggests a joint design using an X-brace bar to identify the stability and structural performance of a precast concrete (PC) beam-column joint design, which may cause problems when used in a segmented PC beam system for a long-span structure. For this, an experimental PC beam-column model at half scale was designed and verified for applicability of X-braced bars in a panel zone. While previous studies suggested the development of a long-span structural system using precast concrete (PC) and described the problems with PC beam-column joints, this study proposes a solution to improve the structural stability and performance of a PC beam-column joint design and conducts analytical verification

Directory of Open Access Journals

VGTU Journals (Vilnius Gediminas Technical University - Vilnius Tech)

Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training

Author: Anzaku Esla Timothy
Boga Beril
De Neve Wesley
Lee Hyun Jung
Ozbulak Utku
Park Homin
Van Messem Arnout
Vankerschaver Joris
Publication venue
Publication date: 01/05/2023
Field of study

peer reviewedAlthough supervised learning has been highly successful in improving the state-of-the-art in the domain of image-based computer vision in the past, the margin of improvement has diminished significantly in recent years, indicating that a plateau is in sight. Meanwhile, the use of self-supervised learning (SSL) for the purpose of natural language processing (NLP) has seen tremendous successes during the past couple of years, with this new learning paradigm yielding powerful language models. Inspired by the excellent results obtained in the field of NLP, self-supervised methods that rely on clustering, contrastive learning, distillation, and information-maximization, which all fall under the banner of discriminative SSL, have experienced a swift uptake in the area of computer vision. Shortly afterwards, generative SSL frameworks that are mostly based on masked image modeling, complemented and surpassed the results obtained with discriminative SSL. Consequently, within a span of three years, over 100 unique general-purpose frameworks for generative and discriminative SSL, with a focus on imaging, were proposed. In this survey, we review a plethora of research efforts conducted on image-oriented SSL, providing a historic view and paying attention to best practices as well as useful software packages. While doing so, we discuss pretext tasks for image-based SSL, as well as techniques that are commonly used in image-based SSL. Lastly, to aid researchers who aim at contributing to image-focused SSL, we outline a number of promising research directions

Open Repository and Bibliography - Liège

Bioluminescence-Activated Deep-Tissue Photodynamic Therapy of Cancer

Author: Gou Young Koh
Homin Kim
Jin Woo Choi
Sang-Hee Lee
Sei Kwang Hahn
Seok Hyun Yun
Seonghoon Kim
Sung Yong Choi
Yi Rang Kim
한세광
Publication venue: 'Ivyspring International Publisher'
Publication date: 01/04/2015
Field of study

ope

CiteSeerX

Harvard University - DASH

PubMed Central

포항공과대학교