Search CORE

310 research outputs found

Learned Query Superoptimization

Author: Marcus Ryan
Publication venue
Publication date: 27/03/2023
Field of study

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization have shown that it is possible for a query optimizer to "learn from its mistakes," correcting erroneous query plans the next time a plan is produced. But what if query optimizers could avoid mistakes entirely? This paper presents the idea of learned query superoptimization. A new generation of query superoptimizers could autonomously experiment to discover optimal plans using exploration-driven algorithms, iterative Bayesian optimization, and program synthesis. While such superoptimizers will take significantly longer to optimize a given query, superoptimizers have the potential to massively accelerate a large number of important repetitive queries being executed on data systems today

arXiv.org e-Print Archive

Automatic Classification of Text Databases through Query Probing

Author: D. Hawking
D. Koller
D. Koller
J. P. Callan
J. Xu
L. Gravano
M. Perkowitz
S. Gauch
W. Meng
W. Meng
W. W. Cohen
W. W. Cohen
Publication venue
Publication date: 01/01/2000
Field of study

Many text databases on the web are "hidden" behind search interfaces, and their documents are only accessible through querying. Search engines typically ignore the contents of such search-only databases. Recently, Yahoo-like directories have started to manually organize these databases into categories that users can browse to find these valuable resources. We propose a novel strategy to automate the classification of search-only text databases. Our technique starts by training a rule-based document classifier, and then uses the classifier's rules to generate probing queries. The queries are sent to the text databases, which are then classified based on the number of matches that they produce for each query. We report some initial exploratory experiments that show that our approach is promising to automatically characterize the contents of text databases accessible on the web.Comment: 7 pages, 1 figur

arXiv.org e-Print Archive

CiteSeerX

Crossref

Columbia University Academic Commons

Image mining: issues, frameworks and techniques

Author: Hsu Wynne
Lee Mong Li
Zhang Ji
Publication venue: 'University of Alberta'
Publication date: 01/01/2001
Field of study

[Abstract]: Advances in image acquisition and storage technology have led to tremendous growth in significantly large and detailed image databases. These images, if analyzed, can reveal useful information to the human users. Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. Image mining is more than just an extension of data mining to image domain. It is an interdisciplinary endeavor that draws upon expertise in computer vision, image processing, image retrieval, data mining, machine learning, database, and artificial intelligence. Despite the development of many applications and algorithms in the individual research fields cited above, research in image mining is still in its infancy. In this paper, we will examine the research issues in image mining, current developments in image mining, particularly, image mining frameworks, state-of-the-art techniques and systems. We will also identify some future research directions for image mining at the end of this paper

University of Southern Queensland ePrints

Maintenance Modification Algorithms and its Implementation on object oriented data warehouse

Author: Dr. Mrs Pushpa Suri
MRs. MEENAKSHI SHARMA
Publication venue: Global Journals Inc. (US)
Publication date: 23/05/2011
Field of study

A data warehouse (DW) is a database used for reporting Paper describes Modification Algorithm and implementation on Object Oriented Data Warehousing. A Data Warehouse collects information and data from source data bases to support analytical processing of decision support functions and acts as an information provider. In initial research data warehouses focused on relational data model. In this paper concept of object oriented data warehouse is introduced modification maintenance algorithms and its implementation to maintained consistency between data warehouse and source data base

Global Journal of Computer Science and Technology (GJCST)

A Three-phased Online Association Rule Mining Approach for Diverse Mining Requests

Author: Chu Yianshu
Hong Tzungpei
Tseng Shianshyong
Wang Chingyao
Publication venue: AIS Electronic Library (AISeL)
Publication date: 05/12/2004
Field of study

In the past, most incremental mining and online mining algorithms considered finding the set of association rules or patterns consistent with the entire set of data inserted so far. Users can not easily obtain the results from their only interested portion of data. For providing ad-hoc, query-driven and online mining supports, we first propose a relation called multidimensional pattern relation to structurally and systematically store the context information and the mining information for later analysis. Each tuple in the relation comes from an inserted dataset in the database. This concept is similar to the construction of a data warehouse for OLAP. However, unlike the summarized information of fact attributes in a data warehouse, the mined patterns in the multidimensional pattern relation can not be directly aggregated to satisfy users’ mining requests. We then develop an online mining approach called Three-phased Online Association Rule Mining (TOARM) based on the proposed multidimensional pattern relation to support online generation of association rules under multidimensional considerations. Experiments for both homogeneous and heterogeneous datasets are made, with results showing the effectiveness of the proposed approach

AIS Electronic Library (AISeL)

Image mining: trends and developments

Author: Hsu Wynne
Lee Mong Li
Zhang Ji
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

[Abstract]: Advances in image acquisition and storage technology have led to tremendous growth in very large and detailed image databases. These images, if analyzed, can reveal useful information to the human users. Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. Image mining is more than just an extension of data mining to image domain. It is an interdisciplinary endeavor that draws upon expertise in computer vision, image processing, image retrieval, data mining, machine learning, database, and artificial intelligence. In this paper, we will examine the research issues in image mining, current developments in image mining, particularly, image mining frameworks, state-of-the-art techniques and systems. We will also identify some future research directions for image mining

University of Southern Queensland ePrints

Mining Indirect Association Rules (Evolutionary Advancement in Fundamental Theories of Computer Science)

Author: Hamano Shinichi
Mukouchi Yasuhito
Sato Masako
Publication venue: 京都大学数理解析研究所
Publication date: 01/05/2004
Field of study

Kyoto University Research Information Repository

FP-Growth Based New Normalization Technique for Subgraph Ranking

Author
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date
Field of study

Crossref