68 research outputs found

    Research and Realization of Integrated Environment for Data Warehouse

    Get PDF
    引入开放式设计思想,使得数据仓库集成环境具有很强的适应性,该集成环境架构在.NET平台上,采用组件开发技术,使系统具有良好的可靠性、可扩展性和安全性。The design of EDWIE(Integrated Environment for Enterprise Data Warehouse) adopts the open thought and the software is based on the.NET software architecture,so the flexibility,reliability,expansibility and security of this system are favorable.国家自然科学基金资助项目(50474033);; 福建省自然科学基金资助项目(A0310008);; 福建省高新技术研究开放计划重点项目(2003H043

    Selection strategy of materialized views in data warehouse

    Get PDF
    为了提高决策支持和OLAP查询的响应效率,数据仓库多采用物化视图的思想。因此,物化视图的选择策略是数据仓库研究的重要问题之一。其目标是选出一组存储、维护代价与查询代价的总和为最小的物化视图。提出一个以MVPP(mul-ti-view processing plan)为视图选择的搜索空间的物化视图选择新算法——VSMF(views selection base on multi-factor)算法。该算法在存储空间约束下同时实现多查询最优化和视图维护最优化。A set of materialized views are stored in the data warehouse for the purpose of efficiently implementing decision-support or OLAP queries.The selection of materialized views is one of the most important issues in the data warehouse development.The goal is to select an appropriate set of views so that the total cost of storage,maintenance and query is minimized.A new algorithm named VSMF(views selection base on multi-factor) algorithm using multi-view processing plan structure as search space is proposed,which solve the problem considering both multi-query optimization and the maintenance process optimization under the storage space constrain.福建省自然科学基金项目(A0310008);; 福建省重点科技基金项目(2003H043

    开放式台湾海峡GIS设计方案

    Get PDF
    福建省重点科技资助项目!第98 Z 179

    Web Information Extraction Based on Tree Structure

    Get PDF
    提出了一种基于树形结构的WEb结构化数据抽取算法.该算法基于HTMl的树形层次结构,包括HTMl树构造算法,数据区域挖掘算法,数据记录挖掘算法以及数据记录模式生成算法.算法引入了页面元素布局位置等信息用于清洗页面,采用层次划分思想实现页面数据区域的挖掘,并通过树匹配生成记录模式,实现最终数据项抽取.实验表明,该方法可以有效地实现WEb结构化数据抽取.It proposes tree structure based Web data extraction algorithm in view of the inadequacies of the existing methods.The tree structure based algorithm includes: the algorithm of HTML tree construction,the algorithm of data region mining,the algorithm of data record mining,and the algorithm of record schema generation.The algorithm cleans the Web pages using the position information of page elements,mines data region by hierarchical clustering, and generates record schema finishing data item extraction through tree matching.Experimental results show that our algorithm can improve the accuracy and efficiency of Web data extraction.国家自然科学基金资助项目(50474033);福建省自然科学基金资助项目(A0310008);福建省重点科技项目(2003H043

    构建数字海洋系统的模式设计

    Get PDF
    福建省重点科技资助项目!第98 Z 179

    Concurrency control algorithm of dynamic adjustment of serialization order for distributed database based on transactions

    Get PDF
    针对分布式数据库中发生待处理的事务流提出一种动态可串行调度算法。通过执行此算法,能够高效地处理源源不断的事务流,使之尽可能串行调度地执行,从而提高并发度,以及分布式数据库各站点的并行处理度。This paper presents an algorithm of dynamic adjustment of serialization order for distributed database on real-time transactions.Running this algorithm can process continual real-time transactions and carry out with an adjustment of serialization order,so the concurrency degree can be improved simultaneously,and the parallel degree of every site of distributed database can be raised too

    Algorithm for mining calendar-based temporal association rules

    Get PDF
    以日历格作为框架来研究时序关联规则,提出了一个有效的挖掘算法。在用户指定的日历模式下,首先通过一次扫描产生所有的频繁2项集及相应的1*日历模式,在此基础上产生k*日历模式,并利用聚集性质产生候选K项集及相应的日历模式,最后扫描事务数据库产生所有的频繁项集及其日历模式。实验证明,该算法具有较好的性能。An efficient algorithm for temporal association rules based on calendar patterns was presented.A user-given calendar schema was adopted to specify the interesting rime intervals as calendar patterns.Then database was scanned once to find all frequent 2-itemsets and their 1-star calendar patterns.Aggregation property and Apriori property were utilized to find all candidate patterns.Finally,calendar-based temporal association rules were obtained through scanning.The experimental results indicate that this proposed algorithm is feasible and efficient.福建省自然科学基金资助项目(A0310008);; 福建省高新技术研究开放计划重点资助项目(2003H043

    应用ODS 技术解决电子政务系统数据一致性问题

    Get PDF
    讨论了ODS 技术在电子政务系统中的应用. 将ODS 引入到电子政务系统中形成更为合理的DB2ODS2DW3 层结构,并通过ODS 记录系统和参考表的使用进行全局联机事务处理,使各业务数据库内容可以实时更新,保持数 据的一致性. 从根本上解决密切相关的业务数据库数据不一致的问题

    Research on Grid Database Materialized Query Caching Policy

    Get PDF
    提出了基于XML Database的网格数据库物化查询缓存机制,提高用户查询的速度,均衡网格负载。定义了网格数据库服务质量与数据质量的标准,提出了物化查询选择算法MQS,为用户提供更好的数据服务。In this paper,grid database materialized query caching policy based on XML Database is presented,which will improve the response time of queries and balance the load of the grid.QoS and QoD of grid database are defined to provide parameters.The MQS algorithm acts as cached materialized query selector to provide better data service.福建省自然基金资助项目(A0310008);; 福建省高新技术研究开放计划重点资助项目(2003H043

    Method of incrementing updating FreeCube

    Get PDF
    数据立方体是联机分析处理的一个重要应用。如何对数据立方体(CUBE)进行更新目前研究相对较少。给出了CUBE的一种新颖的存储结构——FreeCube的定义,该结构大大降低了CUBE存储的空间,分析了它的相关性质,提出了增量更新FreeCube的理论,并给出了具体的算法,以实例说明了该算法的正确性,总结了下一步的工作方向。Data cube is an important application in OLAP system. The study on how to update data cube lags behind at present. A kind of the novel structure of CUBE— — Free is described that can reduce the size of CUBE quickly. And then the characteristics of the con- strictive freecube is analyzed. The theory of incrementing updating freecube is proposed, and algorithm for it and then a instance to indicate the correctness of the algorithm is provided. At last, the directions in the future is summarized.福建省自然科学基金项目(A0310008);; 福建省高新技术研究开放计划重点基金项目(2003H043
    corecore