5 research outputs found

    Skew Handling Techniques in Sort-Merge Join

    No full text
    Joins are among the most frequently executed operations. Several fast join algorithms have been developed and extensively studied; these can be categorized as sort-merge, hash-based, and index-based algorithms. While all three types of algorithms exhibit excellent performance over most data, ameliorating the performance degradation in the presence of skew has been investigated only for hash-based algorithms. However, for sort-merge join, even a small amount of skew present in realistic data can result in a significant performance hit on a commercial DBMS. This paper examines the negative ramifications of skew in sort-merge join and proposes several refinements that deal effectively with data skew. Experiments show that some of these algorithms also impose virtually no penalty in the absence of data skew and are thus suitable for replacing existing sort-merge implementations. We also show how sortmerge band join performance is significantly enhanced with these refinements

    Skew handling techniques in sort-merge join

    Full text link

    Oracle Corporation

    No full text
    Skew Handling Techniques in Sort-Merge Join Joins are among the most frequently executed operations. Sev-eral fast join algorithms have been developed and extensively studied; these can be categorized as sort-merge, hash-based, and index-based algorithms. While all three types of algo-rithms exhibit excellent performance over most data, amelio-rating the performance degradation in the presence of skew has been investigated only for hash-based algorithms. How-ever, for sort-merge join, even a small amount of skew present in realistic data can result in a significant performance hit on a commercial DBMS. This paper examines the negative ramifi-cations of skew in sort-merge join and proposes several refine-ments that deal effectively with data skew. Experiments show that some of these algorithms also impose virtually no penalty in the absence of data skew and are thus suitable for replacing existing sort-merge implementations. We also show how sort-merge band join performance is significantly enhanced with these refinements. 1
    corecore