3 research outputs found

    Hierarchical Load Balancing for Parallel Fast Legendre Transforms

    No full text
    We present a parallel Fast Legendre Transform (FLT) based on the Driscol--Healy algorithm with computation complexity O(N log² N ). The parallel FLT is load-- balanced in a hierarchical fashion. We use a load--balanced FFT to deduce a load-- balanced parallel fast cosine transform, which in turn serves as a building block for the Legendre transform engine, from which the parallel FLT is constructed. We demonstrate how the arithmetic, memory and communication complexities of the parallel FLT are hierarchically derived via the complexity of its modular blocks
    corecore