9 research outputs found

    Performance of all the tasks on TVQA dataset by question type.

    No full text
    M1-M5 represent Two-stream, PAMN, Multi-task, STAGE, and MAF-HMS, respectively.

    S1 Dataset -

    No full text
    (ZIP)

    Faster R-CNN network.

    No full text

    The hybrid multi-head self-attention mechanism.

    No full text

    Analysis by required modality of MAF-HMS.

    No full text

    The network architecture of MAF-HMS.

    No full text

    Performance comparison on MSVD-QA and MSRVTT-QA dataset.

    No full text
    M1-M5 represent Two-stream, PAMN, Multi-task, STAGE, and MAF-HMS, respectively.

    Ablation study on model variants of MAF-HMS on the validation set of TVQA.

    No full text

    Evaluation results on the TVQA dataset by TV show.

    No full text