884 research outputs found

    MegDet: A Large Mini-Batch Object Detector

    Full text link
    The improvements in recent CNN-based object detection works, from R-CNN [11], Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly come from new network, new framework, or novel loss design. But mini-batch size, a key factor in the training, has not been well studied. In this paper, we propose a Large MiniBatch Object Detector (MegDet) to enable the training with much larger mini-batch size than before (e.g. from 16 to 256), so that we can effectively utilize multiple GPUs (up to 128 in our experiments) to significantly shorten the training time. Technically, we suggest a learning rate policy and Cross-GPU Batch Normalization, which together allow us to successfully train a large mini-batch detector in much less time (e.g., from 33 hours to 4 hours), and achieve even better accuracy. The MegDet is the backbone of our submission (mmAP 52.5%) to COCO 2017 Challenge, where we won the 1st place of Detection task

    Well-arrayed ZnO nanostructures formed by multi-annealing processes at low temperature

    Get PDF
    A novel process of multi-annealing was proposed for forming well-arrayed ZnO nanostructures on as-deposited ZnO thin films that were prepared on quartz glass using a radio frequency (rf) magnetron sputtering at low temperature. It was found that the formation and morphology of ZnO nanostructures were strongly dependent on the reducing gas annealing processes. Oxygen ambient annealing between two reducing annealing processes had the effect of introducing more oxygen into the ZnO thin film, as well as improving the crystallinity of the ZnO nanostructures. An intense photoluminescence peak centered at 504 nm was observed in the well-arrayed ZnO nanostructures, due to the large amount of oxygen vacancies which existed on the larger surface area of ZnO nanostructures formed after the multi-annealing processes at a low temperature of 430 ℃. These results show that multi-annealing processes are very effective in forming well-arrayed and controllable ZnO nanostructures

    Anisotropic permeability in deterministic lateral displacement arrays

    Get PDF
    We uncover anisotropic permeability in microfluidic deterministic lateral displacement (DLD) arrays. A DLD array can achieve high-resolution bimodal size-based separation of microparticles, including bioparticles, such as cells. For an application with a given separation size, correct device operation requires that the flow remains at a fixed angle to the obstacle array. We demonstrate via experiments and lattice-Boltzmann simulations that subtle array design features cause anisotropic permeability. Anisotropic permeability indicates the microfluidic array's intrinsic tendency to induce an undesired lateral pressure gradient. This can cause an inclined flow and therefore local changes in the critical separation size. Thus, particle trajectories can become unpredictable and the device useless for the desired separation task. Anisotropy becomes severe for arrays with unequal axial and lateral gaps between obstacle posts and highly asymmetric post shapes. Furthermore, of the two equivalent array layouts employed with the DLD, the rotated-square layout does not display intrinsic anisotropy. We therefore recommend this layout over the easier-to-implement parallelogram layout. We provide additional guidelines for avoiding adverse effects of anisotropy on the DLD.Comment: 13 pages, 10 figures, 1 table, DLD, particle separation, microfluidics, anisotropic permeabilit

    MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception

    Full text link
    This paper proposes an efficient multi-camera to Bird's-Eye-View (BEV) view transformation method for 3D perception, dubbed MatrixVT. Existing view transformers either suffer from poor transformation efficiency or rely on device-specific operators, hindering the broad application of BEV models. In contrast, our method generates BEV features efficiently with only convolutions and matrix multiplications (MatMul). Specifically, we propose describing the BEV feature as the MatMul of image feature and a sparse Feature Transporting Matrix (FTM). A Prime Extraction module is then introduced to compress the dimension of image features and reduce FTM's sparsity. Moreover, we propose the Ring \& Ray Decomposition to replace the FTM with two matrices and reformulate our pipeline to reduce calculation further. Compared to existing methods, MatrixVT enjoys a faster speed and less memory footprint while remaining deploy-friendly. Extensive experiments on the nuScenes benchmark demonstrate that our method is highly efficient but obtains results on par with the SOTA method in object detection and map segmentation task
    corecore