884 research outputs found
MegDet: A Large Mini-Batch Object Detector
The improvements in recent CNN-based object detection works, from R-CNN [11],
Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly
come from new network, new framework, or novel loss design. But mini-batch
size, a key factor in the training, has not been well studied. In this paper,
we propose a Large MiniBatch Object Detector (MegDet) to enable the training
with much larger mini-batch size than before (e.g. from 16 to 256), so that we
can effectively utilize multiple GPUs (up to 128 in our experiments) to
significantly shorten the training time. Technically, we suggest a learning
rate policy and Cross-GPU Batch Normalization, which together allow us to
successfully train a large mini-batch detector in much less time (e.g., from 33
hours to 4 hours), and achieve even better accuracy. The MegDet is the backbone
of our submission (mmAP 52.5%) to COCO 2017 Challenge, where we won the 1st
place of Detection task
Well-arrayed ZnO nanostructures formed by multi-annealing processes at low temperature
A novel process of multi-annealing was proposed for forming well-arrayed ZnO nanostructures on as-deposited ZnO thin films that were prepared on quartz glass using a radio frequency (rf) magnetron sputtering at low temperature. It was found that the formation and morphology of ZnO nanostructures were strongly dependent on the reducing gas annealing processes. Oxygen ambient annealing between two reducing annealing processes had the effect of introducing more oxygen into the ZnO thin film, as well as improving the crystallinity of the ZnO nanostructures. An intense photoluminescence peak centered at 504 nm was observed in the well-arrayed ZnO nanostructures, due to the large amount of oxygen vacancies which existed on the larger surface area of ZnO nanostructures formed after the multi-annealing processes at a low temperature of 430 ℃. These results show that multi-annealing processes are very effective in forming well-arrayed and controllable ZnO nanostructures
Cost-effective fiber multiplexing system based on low coherence interferometers and application to temperature measurement
Anisotropic permeability in deterministic lateral displacement arrays
We uncover anisotropic permeability in microfluidic deterministic lateral
displacement (DLD) arrays. A DLD array can achieve high-resolution bimodal
size-based separation of microparticles, including bioparticles, such as cells.
For an application with a given separation size, correct device operation
requires that the flow remains at a fixed angle to the obstacle array. We
demonstrate via experiments and lattice-Boltzmann simulations that subtle array
design features cause anisotropic permeability. Anisotropic permeability
indicates the microfluidic array's intrinsic tendency to induce an undesired
lateral pressure gradient. This can cause an inclined flow and therefore local
changes in the critical separation size. Thus, particle trajectories can become
unpredictable and the device useless for the desired separation task.
Anisotropy becomes severe for arrays with unequal axial and lateral gaps
between obstacle posts and highly asymmetric post shapes. Furthermore, of the
two equivalent array layouts employed with the DLD, the rotated-square layout
does not display intrinsic anisotropy. We therefore recommend this layout over
the easier-to-implement parallelogram layout. We provide additional guidelines
for avoiding adverse effects of anisotropy on the DLD.Comment: 13 pages, 10 figures, 1 table, DLD, particle separation,
microfluidics, anisotropic permeabilit
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
This paper proposes an efficient multi-camera to Bird's-Eye-View (BEV) view
transformation method for 3D perception, dubbed MatrixVT. Existing view
transformers either suffer from poor transformation efficiency or rely on
device-specific operators, hindering the broad application of BEV models. In
contrast, our method generates BEV features efficiently with only convolutions
and matrix multiplications (MatMul). Specifically, we propose describing the
BEV feature as the MatMul of image feature and a sparse Feature Transporting
Matrix (FTM). A Prime Extraction module is then introduced to compress the
dimension of image features and reduce FTM's sparsity. Moreover, we propose the
Ring \& Ray Decomposition to replace the FTM with two matrices and reformulate
our pipeline to reduce calculation further. Compared to existing methods,
MatrixVT enjoys a faster speed and less memory footprint while remaining
deploy-friendly. Extensive experiments on the nuScenes benchmark demonstrate
that our method is highly efficient but obtains results on par with the SOTA
method in object detection and map segmentation task
- …
