195 research outputs found

    Metric-aligned Sample Selection and Critical Feature Sampling for Oriented Object Detection

    Full text link
    Arbitrary-oriented object detection is a relatively emerging but challenging task. Although remarkable progress has been made, there still remain many unsolved issues due to the large diversity of patterns in orientation, scale, aspect ratio, and visual appearance of objects in aerial images. Most of the existing methods adopt a coarse-grained fixed label assignment strategy and suffer from the inconsistency between the classification score and localization accuracy. First, to align the metric inconsistency between sample selection and regression loss calculation caused by fixed IoU strategy, we introduce affine transformation to evaluate the quality of samples and propose a distance-based label assignment strategy. The proposed metric-aligned selection (MAS) strategy can dynamically select samples according to the shape and rotation characteristic of objects. Second, to further address the inconsistency between classification and localization, we propose a critical feature sampling (CFS) module, which performs localization refinement on the sampling location for classification task to extract critical features accurately. Third, we present a scale-controlled smooth L1L_1 loss (SC-Loss) to adaptively select high quality samples by changing the form of regression loss function based on the statistics of proposals during training. Extensive experiments are conducted on four challenging rotated object detection datasets DOTA, FAIR1M-1.0, HRSC2016, and UCAS-AOD. The results show the state-of-the-art accuracy of the proposed detector

    OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

    Full text link
    Large language models (LLMs) have revolutionized natural language processing tasks. However, their practical deployment is hindered by their immense memory and computation requirements. Although recent post-training quantization (PTQ) methods are effective in reducing memory footprint and improving the computational efficiency of LLM, they hand-craft quantization parameters, which leads to low performance and fails to deal with extremely low-bit quantization. To tackle this issue, we introduce an Omnidirectionally calibrated Quantization (OmniQuant) technique for LLMs, which achieves good performance in diverse quantization settings while maintaining the computational efficiency of PTQ by efficiently optimizing various quantization parameters. OmniQuant comprises two innovative components including Learnable Weight Clipping (LWC) and Learnable Equivalent Transformation (LET). LWC modulates the extreme values of weights by optimizing the clipping threshold. Meanwhile, LET tackles activation outliers by shifting the challenge of quantization from activations to weights through a learnable equivalent transformation. Operating within a differentiable framework using block-wise error minimization, OmniQuant can optimize the quantization process efficiently for both weight-only and weight-activation quantization. For instance, the LLaMA-2 model family with the size of 7-70B can be processed with OmniQuant on a single A100-40G GPU within 1-16 hours using 128 samples. Extensive experiments validate OmniQuant's superior performance across diverse quantization configurations such as W4A4, W6A6, W4A16, W3A16, and W2A16. Additionally, OmniQuant demonstrates effectiveness in instruction-tuned models and delivers notable improvements in inference speed and memory reduction on real devices. Codes and models are available at \url{https://github.com/OpenGVLab/OmniQuant}.Comment: Updated result with 2-bit quantization. A differentiable quantization method for LL

    Distinct Bacterial Communities in Wet and Dry Seasons During a Seasonal Water Level Fluctuation in the Largest Freshwater Lake (Poyang Lake) in China

    Get PDF
    Water level fluctuations (WLFs) are an inherent feature of lake ecosystems and have been regarded as a pervasive pressure on lacustrine ecosystems globally due to anthropogenic activities and climate change. However, the impacts of WLFs on lake microbial communities is one of our knowledge gaps. Here, we used the high-throughput 16S rRNA gene sequencing approach to investigate the taxonomic and functional dynamics of bacterial communities in wet-season and dry-season of Poyang Lake (PYL) in China. The results showed that dry-season was enriched in total nitrogen (TN), nitrate (NO3-), ammonia (NH4+), and soluble reactive phosphorus (SRP), while wet-season was enriched in dissolved organic carbon (DOC) and total phosphorus (TP). Bacterial communities were distinct taxonomically and functionally in dry-season and wet-season and the nutrients especially P variation had a significant contribution to the seasonal variation of bacterial communities. Moreover, bacterial communities responded differently to nutrient dynamics in different seasons. DOC, NO3-, and SRP had strong influences on bacterial communities in dry-season while only TP in wet-season. Alpha-diversity, functional redundancy, taxonomic dissimilarities, and taxa niche width were higher in dry-season, while functional dissimilarities were higher in wet-season, suggesting that the bacterial communities were more taxonomically sensitive in dry-season while more functionally sensitive in wet-season. Bacterial communities were more efficient in nutrients utilization in wet-season and might have different nitrogen removal mechanisms in different seasons. Bacterial communities in wet-season had significantly higher relative abundance of denitrification genes but lower anammox genes than in dry-season. This study enriched our knowledge of the impacts of WLFs and seasonal dynamics of lake ecosystems. Given the increasingly pervasive pressure of WLFs on lake ecosystems worldwide, our findings have important implications for conservation and management of lake ecosystems

    Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach

    Full text link
    Despite decades of efforts, robot navigation in a real scenario with volatility, uncertainty, complexity, and ambiguity (VUCA for short), remains a challenging topic. Inspired by the central nervous system (CNS), we propose a hierarchical multi-expert learning framework for autonomous navigation in a VUCA environment. With a heuristic exploration mechanism considering target location, path cost, and safety level, the upper layer performs simultaneous map exploration and route-planning to avoid trapping in a blind alley, similar to the cerebrum in the CNS. Using a local adaptive model fusing multiple discrepant strategies, the lower layer pursuits a balance between collision-avoidance and go-straight strategies, acting as the cerebellum in the CNS. We conduct simulation and real-world experiments on multiple platforms, including legged and wheeled robots. Experimental results demonstrate our algorithm outperforms the existing methods in terms of task achievement, time efficiency, and security.Comment: 8 pages, 10 figure

    Determining the macroinvertebrate community indicators and relevant environmental predictors of the Hun-Tai River Basin (Northeast China): A study based on community patterning

    Full text link
    [EN] It is essential to understand the patterning of biota and environmental influencing factors for proper rehabilitation and management at the river basin scale. The Hun-Tai River Basin was extensively sampled four times for macroinvertebrate community and environmental variables during one year. Self-Organizing Maps (SOMs) were used to reveal the aggregation patterns of the 355 samples. Three community types (i.e., clusters) were found (at the family level) based on the community composition, which showed a clearly gradient by combining them with the representative environmental variables: minimally impacted source area, intermediately anthropogenic impacted sites, and highly anthropogenic impacted downstream area, respectively. This gradient was corroborated by the decreasing trends in density and diversity of macroinvertebrates. Distance from source, total phosphorus and water temperature were identified as the most important variables that distinguished the delineated communities. In addition, the sampling season, substrate type, pH and the percentage of grassland were also identified as relevant variables. These results demonstrated that macroinvertebrates communities are structured in a hierarchical manner where geographic and water quality prevail over temporal (season) and habitat (substrate type) features at the basin scale. In addition, it implied that the local-scale environment variables affected macroinvertebrates under the longitudinal gradient of the geographical and anthropogenic pressure. More than one family was identified as the indicator for each type of community. Abundance contributed significantly for distinguishing the indicators, while Baetidae with higher density indicated minimally and intermediately impacted area and lower density indicated highly impacted area. Therefore, we suggested the use of abundance data in community patterning and classification, especially in the identification of the indicator taxa. (C) 2018 Elsevier B.V. All rights reserved.This work was supported by the National Natural Science Foundation of China (51779275, 41501204, 51479219) and the IWHR Research & Development Support Program (WE0145B532017).Zhang, M.; Muñoz Mas, R.; Martinez-Capel, F.; Qu, X.; Zhang, H.; Peng, W.; Liu, X. (2018). Determining the macroinvertebrate community indicators and relevant environmental predictors of the Hun-Tai River Basin (Northeast China): A study based on community patterning. The Science of The Total Environment. 634:749-759. https://doi.org/10.1016/j.scitotenv.2018.04.021S74975963

    DiffRate : Differentiable Compression Rate for Efficient Vision Transformers

    Full text link
    Token compression aims to speed up large-scale vision transformers (e.g. ViTs) by pruning (dropping) or merging tokens. It is an important but challenging task. Although recent advanced approaches achieved great success, they need to carefully handcraft a compression rate (i.e. number of tokens to remove), which is tedious and leads to sub-optimal performance. To tackle this problem, we propose Differentiable Compression Rate (DiffRate), a novel token compression method that has several appealing properties prior arts do not have. First, DiffRate enables propagating the loss function's gradient onto the compression ratio, which is considered as a non-differentiable hyperparameter in previous work. In this case, different layers can automatically learn different compression rates layer-wisely without extra overhead. Second, token pruning and merging can be naturally performed simultaneously in DiffRate, while they were isolated in previous works. Third, extensive experiments demonstrate that DiffRate achieves state-of-the-art performance. For example, by applying the learned layer-wise compression rates to an off-the-shelf ViT-H (MAE) model, we achieve a 40% FLOPs reduction and a 1.5x throughput improvement, with a minor accuracy drop of 0.16% on ImageNet without fine-tuning, even outperforming previous methods with fine-tuning. Codes and models are available at https://github.com/OpenGVLab/DiffRate.Comment: 16 pages, 8 figures, 13 table
    corecore