195 research outputs found
Metric-aligned Sample Selection and Critical Feature Sampling for Oriented Object Detection
Arbitrary-oriented object detection is a relatively emerging but challenging
task. Although remarkable progress has been made, there still remain many
unsolved issues due to the large diversity of patterns in orientation, scale,
aspect ratio, and visual appearance of objects in aerial images. Most of the
existing methods adopt a coarse-grained fixed label assignment strategy and
suffer from the inconsistency between the classification score and localization
accuracy. First, to align the metric inconsistency between sample selection and
regression loss calculation caused by fixed IoU strategy, we introduce affine
transformation to evaluate the quality of samples and propose a distance-based
label assignment strategy. The proposed metric-aligned selection (MAS) strategy
can dynamically select samples according to the shape and rotation
characteristic of objects. Second, to further address the inconsistency between
classification and localization, we propose a critical feature sampling (CFS)
module, which performs localization refinement on the sampling location for
classification task to extract critical features accurately. Third, we present
a scale-controlled smooth loss (SC-Loss) to adaptively select high
quality samples by changing the form of regression loss function based on the
statistics of proposals during training. Extensive experiments are conducted on
four challenging rotated object detection datasets DOTA, FAIR1M-1.0, HRSC2016,
and UCAS-AOD. The results show the state-of-the-art accuracy of the proposed
detector
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
Large language models (LLMs) have revolutionized natural language processing
tasks. However, their practical deployment is hindered by their immense memory
and computation requirements. Although recent post-training quantization (PTQ)
methods are effective in reducing memory footprint and improving the
computational efficiency of LLM, they hand-craft quantization parameters, which
leads to low performance and fails to deal with extremely low-bit quantization.
To tackle this issue, we introduce an Omnidirectionally calibrated Quantization
(OmniQuant) technique for LLMs, which achieves good performance in diverse
quantization settings while maintaining the computational efficiency of PTQ by
efficiently optimizing various quantization parameters. OmniQuant comprises two
innovative components including Learnable Weight Clipping (LWC) and Learnable
Equivalent Transformation (LET). LWC modulates the extreme values of weights by
optimizing the clipping threshold. Meanwhile, LET tackles activation outliers
by shifting the challenge of quantization from activations to weights through a
learnable equivalent transformation. Operating within a differentiable
framework using block-wise error minimization, OmniQuant can optimize the
quantization process efficiently for both weight-only and weight-activation
quantization. For instance, the LLaMA-2 model family with the size of 7-70B can
be processed with OmniQuant on a single A100-40G GPU within 1-16 hours using
128 samples. Extensive experiments validate OmniQuant's superior performance
across diverse quantization configurations such as W4A4, W6A6, W4A16, W3A16,
and W2A16. Additionally, OmniQuant demonstrates effectiveness in
instruction-tuned models and delivers notable improvements in inference speed
and memory reduction on real devices. Codes and models are available at
\url{https://github.com/OpenGVLab/OmniQuant}.Comment: Updated result with 2-bit quantization. A differentiable quantization
method for LL
Distinct Bacterial Communities in Wet and Dry Seasons During a Seasonal Water Level Fluctuation in the Largest Freshwater Lake (Poyang Lake) in China
Water level fluctuations (WLFs) are an inherent feature of lake ecosystems and have been regarded as a pervasive pressure on lacustrine ecosystems globally due to anthropogenic activities and climate change. However, the impacts of WLFs on lake microbial communities is one of our knowledge gaps. Here, we used the high-throughput 16S rRNA gene sequencing approach to investigate the taxonomic and functional dynamics of bacterial communities in wet-season and dry-season of Poyang Lake (PYL) in China. The results showed that dry-season was enriched in total nitrogen (TN), nitrate (NO3-), ammonia (NH4+), and soluble reactive phosphorus (SRP), while wet-season was enriched in dissolved organic carbon (DOC) and total phosphorus (TP). Bacterial communities were distinct taxonomically and functionally in dry-season and wet-season and the nutrients especially P variation had a significant contribution to the seasonal variation of bacterial communities. Moreover, bacterial communities responded differently to nutrient dynamics in different seasons. DOC, NO3-, and SRP had strong influences on bacterial communities in dry-season while only TP in wet-season. Alpha-diversity, functional redundancy, taxonomic dissimilarities, and taxa niche width were higher in dry-season, while functional dissimilarities were higher in wet-season, suggesting that the bacterial communities were more taxonomically sensitive in dry-season while more functionally sensitive in wet-season. Bacterial communities were more efficient in nutrients utilization in wet-season and might have different nitrogen removal mechanisms in different seasons. Bacterial communities in wet-season had significantly higher relative abundance of denitrification genes but lower anammox genes than in dry-season. This study enriched our knowledge of the impacts of WLFs and seasonal dynamics of lake ecosystems. Given the increasingly pervasive pressure of WLFs on lake ecosystems worldwide, our findings have important implications for conservation and management of lake ecosystems
Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach
Despite decades of efforts, robot navigation in a real scenario with
volatility, uncertainty, complexity, and ambiguity (VUCA for short), remains a
challenging topic. Inspired by the central nervous system (CNS), we propose a
hierarchical multi-expert learning framework for autonomous navigation in a
VUCA environment. With a heuristic exploration mechanism considering target
location, path cost, and safety level, the upper layer performs simultaneous
map exploration and route-planning to avoid trapping in a blind alley, similar
to the cerebrum in the CNS. Using a local adaptive model fusing multiple
discrepant strategies, the lower layer pursuits a balance between
collision-avoidance and go-straight strategies, acting as the cerebellum in the
CNS. We conduct simulation and real-world experiments on multiple platforms,
including legged and wheeled robots. Experimental results demonstrate our
algorithm outperforms the existing methods in terms of task achievement, time
efficiency, and security.Comment: 8 pages, 10 figure
Determining the macroinvertebrate community indicators and relevant environmental predictors of the Hun-Tai River Basin (Northeast China): A study based on community patterning
[EN] It is essential to understand the patterning of biota and environmental influencing factors for proper rehabilitation and management at the river basin scale. The Hun-Tai River Basin was extensively sampled four times for macroinvertebrate community and environmental variables during one year. Self-Organizing Maps (SOMs) were used to reveal the aggregation patterns of the 355 samples. Three community types (i.e., clusters) were found (at the family level) based on the community composition, which showed a clearly gradient by combining them with the representative environmental variables: minimally impacted source area, intermediately anthropogenic impacted sites, and highly anthropogenic impacted downstream area, respectively. This gradient was corroborated by the decreasing trends in density and diversity of macroinvertebrates. Distance from source, total phosphorus and water temperature were identified as the most important variables that distinguished the delineated communities. In addition, the sampling season, substrate type, pH and the percentage of grassland were also identified as relevant variables. These results demonstrated that macroinvertebrates communities are structured in a hierarchical manner where geographic and water quality prevail over temporal (season) and habitat (substrate type) features at the basin scale. In addition, it implied that the local-scale environment variables affected macroinvertebrates under the longitudinal gradient of the geographical and anthropogenic pressure. More than one family was identified as the indicator for each type of community. Abundance contributed significantly for distinguishing the indicators, while Baetidae with higher density indicated minimally and intermediately impacted area and lower density indicated highly impacted area. Therefore, we suggested the use of abundance data in community patterning and classification, especially in the identification of the indicator taxa. (C) 2018 Elsevier B.V. All rights reserved.This work was supported by the National Natural Science Foundation of China (51779275, 41501204, 51479219) and the IWHR Research & Development Support Program (WE0145B532017).Zhang, M.; Muñoz Mas, R.; Martinez-Capel, F.; Qu, X.; Zhang, H.; Peng, W.; Liu, X. (2018). Determining the macroinvertebrate community indicators and relevant environmental predictors of the Hun-Tai River Basin (Northeast China): A study based on community patterning. The Science of The Total Environment. 634:749-759. https://doi.org/10.1016/j.scitotenv.2018.04.021S74975963
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers
Token compression aims to speed up large-scale vision transformers (e.g.
ViTs) by pruning (dropping) or merging tokens. It is an important but
challenging task. Although recent advanced approaches achieved great success,
they need to carefully handcraft a compression rate (i.e. number of tokens to
remove), which is tedious and leads to sub-optimal performance. To tackle this
problem, we propose Differentiable Compression Rate (DiffRate), a novel token
compression method that has several appealing properties prior arts do not
have. First, DiffRate enables propagating the loss function's gradient onto the
compression ratio, which is considered as a non-differentiable hyperparameter
in previous work. In this case, different layers can automatically learn
different compression rates layer-wisely without extra overhead. Second, token
pruning and merging can be naturally performed simultaneously in DiffRate,
while they were isolated in previous works. Third, extensive experiments
demonstrate that DiffRate achieves state-of-the-art performance. For example,
by applying the learned layer-wise compression rates to an off-the-shelf ViT-H
(MAE) model, we achieve a 40% FLOPs reduction and a 1.5x throughput
improvement, with a minor accuracy drop of 0.16% on ImageNet without
fine-tuning, even outperforming previous methods with fine-tuning. Codes and
models are available at https://github.com/OpenGVLab/DiffRate.Comment: 16 pages, 8 figures, 13 table
- …