8 research outputs found

    Efficient Training for Visual Tracking with Deformable Transformer

    Full text link
    Recent Transformer-based visual tracking models have showcased superior performance. Nevertheless, prior works have been resource-intensive, requiring prolonged GPU training hours and incurring high GFLOPs during inference due to inefficient training methods and convolution-based target heads. This intensive resource use renders them unsuitable for real-world applications. In this paper, we present DETRack, a streamlined end-to-end visual object tracking framework. Our framework utilizes an efficient encoder-decoder structure where the deformable transformer decoder acting as a target head, achieves higher sparsity than traditional convolution heads, resulting in decreased GFLOPs. For training, we introduce a novel one-to-many label assignment and an auxiliary denoising technique, significantly accelerating model's convergence. Comprehensive experiments affirm the effectiveness and efficiency of our proposed method. For instance, DETRack achieves 72.9% AO on challenging GOT-10k benchmarks using only 20% of the training epochs required by the baseline, and runs with lower GFLOPs than all the transformer-based trackers.Comment: arXiv admin note: text overlap with arXiv:2303.16580 by other author

    Towards Efficient Training with Negative Samples in Visual Tracking

    Full text link
    Current state-of-the-art (SOTA) methods in visual object tracking often require extensive computational resources and vast amounts of training data, leading to a risk of overfitting. This study introduces a more efficient training strategy to mitigate overfitting and reduce computational requirements. We balance the training process with a mix of negative and positive samples from the outset, named as Joint learning with Negative samples (JN). Negative samples refer to scenarios where the object from the template is not present in the search region, which helps to prevent the model from simply memorizing the target, and instead encourages it to use the template for object location. To handle the negative samples effectively, we adopt a distribution-based head, which modeling the bounding box as distribution of distances to express uncertainty about the target's location in the presence of negative samples, offering an efficient way to manage the mixed sample training. Furthermore, our approach introduces a target-indicating token. It encapsulates the target's precise location within the template image. This method provides exact boundary details with negligible computational cost but improving performance. Our model, JN-256, exhibits superior performance on challenging benchmarks, achieving 75.8% AO on GOT-10k and 84.1% AUC on TrackingNet. Notably, JN-256 outperforms previous SOTA trackers that utilize larger models and higher input resolutions, even though it is trained with only half the number of data sampled used in those works

    LiteTrack: Layer Pruning with Asynchronous Feature Extraction for Lightweight and Efficient Visual Tracking

    Full text link
    The recent advancements in transformer-based visual trackers have led to significant progress, attributed to their strong modeling capabilities. However, as performance improves, running latency correspondingly increases, presenting a challenge for real-time robotics applications, especially on edge devices with computational constraints. In response to this, we introduce LiteTrack, an efficient transformer-based tracking model optimized for high-speed operations across various devices. It achieves a more favorable trade-off between accuracy and efficiency than the other lightweight trackers. The main innovations of LiteTrack encompass: 1) asynchronous feature extraction and interaction between the template and search region for better feature fushion and cutting redundant computation, and 2) pruning encoder layers from a heavy tracker to refine the balnace between performance and speed. As an example, our fastest variant, LiteTrack-B4, achieves 65.2% AO on the GOT-10k benchmark, surpassing all preceding efficient trackers, while running over 100 fps with ONNX on the Jetson Orin NX edge device. Moreover, our LiteTrack-B9 reaches competitive 72.2% AO on GOT-10k and 82.4% AUC on TrackingNet, and operates at 171 fps on an NVIDIA 2080Ti GPU. The code and demo materials will be available at https://github.com/TsingWei/LiteTrack

    RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration

    Full text link
    Single object tracking (SOT) heavily relies on the representation of the target object as a bounding box. However, due to the potential deformation and rotation experienced by the tracked targets, the genuine bounding box fails to capture the appearance information explicitly and introduces cluttered background. This paper proposes RTrack, a novel object representation baseline tracker that utilizes a set of sample points to get a pseudo bounding box. RTrack automatically arranges these points to define the spatial extents and highlight local areas. Building upon the baseline, we conducted an in-depth exploration of the training potential and introduced a one-to-many leading assignment strategy. It is worth noting that our approach achieves competitive performance to the state-of-the-art trackers on the GOT-10k dataset while reducing training time to just 10% of the previous state-of-the-art (SOTA) trackers' training costs. The substantial reduction in training costs brings single-object tracking (SOT) closer to the object detection (OD) task. Extensive experiments demonstrate that our proposed RTrack achieves SOTA results with faster convergence

    GANS-based data augmentation for citrus disease severity detection using deep learning

    Get PDF
    Recently, many Deep Learning models have been employed to classify different kinds of plant diseases, but very little work has been done for disease severity detection. However, it is more important to master the severities of plant diseases accurately and timely, as it helps to make effective decisions to protect the plants from being further infected and reduce financial loss. In this paper, based on the Huanglongbing (HLB)-infected leaf images obtained from PlantVillage and crowdAI, we created a dataset with 5,406 citrus leaf images infected by HLB. Then six different kinds of popular models were trained to perform the severity detection of citrus HLB with the goal to find which types of models are more suitable to detect HLB severity with the same training circumstance. The experimental results show that the Inception_v3 model with epochs=60 can achieve higher accuracy than that of other models for severity detection with an accuracy of 74.38% due to its highly computational efficiency and small number of parameters. Additionally, aiming for evaluating whether GANs-based data augmentation can contribute to improve the model learning performance, we adopted DCGANs (Deep Convolutional Generative Adversarial Networks) to augment the original training dataset up to two times itself. Finally, a new training dataset with 14,056 leaf images composed by the original training images and the augmented ones were used to train the Inception_v3 model. As a result, we achieved an accuracy of 92.60%, about 20% higher than that of the Inception_v3 model trained by the original training dataset, which suggested that the GANs-based data augmentation is very useful to improve the model learning performance

    Analysis of Maceaene and Macamide Contents of Petroleum Ether Extract of Black, Yellow, and Purple Lepidium Meyenii (Maca) and Their Antioxidant Effect on Diabetes Mellitus Rat Model

    No full text
    ABSTRACT Maceaene and macamide contents as well as antioxidant effect of petroleum ether extract of black maca (BM), yellow maca (YM), and purple maca (PM) on diabetes mellitus (DM) rats were investigated. The results showed that seven, six, and five analogues of macamides were identified from the petroleum ether extracts of BM, YM, and PM, respectively. BM extract exhibited the highest contents of total macamides. Comparatively, the PM extract has the lowest macamide quantity. The maceaene contents in all the extracts showed no significant difference (p>0.05). Macamide contents in maca with the same color were not statistically different. Pharmacological results showed that 60-day oral administration of the petroleum ether extract of maca (100 mg/kg.d) can significantly decrease lipid oxidation as indicated by the decreased thiobarbituric acid reactive substances (TBARS) and carbonylated proteins (CP) concentrations on DM rat model (P<0.05). Among them, oral administration of PM extract showed the lowest TBRAS and CP concentrations. All maca extracts can enhance antioxidant enzyme (SOD, superoxide dismutase; CAT, catalase) activity of liver and red blood cells (RBC) of DM rat. However, only oral administration of PM extract can increase SOD and CAT activity of both RBC and liver. The glutathion (GSH) contents in plasma were significantly increased in DM rats treated with PM extract (p<0.05). But, oral administration of BM and YM extracts did not enhance GSH levels. Take together, the data suggested that PM extract exhibited the most potent antioxidant activity on DM rat model. And, maceaene and macamide in maca extract was not correlated with its antioxidant ability
    corecore