8 research outputs found

    3D Model-based Zero-Shot Pose Estimation Pipeline

    Full text link
    Most existing learning-based pose estimation methods are typically developed for non-zero-shot scenarios, where they can only estimate the poses of objects present in the training dataset. This setting restricts their applicability to unseen objects in the training phase. In this paper, we introduce a fully zero-shot pose estimation pipeline that leverages the 3D models of objects as clues. Specifically, we design a two-step pipeline consisting of 3D model-based zero-shot instance segmentation and a zero-shot pose estimator. For the first step, there is a novel way to perform zero-shot instance segmentation based on the 3D models instead of text descriptions, which can handle complex properties of unseen objects. For the second step, we utilize a hierarchical geometric structure matching mechanism to perform zero-shot pose estimation which is 10 times faster than the current render-based method. Extensive experimental results on the seven core datasets on the BOP challenge show that the proposed method outperforms the zero-shot state-of-the-art method with higher speed and lower computation cost

    MIAD: A Maintenance Inspection Dataset for Unsupervised Anomaly Detection

    Full text link
    Visual anomaly detection plays a crucial role in not only manufacturing inspection to find defects of products during manufacturing processes, but also maintenance inspection to keep equipment in optimum working condition particularly outdoors. Due to the scarcity of the defective samples, unsupervised anomaly detection has attracted great attention in recent years. However, existing datasets for unsupervised anomaly detection are biased towards manufacturing inspection, not considering maintenance inspection which is usually conducted under outdoor uncontrolled environment such as varying camera viewpoints, messy background and degradation of object surface after long-term working. We focus on outdoor maintenance inspection and contribute a comprehensive Maintenance Inspection Anomaly Detection (MIAD) dataset which contains more than 100K high-resolution color images in various outdoor industrial scenarios. This dataset is generated by a 3D graphics software and covers both surface and logical anomalies with pixel-precise ground truth. Extensive evaluations of representative algorithms for unsupervised anomaly detection are conducted, and we expect MIAD and corresponding experimental results can inspire research community in outdoor unsupervised anomaly detection tasks. Worthwhile and related future work can be spawned from our new dataset

    Balancing Logit Variation for Long-tailed Semantic Segmentation

    Full text link
    Semantic segmentation usually suffers from a long-tail data distribution. Due to the imbalanced number of samples across categories, the features of those tail classes may get squeezed into a narrow area in the feature space. Towards a balanced feature distribution, we introduce category-wise variation into the network predictions in the training phase such that an instance is no longer projected to a feature point, but a small region instead. Such a perturbation is highly dependent on the category scale, which appears as assigning smaller variation to head classes and larger variation to tail classes. In this way, we manage to close the gap between the feature areas of different categories, resulting in a more balanced representation. It is noteworthy that the introduced variation is discarded at the inference stage to facilitate a confident prediction. Although with an embarrassingly simple implementation, our method manifests itself in strong generalizability to various datasets and task settings. Extensive experiments suggest that our plug-in design lends itself well to a range of state-of-the-art approaches and boosts the performance on top of them

    Geo6D: Geometric Constraints Learning for 6D Pose Estimation

    Full text link
    Numerous 6D pose estimation methods have been proposed that employ end-to-end regression to directly estimate the target pose parameters. Since the visible features of objects are implicitly influenced by their poses, the network allows inferring the pose by analyzing the differences in features in the visible region. However, due to the unpredictable and unrestricted range of pose variations, the implicitly learned visible feature-pose constraints are insufficiently covered by the training samples, making the network vulnerable to unseen object poses. To tackle these challenges, we proposed a novel geometric constraints learning approach called Geo6D for direct regression 6D pose estimation methods. It introduces a pose transformation formula expressed in relative offset representation, which is leveraged as geometric constraints to reconstruct the input and output targets of the network. These reconstructed data enable the network to estimate the pose based on explicit geometric constraints and relative offset representation mitigates the issue of the pose distribution gap. Extensive experimental results show that when equipped with Geo6D, the direct 6D methods achieve state-of-the-art performance on multiple datasets and demonstrate significant effectiveness, even with only 10% amount of data

    TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

    Full text link
    Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage of external tools that require a blend of task planning and the utilization of external tools, such as APIs. However, real-world complex systems present three prevalent challenges concerning task planning and tool usage: (1) The real system usually has a vast array of APIs, so it is impossible to feed the descriptions of all APIs to the prompt of LLMs as the token length is limited; (2) the real system is designed for handling complex tasks, and the base LLMs can hardly plan a correct sub-task order and API-calling order for such tasks; (3) Similar semantics and functionalities among APIs in real systems create challenges for both LLMs and even humans in distinguishing between them. In response, this paper introduces a comprehensive framework aimed at enhancing the Task Planning and Tool Usage (TPTU) abilities of LLM-based agents operating within real-world systems. Our framework comprises three key components designed to address these challenges: (1) the API Retriever selects the most pertinent APIs for the user task among the extensive array available; (2) LLM Finetuner tunes a base LLM so that the finetuned LLM can be more capable for task planning and API calling; (3) the Demo Selector adaptively retrieves different demonstrations related to hard-to-distinguish APIs, which is further used for in-context learning to boost the final performance. We validate our methods using a real-world commercial system as well as an open-sourced academic dataset, and the outcomes clearly showcase the efficacy of each individual component as well as the integrated framework

    Seawater carbonate chemistry and competition for growth, photosynthetic performance and biochemical composition in Neopyropia yezoensis and Ulva prolifera

    No full text
    The occurrence of various marine macroalgae in the same niche will inevitably lead to interspecific competition due to similar environmental requirements. With the increasing global atmospheric CO2 concentration, the resulting ocean acidification can potentially influence competition among macroalgae in the future. Neopyropia yezoensis (Rhodophyta, formerly Pyropia yezoensis) and the epiphytic alga Ulva prolifera (Chlorophyta) were selected for investigating competition among macroalgae grown under different CO2 conditions. The results showed that when cultured with U. prolifera, N. yezoensis' growth rate was significantly inhibited along with a sharp decrease in net photosynthetic rate. Although CO2 decreased the growth rate of N. yezoensis, it enhanced the resistance of the alga to the allelopathic effect of U. prolifera. While no difference was found between U. prolifera grown in monoculture and biculture, strong competitive ability was observed. CO2 could enhance this ability with higher net photosynthetic rate. However, CO2 significantly inhibited the carotenoid synthesis in both plants. This inhibition in N. yezoensis was more pronounced in the presence of U. prolifera. Biculture promoted the accumulation of soluble protein in N. yezoensis while it inhibited the process in U. prolifera. In addition, it enhanced the inhibitory effect of acidification on soluble carbohydrates of both plants. Elevated CO2 levels alleviated the competition between N. yezoensis and U. prolifera, but the latter can become the more competitive epiphytic alga which can impact the future of nori culture

    Seawater carbonate chemistry and growth, net photosynthesis rate, Chlorophyll fluorescence parameter, soluble protein, photosynthetic pigments of red algae Pyropia yezoensis

    No full text
    Increasing CO2 levels in the surface water of oceans are expected to decrease oceanic pH and lead to seawater acidification. The responses of macroalgaea to this acidification of coastal waters have been studied in detail; however, most reports have focused on the adult stage only, while ignoring other life cycle stages. In this study, the economically important seaweed species Pyropia yezoensis was cultured under two CO2 concentrations (ambient CO2: 400 μatm; elevated CO2: 1000 μatm) and two light intensities (low light intensity: 80 μmol photons/m**2 /s; and high light intensity: 240 μmol photons/m**2 /s). The effects on the growth and photosynthetic performance of P. yezoensis were explored at different life cycle stages. Relative growth rates were significantly elevated at the conchocelis stage under high light intensity and elevated CO2 concentration. Moreover, the Pmax of P. yezoensis was also increased under high light intensity. However, this positive effect inversed at the thallus stage. The relative growth rate, relative electron transport rate (rETR), and net photosynthetic rate decreased at the thallus stage in response to high CO2 concentration. Under low light intensity, elevated CO2 concentration significantly increased the relative growth rates of conchocelis and thallus stages. These were 269% and 45% higher at elevated CO2 concentration compared with ambient CO2 concentrations, respectively. The Chl a and phycoerythrin levels were also higher under elevated CO2 level at the conchocelis stage. However, the rETR for the thallus stage was elevated under low light. This suggests that seawater acidification could positively affect algae at low light conditions (especially at the conchocelis stage). Different growth stages of P. yezoensis may respond differently to seawater acidification and changes of light intensity. Thalli growth stage, stocking density, and seawater depth should be considered in different areas to optimize the primary production of macroalgae

    Isolation and characterization of a novel temperate bacteriophage infecting Aeromonas hydrophila isolated from a Macrobrachium rosenbergii larvae pond

    No full text
    Aeromonas hydrophila is an opportunistic pathogen that frequently leads to significant mortality in various commercially cultured aquatic species. Bacteriophages offer an alternative strategy for pathogens elimination. In this study, we isolated, identified, and characterized a novel temperate A. hydrophila phage, designated as P05B. The bacteriophage P05B is a myovirus based on its morphological features, and possesses the capability to lyse A. hydrophila strains isolated from shrimp. The optimal multiplicity of infection (MOI), adsorption rate, latent period, and burst size for phage P05B were determined to be 0.001, 91.7 %, 20 min, and 483 PFU/cell, respectively. Phage P05B displayed stability across a range of temperatures (28–50 °C) and pH values (4.0–10.0). Sequence analysis unveiled that the genome of phage P05B comprises 32,302 base pairs with an average G + C content of 59.4 %. A total of 40 open reading frames (ORF) were encoded within the phage P05B genome. The comparative genomic analyses clearly implied that P05B might represent a novel species of the genus Bielevirus under Peduoviridae family. A phylogenetic tree was reconstructed, demonstrating that P05B shares a close evolutionary relationship with other Aeromonas and Aeromonas phages. In conclusion, this study increased our knowledge about a new temperate phage of A. hydrophila with strong lytic ability
    corecore