363 research outputs found

    Stitched ViTs are Flexible Vision Backbones

    Large pretrained plain vision Transformers (ViTs) have been the workhorse for many downstream tasks. However, existing works utilizing off-the-shelf ViTs are inefficient in terms of training and deployment, because adopting ViTs of individual sizes requires separate training runs and is restricted by fixed performance-efficiency trade-offs. In this paper, we are inspired by stitchable neural networks (SN-Net), a new framework that cheaply produces a single model covering rich subnetworks by stitching pretrained model families, supporting diverse performance-efficiency trade-offs at runtime. Building upon this foundation, we introduce SN-Netv2, a systematically improved model stitching framework to facilitate downstream task adaptation. Specifically, we first propose a two-way stitching scheme to enlarge the stitching space. We then design a resource-constrained sampling strategy that takes into account the underlying FLOPs distributions in the space for better sampling. Finally, we observe that learning stitching layers as a low-rank update plays an essential role in stabilizing training on downstream tasks and ensuring a good Pareto frontier. With extensive experiments on ImageNet-1K, ADE20K, COCO-Stuff-10K and NYUv2, SN-Netv2 demonstrates superior performance over SN-Netv1 on downstream dense predictions and shows strong ability as a flexible vision backbone, achieving great advantages in both training efficiency and deployment flexibility. Code is available at https://github.com/ziplab/SN-Netv2. Comment: Tech report.
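
The abstract's last technical point, learning a stitching layer as a low-rank update, can be illustrated with a small sketch. This is not the SN-Netv2 implementation: the class name `LowRankStitch`, the dimensions, the random initial mapping `w0`, and the zero initialisation of one factor (a convention borrowed from LoRA-style adapters) are all assumptions made for illustration. The idea shown is only that the layer's effective weight is a fixed initial mapping plus the product of two thin matrices, so far fewer parameters are trained than in a full update.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def add(a, b):
    """Element-wise sum of two equally shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


class LowRankStitch:
    """Hypothetical stitching layer: y = x @ (W0 + A @ B), with rank(A @ B) <= rank."""

    def __init__(self, d_in, d_out, rank, seed=0):
        rng = random.Random(seed)
        # W0: initial mapping between the two feature spaces, kept fixed.
        # Random here for illustration only.
        self.w0 = [[rng.gauss(0, 0.02) for _ in range(d_out)] for _ in range(d_in)]
        # Low-rank factors. B starts at zero, so the update A @ B is zero
        # and the layer initially behaves exactly like W0 (assumed convention).
        self.a = [[rng.gauss(0, 0.02) for _ in range(rank)] for _ in range(d_in)]
        self.b = [[0.0] * d_out for _ in range(rank)]

    def weight(self):
        """Effective weight: initial mapping plus low-rank update."""
        return add(self.w0, matmul(self.a, self.b))

    def forward(self, x):
        """Map a batch of features (rows of x) from d_in to d_out dimensions."""
        return matmul(x, self.weight())

    def num_trainable(self):
        """Parameter count of the two low-rank factors only."""
        return len(self.a) * len(self.a[0]) + len(self.b) * len(self.b[0])


layer = LowRankStitch(d_in=8, d_out=12, rank=2)
out = layer.forward([[1.0] * 8])
```

With these illustrative sizes the factors hold 8·2 + 2·12 = 40 values, against 8·12 = 96 for a full weight update; the gap widens at realistic feature dimensions.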

    Blockchain technology into the logistics supply chain implementation effectiveness

    Technologies currently have a tremendous impact on all spheres of the economy, business and the state. They fundamentally change people's conception of trade, property, and the interaction of market entities. The development and implementation of artificial intelligence, additive, information-communication, green, bio- and blockchain technologies confirm their leading importance and inevitability relative to traditional approaches. In the modern world, only companies with a flexible vision, equipment and technologies, able to instantly reform and adapt to new conditions and challenges, will benefit. The point at issue is the emergence of Industry 4.0 as a new technological mode.

    The 3-D image recognition based on fuzzy neural network technology

    A three-dimensional stereoscopic image recognition system based on fuzzy neural network technology was developed. The system consists of three parts: a preprocessing part, a feature extraction part, and a matching part. Images from two CCD color cameras are fed to the preprocessing part, where several operations, including RGB-HSV transformation, are performed. A multi-layer perceptron is used for line detection in the feature extraction part. A fuzzy matching technique is then introduced in the matching part. The system is realized on a Sun SPARCstation with a special image input hardware system. An experimental result on bottle images is also presented.

    International criminal justice between Scylla and Charybdis: the "peace versus justice" dilemma analysed through the lenses of Judith Shklar's and Hannah Arendt's legal and political theories

    The present article discusses the “peace versus justice” dilemma in international criminal justice through the lenses of the respective legal (and political) theories of Judith Shklar and Hannah Arendt, two thinkers who have recently been described as theorists of international criminal law. The article claims that in interventions carried out by the International Criminal Court (ICC), there is an ever-present potentiality for the “peace versus justice” dilemma to occur. Unfortunately, there is no abstract solution to this problem, insofar as ICC interventions will in some cases be conducive while in others, they will be deleterious to peace. If a tension between peace and justice arises in a particular case, the article asserts, the former must be prioritised over the latter. Such a prioritisation, however, requires a vision of the ICC as a flexible actor of world politics which is situated at the intersection of law, ethics and politics, rather than a strictly legalistic view of the court. Ultimately, then, the present article seeks to probe whether the legal and political theories of Shklar and Arendt (in isolation, but ultimately also in combination) support such a flexible vision of the ICC. Publisher PDF. Peer reviewed.

    A framework for flexible and reconfigurable vision inspection systems

    Reconfiguration activities remain a significant challenge for automated Vision Inspection Systems (VIS), which are characterized by hardware rigidity and time-consuming software programming tasks. This work contributes to overcoming the current gap in VIS reconfigurability by proposing a novel framework based on the design of Flexible Vision Inspection Systems (FVIS), enabling a Reconfiguration Support System (RSS). FVIS is achieved using reprogrammable hardware components that allow for easy setup based on software commands. The RSS facilitates offline software programming by extracting parameters from real images, Computer-Aided Design (CAD) data, and rendered images using Automatic Feature Recognition (AFR). The RSS offers a user-friendly interface that guides non-expert users through the reconfiguration process for new part types, eliminating the need for low-level coding. The proposed framework has been practically validated during a 4-year collaboration with a leading global automotive half-shaft manufacturer. A fully automated FVIS and the related RSS have been designed following the proposed framework and are currently implemented in 7 plants of the global automotive supplier GKN, checking 60 defect types on thousands of parts per day and covering more than 200 individual part types and 12 part families.

    Vision systems with the human in the loop

    The emerging cognitive vision paradigm deals with vision systems that apply machine learning and automatic reasoning in order to learn from what they perceive. Cognitive vision systems can rate the relevance and consistency of newly acquired knowledge; they can adapt to their environment and thus exhibit high robustness. This contribution presents vision systems that aim at flexibility and robustness. One is tailored for content-based image retrieval; the others are cognitive vision systems that constitute prototypes of visual active memories which evaluate, gather, and integrate contextual knowledge for visual analysis. All three systems are designed to interact with human users. After discussing adaptive content-based image retrieval and object and action recognition in an office environment, we raise the issue of assessing cognitive systems. Experiences from psychologically evaluated human-machine interactions are reported, and the promising potential of psychologically based usability experiments is stressed.