52,811 research outputs found

    Prompting Visual-Language Models for Dynamic Facial Expression Recognition

    Get PDF
    This paper presents a novel visual-language model called DFER-CLIP, which is based on the CLIP model and designed for in-the-wild Dynamic Facial Expression Recognition (DFER). Specifically, the proposed DFER-CLIP consists of a visual part and a textual part. For the visual part, based on the CLIP image encoder, a temporal model consisting of several Transformer encoders is introduced for extracting temporal facial expression features, and the final feature embedding is obtained as a learnable "class" token. For the textual part, we use as inputs textual descriptions of the facial behaviour that is related to the classes (facial expressions) that we are interested in recognising – those descriptions are generated using large language models, like ChatGPT. This, in contrast to works that use only the class names and more accurately captures the relationship between them. Alongside the textual description, we introduce a learnable token which helps the model learn relevant context information for each expression during training. Extensive experiments demonstrate the effectiveness of the proposed method and show that our DFER-CLIP also achieves state-of-the-art results compared with the current supervised DFER methods on the DFEW, FERV39k, and MAFW benchmarks

    Machine vision applications in agriculture

    Get PDF
    Keynote paper. [Abstract]: With the trend of computers towards convergence with multimedia entertainment, tools for vision processing are becoming commonplace. This has led to the pursuit of a host of unusual applications in the National Centre for Engineering in Agriculture, in addition to work on vision guidance. These range from the identification of animal species, through the location of macadamia nuts as they are harvested and visual tracking for behaviour analysis of small marsupials to the measurement of the volume of dingo teeth

    Meat color recognition using machine vision

    Get PDF
    New technologies are being developed to give an ease to the human in a variety of different field each and every day. Food industry is the key of development that led to the rise of human civilization. The development of food industry dealt with the husbandry of domesticated animal and plants creating food surpluses that enabled the development of more densely populated and stratified societies. The study of food is very important that improves the quality of human's life. When it comes to classify and grade a meat, the color of fresh meat is a sensory indicator of which affects the consumers behavior, especially the consistency of meat color and musculature. Other factors that influence consumers purchasing include security, nutrition and taste. There has been no report that grades the meat freshness in the process of meat delivery. Most of the meat freshness is grading manually by using the human eyesight at the meat's color and quantity of fats. A parameter to show the freshness of meat has only been analyzed manually using a human's eyes. This is some kind of difficult method when making a right decision whether the meat is fresh or not. In order to overcome this problem, meat grading method has been studied to show the mathematical calculation on the change of color hue, saturation, and intensity (HSI) values. This study focuses on grading system design that helps to characterize the meat freshness according to its color. Using a MATLAB Graphical User Interface (GUI) program, it can analyzes the color of the meat that being inspected. The theory of this program includes the calculation of the mean values and histograms, and the final result. This system is capable of classifying meat freshness

    Development of Moire machine vision

    Get PDF
    Three dimensional perception is essential to the development of versatile robotics systems in order to handle complex manufacturing tasks in future factories and in providing high accuracy measurements needed in flexible manufacturing and quality control. A program is described which will develop the potential of Moire techniques to provide this capability in vision systems and automated measurements, and demonstrate artificial intelligence (AI) techniques to take advantage of the strengths of Moire sensing. Moire techniques provide a means of optically manipulating the complex visual data in a three dimensional scene into a form which can be easily and quickly analyzed by computers. This type of optical data manipulation provides high productivity through integrated automation, producing a high quality product while reducing computer and mechanical manipulation requirements and thereby the cost and time of production. This nondestructive evaluation is developed to be able to make full field range measurement and three dimensional scene analysis
    • …
    corecore