111 research outputs found

    Theoretical Analysis on the Optimization and Integration of College PE Curriculum Education and Teaching System

    Get PDF
    The purpose of this study was to promote the development of PE curriculum theory and practice. Literature review method and logical analysis were used in this study. The results showed that physical education curriculum and teaching related ideas are based on optimizing the teaching theory system of integrated physical education. At the macro theoretical guidance level, moral education is the theoretical basis for the construction of sports related curriculum education and teaching ideology. In terms of setting and construction, lifelong physical education is the theoretical basis of school physical education curriculum education and teaching system construction. In the overall construction, we should not only set up a reasonable framework, but also optimize and incorporate high-quality course teaching resources. On the practical level, physical education teaching refers to the relatively stable structure and procedure of physical education activities established under the guidance of certain teaching ideas or teaching theories, which is the theoretical basis for the practice of physical education teaching system. Physical education teaching practice system is the ultimate embodiment of physical education teaching implementation on the basis of physical education teaching ideology and construction system. Based on different groups and different types of physical education courses, the practice path and effect are optimized and integrated, and efficient and feasible physical education teaching practice system is constructed

    Toward Sufficient Spatial-Frequency Interaction for Gradient-aware Underwater Image Enhancement

    Full text link
    Underwater images suffer from complex and diverse degradation, which inevitably affects the performance of underwater visual tasks. However, most existing learning-based Underwater image enhancement (UIE) methods mainly restore such degradations in the spatial domain, and rarely pay attention to the fourier frequency information. In this paper, we develop a novel UIE framework based on spatial-frequency interaction and gradient maps, namely SFGNet, which consists of two stages. Specifically, in the first stage, we propose a dense spatial-frequency fusion network (DSFFNet), mainly including our designed dense fourier fusion block and dense spatial fusion block, achieving sufficient spatial-frequency interaction by cross connections between these two blocks. In the second stage, we propose a gradient-aware corrector (GAC) to further enhance perceptual details and geometric structures of images by gradient map. Experimental results on two real-world underwater image datasets show that our approach can successfully enhance underwater images, and achieves competitive performance in visual quality improvement

    Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration

    Full text link
    Underwater images are subject to intricate and diverse degradation, inevitably affecting the effectiveness of underwater visual tasks. However, most approaches primarily operate in the raw pixel space of images, which limits the exploration of the frequency characteristics of underwater images, leading to an inadequate utilization of deep models' representational capabilities in producing high-quality images. In this paper, we introduce a novel Underwater Image Enhancement (UIE) framework, named WF-Diff, designed to fully leverage the characteristics of frequency domain information and diffusion models. WF-Diff consists of two detachable networks: Wavelet-based Fourier information interaction network (WFI2-net) and Frequency Residual Diffusion Adjustment Module (FRDAM). With our full exploration of the frequency domain information, WFI2-net aims to achieve preliminary enhancement of frequency information in the wavelet space. Our proposed FRDAM can further refine the high- and low-frequency information of the initial enhanced images, which can be viewed as a plug-and-play universal module to adjust the detail of the underwater images. With the above techniques, our algorithm can show SOTA performance on real-world underwater image datasets, and achieves competitive performance in visual quality

    EPIM: Efficient Processing-In-Memory Accelerators based on Epitome

    Full text link
    The exploration of Processing-In-Memory (PIM) accelerators has garnered significant attention within the research community. However, the utilization of large-scale neural networks on Processing-In-Memory (PIM) accelerators encounters challenges due to constrained on-chip memory capacity. To tackle this issue, current works explore model compression algorithms to reduce the size of Convolutional Neural Networks (CNNs). Most of these algorithms either aim to represent neural operators with reduced-size parameters (e.g., quantization) or search for the best combinations of neural operators (e.g., neural architecture search). Designing neural operators to align with PIM accelerators' specifications is an area that warrants further study. In this paper, we introduce the Epitome, a lightweight neural operator offering convolution-like functionality, to craft memory-efficient CNN operators for PIM accelerators (EPIM). On the software side, we evaluate epitomes' latency and energy on PIM accelerators and introduce a PIM-aware layer-wise design method to enhance their hardware efficiency. We apply epitome-aware quantization to further reduce the size of epitomes. On the hardware side, we modify the datapath of current PIM accelerators to accommodate epitomes and implement a feature map reuse technique to reduce computation cost. Experimental results reveal that our 3-bit quantized EPIM-ResNet50 attains 71.59% top-1 accuracy on ImageNet, reducing crossbar areas by 30.65 times. EPIM surpasses the state-of-the-art pruning methods on PIM

    Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images

    Full text link
    Rendering high-resolution (HR) graphics brings substantial computational costs. Efficient graphics super-resolution (SR) methods may achieve HR rendering with small computing resources and have attracted extensive research interests in industry and research communities. We present a new method for real-time SR for computer graphics, namely Super-Resolution by Predicting Offsets (SRPO). Our algorithm divides the image into two parts for processing, i.e., sharp edges and flatter areas. For edges, different from the previous SR methods that take the anti-aliased images as inputs, our proposed SRPO takes advantage of the characteristics of rasterized images to conduct SR on the rasterized images. To complement the residual between HR and low-resolution (LR) rasterized images, we train an ultra-efficient network to predict the offset maps to move the appropriate surrounding pixels to the new positions. For flat areas, we found simple interpolation methods can already generate reasonable output. We finally use a guided fusion operation to integrate the sharp edges generated by the network and flat areas by the interpolation method to get the final SR image. The proposed network only contains 8,434 parameters and can be accelerated by network quantization. Extensive experiments show that the proposed SRPO can achieve superior visual effects at a smaller computational cost than the existing state-of-the-art methods.Comment: This article has been accepted by ECCV202

    Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI Segmentation

    Full text link
    Many medical datasets have recently been created for medical image segmentation tasks, and it is natural to question whether we can use them to sequentially train a single model that (1) performs better on all these datasets, and (2) generalizes well and transfers better to the unknown target site domain. Prior works have achieved this goal by jointly training one model on multi-site datasets, which achieve competitive performance on average but such methods rely on the assumption about the availability of all training data, thus limiting its effectiveness in practical deployment. In this paper, we propose a novel multi-site segmentation framework called incremental-transfer learning (ITL), which learns a model from multi-site datasets in an end-to-end sequential fashion. Specifically, "incremental" refers to training sequentially constructed datasets, and "transfer" is achieved by leveraging useful information from the linear combination of embedding features on each dataset. In addition, we introduce our ITL framework, where we train the network including a site-agnostic encoder with pre-trained weights and at most two segmentation decoder heads. We also design a novel site-level incremental loss in order to generalize well on the target domain. Second, we show for the first time that leveraging our ITL training scheme is able to alleviate challenging catastrophic forgetting problems in incremental learning. We conduct experiments using five challenging benchmark datasets to validate the effectiveness of our incremental-transfer learning approach. Our approach makes minimal assumptions on computation resources and domain-specific expertise, and hence constitutes a strong starting point in multi-site medical image segmentation

    MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

    Full text link
    Recently, the astonishing performance of large language models (LLMs) in natural language comprehension and generation tasks triggered lots of exploration of using them as central controllers to build agent systems. Multiple studies focus on bridging the LLMs to external tools to extend the application scenarios. However, the current LLMs' perceiving tool-use ability is limited to a single text query, which may result in ambiguity in understanding the users' real intentions. LLMs are expected to eliminate that by perceiving the visual- or auditory-grounded instructions' information. Therefore, in this paper, we propose MLLM-Tool, a system incorporating open-source LLMs and multi-modal encoders so that the learnt LLMs can be conscious of multi-modal input instruction and then select the function-matched tool correctly. To facilitate the evaluation of the model's capability, we collect a dataset featured by consisting of multi-modal input tools from HuggingFace. Another important feature of our dataset is that our dataset also contains multiple potential choices for the same instruction due to the existence of identical functions and synonymous functions, which provides more potential solutions for the same query. The experiments reveal that our MLLM-Tool is capable of recommending appropriate tools for multi-modal instructions. Codes and data are available at https://github.com/MLLM-Tool/MLLM-Tool.Comment: 21 pages, 9 figures, 10 table

    The effect of MWA protocols upon morphology and IVIM parameters of hepatic ablation zones—a preliminary in vivo animal study with an MRI-compatible microwave ablation device

    Get PDF
    PURPOSEWe aimed to explore the effect of microwave ablation (MWA) protocols upon morphology and instant changes in intravoxel incoherent motion (IVIM) diffusion-weighted imaging (DWI) parameters on MWA zones in porcine livers.METHODSAccording to the empirical protocol for MWA in tumors less than 3 cm in our hospital, the power and application duration were assigned as five groups: A, 60 W × 5 min (n = 6); B, 80 W × 3 min (n = 7); C, 80 W × 5 min (n = 10); D, 100 W × 3 min (n = 10); E, 100 W × 5 min (n = 9). Spearman correlation between MWA protocols, morphological metrics, and instant post-ablation IVIM parameters was performed.RESULTSThere was fair positive correlation between energy delivery and short axis (RSpearman = 0.426, P= .005) of the white zone. There was moderate-to-good positive correlation between wattage and short axis (RSpearman = 0.584, P < .001) of the white zone. For post-ablation IVIM parameters in the white zone, only wattage had moderate-to-good positive correlation with D value (RSpearman= 0.574, P < .001) or ADC value (RSpearman = 0.550, P < .001). No correlation between energy delivery, wattage, duration, and f value was observed (RSpearman = 0.185, P = .24; RSpearman= − 0.001, P = .99; RSpearman = 0.203, P = .20, respectively).CONCLUSIONThe increase in the short axis of the white zone is more likely to be affected by wattage than energy delivery. The instant post-ablation IVIM is feasible in monitoring the MWA zones since the f value in the white zones is not sensitive to changes in MWA protocols, which is promising in evaluating the instant effect of MWA

    Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency

    Full text link
    Our research presents a wearable Silent Speech Interface (SSI) technology that excels in device comfort, time-energy efficiency, and speech decoding accuracy for real-world use. We developed a biocompatible, durable textile choker with an embedded graphene-based strain sensor, capable of accurately detecting subtle throat movements. This sensor, surpassing other strain sensors in sensitivity by 420%, simplifies signal processing compared to traditional voice recognition methods. Our system uses a computationally efficient neural network, specifically a one-dimensional convolutional neural network with residual structures, to decode speech signals. This network is energy and time-efficient, reducing computational load by 90% while achieving 95.25% accuracy for a 20-word lexicon and swiftly adapting to new users and words with minimal samples. This innovation demonstrates a practical, sensitive, and precise wearable SSI suitable for daily communication applications.Comment: 5 figures in the article; 11 figures and 4 tables in supplementary informatio
    • …
    corecore