37 research outputs found
Compact Twice Fusion Network for Edge Detection
The significance of multi-scale features has been gradually recognized by the
edge detection community. However, the fusion of multi-scale features increases
the complexity of the model, which is not friendly to practical application. In
this work, we propose a Compact Twice Fusion Network (CTFN) to fully integrate
multi-scale features while maintaining the compactness of the model. CTFN
includes two lightweight multi-scale feature fusion modules: a Semantic
Enhancement Module (SEM) that can utilize the semantic information contained in
coarse-scale features to guide the learning of fine-scale features, and a
Pseudo Pixel-level Weighting (PPW) module that aggregate the complementary
merits of multi-scale features by assigning weights to all features.
Notwithstanding all this, the interference of texture noise makes the correct
classification of some pixels still a challenge. For these hard samples, we
propose a novel loss function, coined Dynamic Focal Loss, which reshapes the
standard cross-entropy loss and dynamically adjusts the weights to correct the
distribution of hard samples. We evaluate our method on three datasets, i.e.,
BSDS500, NYUDv2, and BIPEDv2. Compared with state-of-the-art methods, CTFN
achieves competitive accuracy with less parameters and computational cost.
Apart from the backbone, CTFN requires only 0.1M additional parameters, which
reduces its computation cost to just 60% of other state-of-the-art methods. The
codes are available at https://github.com/Li-yachuan/CTFN-pytorch-master.Comment: Manuscript submitted to a Springer journa
MANIPULATION ACTION UNDERSTANDING FOR OBSERVATION AND EXECUTION
Modern intelligent agents will need to learn the actions that humans perform. They will need to recognize these actions when they see them and they will need to perform these actions themselves. We want to propose a cognitive system that interprets human manipulation actions from perceptual information (image and depth data) and consists of perceptual modules and reasoning modules that are in interaction with each other. The contributions of this work are given along two core problems at the heart of action understanding: a.) the grounding of relevant information about actions in perception (the perception - action integration problem), and b.) the organization of perceptual and high-level symbolic information for interpreting the actions (the sequencing problem). At the high level, actions are represented with the Manipulation Action Context-free Grammar (MACFG) , a syntactic grammar and associated parsing algorithms, which organizes actions as a sequence of sub-events. Each sub-event is described by the hand (as well as grasp type), movements (actions) and the objects and tools involved, and the relevant information about these quantities is obtained from biological-inspired perception modules. These modules track the hands and objects and recognize the hand grasp, actions, segmentation, and action consequences. Furthermore, a probabilistic semantic parsing framework based on CCG (Combinatory Categorial Grammar) theory is adopted to model the semantic meaning of human manipulation actions.
Additionally, the lesson from the findings on mirror neurons is that the two processes of interpreting visually observed action and generating actions, should share the same underlying cognitive process. Recent studies have shown that grammatical structures underlie the representation of manipulation actions, which are used both to understand and to execute these actions. Analogically, understanding manipulation actions is like understanding language, while executing them is like generating language. Experiments on two tasks, 1) a robot observing people performing manipulation actions, and 2) a robot then executing manipulation actions accordingly, are presented to validate the formalism. The technical parts of this thesis are devoted to the experimental setting of task (1), while the task (2) is given as a live demonstration
Natural Language Processing: Emerging Neural Approaches and Applications
This Special Issue highlights the most recent research being carried out in the NLP field to discuss relative open issues, with a particular focus on both emerging approaches for language learning, understanding, production, and grounding interactively or autonomously from data in cognitive and neural systems, as well as on their potential or real applications in different domains
Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions
Breast cancer has reached the highest incidence rate worldwide among all
malignancies since 2020. Breast imaging plays a significant role in early
diagnosis and intervention to improve the outcome of breast cancer patients. In
the past decade, deep learning has shown remarkable progress in breast cancer
imaging analysis, holding great promise in interpreting the rich information
and complex context of breast imaging modalities. Considering the rapid
improvement in the deep learning technology and the increasing severity of
breast cancer, it is critical to summarize past progress and identify future
challenges to be addressed. In this paper, we provide an extensive survey of
deep learning-based breast cancer imaging research, covering studies on
mammogram, ultrasound, magnetic resonance imaging, and digital pathology images
over the past decade. The major deep learning methods, publicly available
datasets, and applications on imaging-based screening, diagnosis, treatment
response prediction, and prognosis are described in detail. Drawn from the
findings of this survey, we present a comprehensive discussion of the
challenges and potential avenues for future research in deep learning-based
breast cancer imaging.Comment: Survey, 41 page
Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021
The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the edition of 2020, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 represented the first moment for the Italian research community of Computational Linguistics to meet in person after more than one year of full/partial lockdown
Gas Turbines
This book is intended to provide valuable information for the analysis and design of various gas turbine engines for different applications. The target audience for this book is design, maintenance, materials, aerospace and mechanical engineers. The design and maintenance engineers in the gas turbine and aircraft industry will benefit immensely from the integration and system discussions in the book. The chapters are of high relevance and interest to manufacturers, researchers and academicians as well
SPATIAL TRANSFORMATION PATTERN DUE TO COMMERCIAL ACTIVITY IN KAMPONG HOUSE
ABSTRACT Kampung houses are houses in kampung area of the city. Kampung House oftenly transformed into others use as urban dynamics. One of the transfomation is related to the commercial activities addition by the house owner. It make house with full private space become into mixused house with more public spaces or completely changed into full public commercial building. This study investigate the spatial transformation pattern of the kampung houses due to their commercial activities addition. Site observations, interviews and questionnaires were performed to study the spatial transformation. This study found that in kampung houses, the spatial transformation pattern was depend on type of commercial activities and owner perceptions, and there are several steps of the spatial transformation related the commercial activity addition.
Keywords: spatial transformation pattern; commercial activity; owner perception, kampung house; adaptabilit