GourmetNet: Food Segmentation Using Multi-Scale Waterfall Features With Spatial and Channel Attention

Sharma, Udit

GourmetNet: Food Segmentation Using Multi-Scale Waterfall Features With Spatial and Channel Attention

Authors: Udit Sharma
Publication date: 1 November 2021
Publisher: RIT Scholar Works

Abstract

Deep learning and Computer vision are extensively used to solve problems in wide range of domains from automotive and manufacturing to healthcare and surveillance. Research in deep learning for food images is mainly limited to food identification and detection. Food segmentation is an important problem as the first step for nutrition monitoring, food volume and calorie estimation. This research is intended to expand the horizons of deep learning and semantic segmentation by proposing a novel single-pass, end-to-end trainable network for food segmentation. Our novel architecture incorporates both channel attention and spatial attention information in an expanded multi-scale feature representation using the WASPv2 module. The refined features will be processed with the advanced multi-scale waterfall module that combines the benefits of cascade filtering and pyramid representations without requiring a separate decoder or postprocessing

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

RIT Scholar Works

oai:repository.rit.edu:theses-...

Last time updated on 12/01/2024