Input image (A) and target (B) alongside visualizations of the first 16 channels of intermediate activations for the given input image produced by early (1), middle (2) and late (3) convolutional layers in U-net (C) and MoNet (D). Histograms computed for all channels in the feature maps for early (1), middle (2), and late (3) convolution layers for U-net (E) and MoNet (F).</p