401 research outputs found
The Distribution of Visual Information in the Vertical Dimension of Roman and Hebrew Letters
English and Hebrew native speakers read texts mutilated by removing a narrow or a wide strip at the top or at the bottom of the lines. Whereas reading the English texts was impaired more by mutilating the top, the reverse was found for the Hebrew texts. This result is ascribed to the different ways in which information is distributed along the vertical axis of Roman and Hebrew letters. Interactions between region and width of mutilation are argued to indicate that the effect is not due just to features at the very top and very bottom
CORE-Deblur: Parallel MRI Reconstruction by Deblurring Using Compressed Sensing
In this work we introduce a new method that combines Parallel MRI and
Compressed Sensing (CS) for accelerated image reconstruction from subsampled
k-space data. The method first computes a convolved image, which gives the
convolution between a user-defined kernel and the unknown MR image, and then
reconstructs the image by CS-based image deblurring, in which CS is applied for
removing the inherent blur stemming from the convolution process. This method
is hence termed CORE-Deblur. Retrospective subsampling experiments with data
from a numerical brain phantom and in-vivo 7T brain scans showed that
CORE-Deblur produced high-quality reconstructions, comparable to those of a
conventional CS method, while reducing the number of iterations by a factor of
10 or more. The average Normalized Root Mean Square Error (NRMSE) obtained by
CORE-Deblur for the in-vivo datasets was 0.016. CORE-Deblur also exhibited
robustness regarding the chosen kernel and compatibility with various k-space
subsampling schemes, ranging from regular to random. In summary, CORE-Deblur
enables high quality reconstructions and reduction of the CS iterations number
by 10-fold.Comment: 11 pages, 6 figures, 1 tabl
The listening talker: A review of human and algorithmic context-induced modifications of speech
International audienceSpeech output technology is finding widespread application, including in scenarios where intelligibility might be compromised - at least for some listeners - by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output
Reference-Free Image Quality Metric for Degradation and Reconstruction Artifacts
Image Quality Assessment (IQA) is essential in various Computer Vision tasks
such as image deblurring and super-resolution. However, most IQA methods
require reference images, which are not always available. While there are some
reference-free IQA metrics, they have limitations in simulating human
perception and discerning subtle image quality variations. We hypothesize that
the JPEG quality factor is representatives of image quality measurement, and a
well-trained neural network can learn to accurately evaluate image quality
without requiring a clean reference, as it can recognize image degradation
artifacts based on prior knowledge. Thus, we developed a reference-free quality
evaluation network, dubbed "Quality Factor (QF) Predictor", which does not
require any reference. Our QF Predictor is a lightweight, fully convolutional
network comprising seven layers. The model is trained in a self-supervised
manner: it receives JPEG compressed image patch with a random QF as input, is
trained to accurately predict the corresponding QF. We demonstrate the
versatility of the model by applying it to various tasks. First, our QF
Predictor can generalize to measure the severity of various image artifacts,
such as Gaussian Blur and Gaussian noise. Second, we show that the QF Predictor
can be trained to predict the undersampling rate of images reconstructed from
Magnetic Resonance Imaging (MRI) data
Modeling of Current-Voltage Characteristics of the Photoactivated Device Based on SOI Technology
An analytical model of the silicon on insulator photoactivated modulator (SOI-PAM) device is presented in order to describe the concept of this novel device in which the information is electronic while the modulation command is optical. The model, relying on the classic Shockley’s analysis, is simple and useful for analyzing and synthesizing the voltage-current relations of the device at low drain voltage. Analytical expressions were derived for the output current as function of the input drain and gate voltages with a parameterization of the physical values such as the doping concentrations, channel and oxide thicknesses, and the optical control energy. A prototype SOI-PAM device having an area of 4 μm × 3 μm with known parameters is used to experimentally validate and support the model. Finally, the model allows the understanding of the physical mechanisms inside the device for both dark and under illumination conditions, and it will be used to optimize and to find the performance limits of the device
K-band: Self-supervised MRI Reconstruction via Stochastic Gradient Descent over K-space Subsets
Although deep learning (DL) methods are powerful for solving inverse
problems, their reliance on high-quality training data is a major hurdle. This
is significant in high-dimensional (dynamic/volumetric) magnetic resonance
imaging (MRI), where acquisition of high-resolution fully sampled k-space data
is impractical. We introduce a novel mathematical framework, dubbed k-band,
that enables training DL models using only partial, limited-resolution k-space
data. Specifically, we introduce training with stochastic gradient descent
(SGD) over k-space subsets. In each training iteration, rather than using the
fully sampled k-space for computing gradients, we use only a small k-space
portion. This concept is compatible with different sampling strategies; here we
demonstrate the method for k-space "bands", which have limited resolution in
one dimension and can hence be acquired rapidly. We prove analytically that our
method stochastically approximates the gradients computed in a fully-supervised
setup, when two simple conditions are met: (i) the limited-resolution axis is
chosen randomly-uniformly for every new scan, hence k-space is fully covered
across the entire training set, and (ii) the loss function is weighed with a
mask, derived here analytically, which facilitates accurate reconstruction of
high-resolution details. Numerical experiments with raw MRI data indicate that
k-band outperforms two other methods trained on limited-resolution data and
performs comparably to state-of-the-art (SoTA) methods trained on
high-resolution data. k-band hence obtains SoTA performance, with the advantage
of training using only limited-resolution data. This work hence introduces a
practical, easy-to-implement, self-supervised training framework, which
involves fast acquisition and self-supervised reconstruction and offers
theoretical guarantees
- …
