154 research outputs found
Audio Coding Based on Integer Transforms
In recent years audio coding has become a very popular field for
research and applications. Especially perceptual audio coding schemes,
such as MPEG-1 Layer-3 (MP3) and MPEG-2 Advanced Audio Coding (AAC), are
widely used for efficient storage and transmission of music
signals. Nevertheless, for professional applications, such as archiving
and transmission in studio environments, lossless audio coding schemes
are considered more appropriate.
Traditionally, the technical approaches used in perceptual and lossless
audio coding have been separate worlds. In perceptual audio coding,
filter banks such as the lapped orthogonal transform "Modified
Discrete Cosine Transform" (MDCT) have been the approach of choice in
many state-of-the-art coding schemes. On the other hand,
lossless audio coding schemes mostly employ predictive coding of
waveforms to remove redundancy. Only a few attempts have been made so far
to use transform coding for the purpose of lossless audio coding.
This work presents a new approach of applying the lifting scheme to
lapped transforms used in perceptual audio coding. This allows for an
invertible integer-to-integer approximation of the original transform,
e.g. the IntMDCT as an integer approximation of the MDCT. The same
technique can also be applied to low-delay filter banks. A generalized,
multi-dimensional lifting approach and a noise-shaping technique are
introduced, which further improve the accuracy of the
approximation to the original transform.
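As a rough illustration of the lifting idea described above (not the author's IntMDCT implementation), the sketch below applies three rounded lifting steps to a single plane rotation, the kind of building block that lapped transforms such as the MDCT can be decomposed into. The function names `int_rotate` and `int_rotate_inverse` and the round-half-up rule are assumptions made for this example. Because each step adds a rounded function of the other, unchanged value, the steps can be undone exactly in reverse order, so the mapping is integer-to-integer and invertible while staying close to the real-valued rotation.

```python
import math


def _round(v):
    # Deterministic round-half-up; any fixed rounding rule would work,
    # as long as the forward and inverse transforms use the same one.
    return math.floor(v + 0.5)


def int_rotate(x, y, theta):
    """Integer approximation of a rotation by theta using three lifting steps.

    Relies on the factorization
        [[c, -s], [s, c]] = [[1, a], [0, 1]] @ [[1, 0], [s, 1]] @ [[1, a], [0, 1]]
    with a = (c - 1) / s, which requires sin(theta) != 0.
    """
    s = math.sin(theta)
    a = (math.cos(theta) - 1.0) / s
    x += _round(a * y)  # lifting step 1: update x from y
    y += _round(s * x)  # lifting step 2: update y from the new x
    x += _round(a * y)  # lifting step 3: update x from the new y
    return x, y


def int_rotate_inverse(x, y, theta):
    """Exact inverse: subtract the same rounded terms in reverse order."""
    s = math.sin(theta)
    a = (math.cos(theta) - 1.0) / s
    x -= _round(a * y)
    y -= _round(s * x)
    x -= _round(a * y)
    return x, y


if __name__ == "__main__":
    theta = math.pi / 4
    u, v = int_rotate(1000, -250, theta)
    print("forward:", (u, v))
    print("inverse:", int_rotate_inverse(u, v, theta))
```

The rounding error introduced by each lifting step is what a noise-shaping technique, as mentioned in the abstract, would aim to control; this sketch does not attempt that.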
Based on these new integer transforms, this work presents new audio
coding schemes and applications. The audio coding applications cover
lossless audio coding, scalable lossless enhancement of a perceptual
audio coder and fine-grain scalable perceptual and lossless audio
coding. Finally, an approach to data hiding with high data rates in
uncompressed audio signals based on integer transforms is described.
Unmasking Communication Partners: A Low-Cost AI Solution for Digitally Removing Head-Mounted Displays in VR-Based Telepresence
Face-to-face conversation in Virtual Reality (VR) is a challenge when
participants wear head-mounted displays (HMD). A significant portion of a
participant's face is hidden and facial expressions are difficult to perceive.
Past research has shown that high-fidelity face reconstruction with personal
avatars in VR is possible under laboratory conditions with high-cost hardware.
In this paper, we propose one of the first low-cost systems for this task which
uses only open source, free software and affordable hardware. Our approach is
to track the user's face underneath the HMD utilizing a Convolutional Neural
Network (CNN) and generate corresponding expressions with Generative
Adversarial Networks (GAN) for producing RGBD images of the person's face. We
use commodity hardware with low-cost extensions such as 3D-printed mounts and
miniature cameras. Our approach learns end-to-end without manual intervention,
runs in real time, and can be trained and executed on an ordinary gaming
computer. We report evaluation results showing that our low-cost system does
not achieve the same fidelity as research prototypes using high-end hardware
and closed source software, but it is capable of creating individual facial
avatars with person-specific characteristics in movements and expressions.
Comment: 9 pages, IEEE 3rd International Conference on Artificial Intelligence
& Virtual Reality
Robust laser frequency stabilization by serrodyne modulation
We report the relative frequency stabilization of a distributed feedback
erbium-doped fiber laser on an optical cavity by serrodyne frequency shifting.
A correction bandwidth of 2.3 MHz and a dynamic range of 220 MHz are achieved,
which leads to a strong robustness against large disturbances up to high
frequencies. We demonstrate that serrodyne frequency shifting reaches a higher
correction bandwidth and lower relative frequency noise level compared to a
standard acousto-optical modulator based scheme. Our results open up
promising applications in the absolute frequency stabilization of lasers on
optical cavities.
Comment: 3 pages, accepted for publication in Optics Letters
Plasma homocysteine levels and associated factors in community-dwelling adolescents: the EVA-TYROL study
Background: Homocysteine (Hcy) has been associated with an adverse cardiovascular risk profile in adolescents. Assessment of the association between plasma Hcy levels and clinical/laboratory factors might improve our understanding of the pathogenesis of cardiovascular disease.
Methods: Hcy was measured in 1,900 14- to 19-year-old participants of the prospective population-based EVA-TYROL Study (44.3% males, mean age 16.4 years) between 2015 and 2018. Factors associated with Hcy were assessed by physical examination, standardized interviews, and fasting blood analysis.
Results: Mean plasma Hcy was 11.3 ± 4.5 µmol/L. Distribution of Hcy was characterized by extreme right skew. Males exhibited higher Hcy and sex differences increased with increasing age. Univariate associations with Hcy emerged for age, sex, body mass index, high-density lipoprotein cholesterol, and for factors pertaining to blood pressure, glucose metabolism, renal function, and diet quality, whereas the most important multivariate predictors of Hcy were sex and creatinine.
Discussion: Clinical and laboratory factors associated with Hcy in adolescents were manifold, with sex and high creatinine identified as the strongest independent determinants. These results may aid in interpreting future studies investigating the vascular risk of homocysteine.
Heat and particle exhaust in high-performance plasmas in Wendelstein 7-X
The paper reports for the first time the heat and particle exhaust at the plasma boundary through various edge diagnostics for the high-performance plasma obtained after pellet injection on Wendelstein 7-X. The plasma density at the edge is found to be reduced by a factor of 2 in the high-performance phase, supporting the previously reported density peaking at the plasma centre. The plasma beta effect on the magnetic topology is reflected by the appearance of the second strike line, which is well understood with simulation. However, during the rapid decay phase of the enhanced confinement, a transient localized heat flow of up to 16 MW m^-2 is observed at the leading edge of a poorly cooled divertor component, which has not been understood but raises concerns about machine safety.
Signatures of Thermal Dilepton Radiation at RHIC
The properties of thermal dilepton production from heavy-ion collisions in
the RHIC energy regime are evaluated for invariant masses ranging from 0.5 to 3
GeV. Using an expanding thermal fireball to model the evolution through both
quark-gluon and hadronic phases, various features of the spectra are addressed.
In the low-mass region, due to an expected large background, the focus is on
possible medium modifications of the narrow resonance structures from
ω and φ mesons, whereas in the intermediate-mass region the old idea of
identifying QGP radiation is reiterated including effects of chemical
under-saturation in the early stages of central Au+Au collisions.
Comment: 17 pages ReVTeX including 16 figures
Tolerability of inhaled N-chlorotaurine in an acute pig streptococcal lower airway inflammation model
Background: Inhalation of N-chlorotaurine (NCT), an endogenous new broad spectrum non-antibiotic anti-infective, has been shown to be very well tolerated in the pig model recently. In the present study, inhaled NCT was tested for tolerability and efficacy in the infected bronchopulmonary system using the same model.
Methods: Anesthetized pigs were inoculated with 20 ml of a solution containing approximately 10^8 CFU/ml Streptococcus pyogenes strain d68 via a duodenal tube placed through the tracheal tube down to the carina. Two hours later, 5 ml of 1% NCT aqueous solution (test group, n = 15) or 5 ml of 0.9% NaCl (control group, n = 16) was inhaled via the tracheal tube connected to a nebulizer. Inhalation was repeated every hour, four times in total. Lung function and haemodynamics were monitored. Bronchoalveolar lavage samples were removed for determination of colony forming units (CFU), and lung samples for histology.
Results: Arterial pressure of oxygen (PaO2) decreased rapidly after instillation of the bacteria in all animals and showed only a slight further decrease at the end of the experiment without a difference between both groups. Pulmonary artery pressure increased to a peak 1-1.5 h after application of the bacteria, decreased in the following hour and remained constant during treatment, again similarly in both groups. Histology demonstrated granulocytic infiltration in the central parts of the lung, while this was absent in the periphery. Expression of TNF-alpha, IL-8, and haemoxygenase-1 in lung biopsies was similar in both groups. CFU counts in bronchoalveolar lavage came to 170 (10; 1388) CFU/ml (median and 25 and 75 percentiles) for the NCT treated pigs, and to 250 (10; 5.5 × 10^5) CFU/ml for NaCl treated pigs (p = 0.4159).
Conclusions: Inhaled NCT at a concentration of 1% proved to be very well tolerated also in the infected bronchopulmonary system. This study confirms the tolerability in this delicate body region, which has been proven in healthy pigs previously. Regarding efficacy, no conclusions can be drawn, mainly because of the limited test period of the model.