154 research outputs found

    Audio Coding Based on Integer Transforms

    Get PDF
    Die Audiocodierung hat sich in den letzten Jahren zu einem sehr populären Forschungs- und Anwendungsgebiet entwickelt. Insbesondere gehörangepasste Verfahren zur Audiocodierung, wie etwa MPEG-1 Layer-3 (MP3) oder MPEG-2 Advanced Audio Coding (AAC), werden häufig zur effizienten Speicherung und Übertragung von Audiosignalen verwendet. Für professionelle Anwendungen, wie etwa die Archivierung und Übertragung im Studiobereich, ist hingegen eher eine verlustlose Audiocodierung angebracht. Die bisherigen Ansätze für gehörangepasste und verlustlose Audiocodierung sind technisch völlig verschieden. Moderne gehörangepasste Audiocoder basieren meist auf Filterbänken, wie etwa der überlappenden orthogonalen Transformation "Modifizierte Diskrete Cosinus-Transformation" (MDCT). Verlustlose Audiocoder hingegen verwenden meist prädiktive Codierung zur Redundanzreduktion. Nur wenige Ansätze zur transformationsbasierten verlustlosen Audiocodierung wurden bisher versucht. Diese Arbeit präsentiert einen neuen Ansatz hierzu, der das Lifting-Schema auf die in der gehörangepassten Audiocodierung verwendeten überlappenden Transformationen anwendet. Dies ermöglicht eine invertierbare Integer-Approximation der ursprünglichen Transformation, z.B. die IntMDCT als Integer-Approximation der MDCT. Die selbe Technik kann auch für Filterbänke mit niedriger Systemverzögerung angewandt werden. Weiterhin ermöglichen ein neuer, mehrdimensionaler Lifting-Ansatz und eine Technik zur Spektralformung von Quantisierungsfehlern eine Verbesserung der Approximation der ursprünglichen Transformation. Basierend auf diesen neuen Integer-Transformationen werden in dieser Arbeit neue Verfahren zur Audiocodierung vorgestellt. Die Verfahren umfassen verlustlose Audiocodierung, eine skalierbare verlustlose Erweiterung eines gehörangepassten Audiocoders und einen integrierten Ansatz zur fein skalierbaren gehörangepassten und verlustlosen Audiocodierung. Schließlich wird mit Hilfe der Integer-Transformationen ein neuer Ansatz zur unhörbaren Einbettung von Daten mit hohen Datenraten in unkomprimierte Audiosignale vorgestellt.In recent years audio coding has become a very popular field for research and applications. Especially perceptual audio coding schemes, such as MPEG-1 Layer-3 (MP3) and MPEG-2 Advanced Audio Coding (AAC), are widely used for efficient storage and transmission of music signals. Nevertheless, for professional applications, such as archiving and transmission in studio environments, lossless audio coding schemes are considered more appropriate. Traditionally, the technical approaches used in perceptual and lossless audio coding have been separate worlds. In perceptual audio coding, the use of filter banks, such as the lapped orthogonal transform "Modified Discrete Cosine Transform" (MDCT), has been the approach of choice being used by many state of the art coding schemes. On the other hand, lossless audio coding schemes mostly employ predictive coding of waveforms to remove redundancy. Only few attempts have been made so far to use transform coding for the purpose of lossless audio coding. This work presents a new approach of applying the lifting scheme to lapped transforms used in perceptual audio coding. This allows for an invertible integer-to-integer approximation of the original transform, e.g. the IntMDCT as an integer approximation of the MDCT. The same technique can also be applied to low-delay filter banks. A generalized, multi-dimensional lifting approach and a noise-shaping technique are introduced, allowing to further optimize the accuracy of the approximation to the original transform. Based on these new integer transforms, this work presents new audio coding schemes and applications. The audio coding applications cover lossless audio coding, scalable lossless enhancement of a perceptual audio coder and fine-grain scalable perceptual and lossless audio coding. Finally an approach to data hiding with high data rates in uncompressed audio signals based on integer transforms is described

    Audio Coding Based on Integer Transforms

    Get PDF
    Die Audiocodierung hat sich in den letzten Jahren zu einem sehr populären Forschungs- und Anwendungsgebiet entwickelt. Insbesondere gehörangepasste Verfahren zur Audiocodierung, wie etwa MPEG-1 Layer-3 (MP3) oder MPEG-2 Advanced Audio Coding (AAC), werden häufig zur effizienten Speicherung und Übertragung von Audiosignalen verwendet. Für professionelle Anwendungen, wie etwa die Archivierung und Übertragung im Studiobereich, ist hingegen eher eine verlustlose Audiocodierung angebracht. Die bisherigen Ansätze für gehörangepasste und verlustlose Audiocodierung sind technisch völlig verschieden. Moderne gehörangepasste Audiocoder basieren meist auf Filterbänken, wie etwa der überlappenden orthogonalen Transformation "Modifizierte Diskrete Cosinus-Transformation" (MDCT). Verlustlose Audiocoder hingegen verwenden meist prädiktive Codierung zur Redundanzreduktion. Nur wenige Ansätze zur transformationsbasierten verlustlosen Audiocodierung wurden bisher versucht. Diese Arbeit präsentiert einen neuen Ansatz hierzu, der das Lifting-Schema auf die in der gehörangepassten Audiocodierung verwendeten überlappenden Transformationen anwendet. Dies ermöglicht eine invertierbare Integer-Approximation der ursprünglichen Transformation, z.B. die IntMDCT als Integer-Approximation der MDCT. Die selbe Technik kann auch für Filterbänke mit niedriger Systemverzögerung angewandt werden. Weiterhin ermöglichen ein neuer, mehrdimensionaler Lifting-Ansatz und eine Technik zur Spektralformung von Quantisierungsfehlern eine Verbesserung der Approximation der ursprünglichen Transformation. Basierend auf diesen neuen Integer-Transformationen werden in dieser Arbeit neue Verfahren zur Audiocodierung vorgestellt. Die Verfahren umfassen verlustlose Audiocodierung, eine skalierbare verlustlose Erweiterung eines gehörangepassten Audiocoders und einen integrierten Ansatz zur fein skalierbaren gehörangepassten und verlustlosen Audiocodierung. Schließlich wird mit Hilfe der Integer-Transformationen ein neuer Ansatz zur unhörbaren Einbettung von Daten mit hohen Datenraten in unkomprimierte Audiosignale vorgestellt.In recent years audio coding has become a very popular field for research and applications. Especially perceptual audio coding schemes, such as MPEG-1 Layer-3 (MP3) and MPEG-2 Advanced Audio Coding (AAC), are widely used for efficient storage and transmission of music signals. Nevertheless, for professional applications, such as archiving and transmission in studio environments, lossless audio coding schemes are considered more appropriate. Traditionally, the technical approaches used in perceptual and lossless audio coding have been separate worlds. In perceptual audio coding, the use of filter banks, such as the lapped orthogonal transform "Modified Discrete Cosine Transform" (MDCT), has been the approach of choice being used by many state of the art coding schemes. On the other hand, lossless audio coding schemes mostly employ predictive coding of waveforms to remove redundancy. Only few attempts have been made so far to use transform coding for the purpose of lossless audio coding. This work presents a new approach of applying the lifting scheme to lapped transforms used in perceptual audio coding. This allows for an invertible integer-to-integer approximation of the original transform, e.g. the IntMDCT as an integer approximation of the MDCT. The same technique can also be applied to low-delay filter banks. A generalized, multi-dimensional lifting approach and a noise-shaping technique are introduced, allowing to further optimize the accuracy of the approximation to the original transform. Based on these new integer transforms, this work presents new audio coding schemes and applications. The audio coding applications cover lossless audio coding, scalable lossless enhancement of a perceptual audio coder and fine-grain scalable perceptual and lossless audio coding. Finally an approach to data hiding with high data rates in uncompressed audio signals based on integer transforms is described

    Unmasking Communication Partners: A Low-Cost AI Solution for Digitally Removing Head-Mounted Displays in VR-Based Telepresence

    Full text link
    Face-to-face conversation in Virtual Reality (VR) is a challenge when participants wear head-mounted displays (HMD). A significant portion of a participant's face is hidden and facial expressions are difficult to perceive. Past research has shown that high-fidelity face reconstruction with personal avatars in VR is possible under laboratory conditions with high-cost hardware. In this paper, we propose one of the first low-cost systems for this task which uses only open source, free software and affordable hardware. Our approach is to track the user's face underneath the HMD utilizing a Convolutional Neural Network (CNN) and generate corresponding expressions with Generative Adversarial Networks (GAN) for producing RGBD images of the person's face. We use commodity hardware with low-cost extensions such as 3D-printed mounts and miniature cameras. Our approach learns end-to-end without manual intervention, runs in real time, and can be trained and executed on an ordinary gaming computer. We report evaluation results showing that our low-cost system does not achieve the same fidelity of research prototypes using high-end hardware and closed source software, but it is capable of creating individual facial avatars with person-specific characteristics in movements and expressions.Comment: 9 pages, IEEE 3rd International Conference on Artificial Intelligence & Virtual Realit

    Robust laser frequency stabilization by serrodyne modulation

    Full text link
    We report the relative frequency stabilization of a distributed feedback erbium-doped fiber laser on an optical cavity by serrodyne frequency shifting. A correction bandwidth of 2.3 MHz and a dynamic range of 220 MHz are achieved, which leads to a strong robustness against large disturbances up to high frequencies. We demonstrate that serrodyne frequency shifting reaches a higher correction bandwidth and lower relative frequency noise level compared to a standard acousto-optical modulator based scheme. Our results allow to consider promising applications in the absolute frequency stabilization of lasers on optical cavities.Comment: 3 pages, accepted for publication in Optics Letter

    Plasma homocysteine levels and associated factors in community-dwelling adolescents: the EVA-TYROL study

    Get PDF
    BackgroundHomocysteine (Hcy) has been associated with an adverse cardiovascular risk profile in adolescents. Assessment of the association between plasma Hcy levels and clinical/laboratory factors might improve our understanding of the pathogenesis of cardiovascular disease.MethodsHcy was measured in 1,900 14- to 19-year-old participants of prospective population-based EVA-TYROL Study (44.3% males, mean age 16.4 years) between 2015 and 2018. Factors associated with Hcy were assessed by physical examination, standardized interviews, and fasting blood analysis.ResultsMean plasma Hcy was 11.3 ± 4.5 µmol/L. Distribution of Hcy was characterized by extreme right skew. Males exhibited higher Hcy and sex differences increased with increasing age. Univariate associations with Hcy emerged for age, sex, body mass index, high-density lipoprotein cholesterol, and for factors pertaining to blood pressure, glucose metabolism, renal function, and diet quality, whereas the most important multivariate predictors of Hcy were sex and creatinine.DiscussionClinical and laboratory factors associated with Hcy in adolescents were manifold, with sex and high creatinine identified as strongest independent determinants. These results may aid when interpreting future studies investigating the vascular risk of homocysteine

    Heat and particle exhaust in high-performance plasmas in Wendelstein 7-X

    Get PDF
    The paper reports for the first time the heat and particle exhaust at the plasma boundary through various edge diagnostics for the high-performance plasma obtained after pellet injection on Wendelstein 7-X. The plasma density at the edge is found to be reduced by a factor of 2 in the high-performance phase, supporting the previously reported density peaking at the plasma centre. The plasma beta effect on the magnetic topology is reflected by the appearance of the second strike line, which is well understood with simulation. However, during the rapid decay phase of the enhanced confinement, a transient localized heat flow of up to 16 MW m-2 is observed at the leading edge of a poorly cooled divertor component, which has not been understood but raises concerns about machine safety

    Signatures of Thermal Dilepton Radiation at RHIC

    Get PDF
    The properties of thermal dilepton production from heavy-ion collisions in the RHIC energy regime are evaluated for invariant masses ranging from 0.5 to 3 GeV. Using an expanding thermal fireball to model the evolution through both quark-gluon and hadronic phases various features of the spectra are addressed. In the low-mass region, due to an expected large background, the focus is on possible medium modifications of the narrow resonance structures from ω\omega and ϕ\phi mesons, whereas in the intermediate-mass region the old idea of identifying QGP radiation is reiterated including effects of chemical under-saturation in the early stages of central Au+Au collisions.Comment: 17 pages ReVTeX including 16 figure

    Tolerability of inhaled N-chlorotaurine in an acute pig streptococcal lower airway inflammation model

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Inhalation of N-chlorotaurine (NCT), an endogenous new broad spectrum non-antibiotic anti-infective, has been shown to be very well tolerated in the pig model recently. In the present study, inhaled NCT was tested for tolerability and efficacy in the infected bronchopulmonary system using the same model.</p> <p>Methods</p> <p>Anesthetized pigs were inoculated with 20 ml of a solution containing approximately 10<sup>8 </sup>CFU/ml <it>Streptococcus pyogenes </it>strain d68 via a duodenal tube placed through the tracheal tube down to the carina. Two hours later, 5 ml of 1% NCT aqueous solution (test group, n = 15) or 5 ml of 0.9% NaCl (control group, n = 16) was inhaled via the tracheal tube connected to a nebulizer. Inhalation was repeated every hour, four times in total. Lung function and haemodynamics were monitored. Bronchoalveolar lavage samples were removed for determination of colony forming units (CFU), and lung samples for histology.</p> <p>Results</p> <p>Arterial pressure of oxygen (PaO<sub>2</sub>) decreased rapidly after instillation of the bacteria in all animals and showed only a slight further decrease at the end of the experiment without a difference between both groups. Pulmonary artery pressure increased to a peak 1-1.5 h after application of the bacteria, decreased in the following hour and remained constant during treatment, again similarly in both groups. Histology demonstrated granulocytic infiltration in the central parts of the lung, while this was absent in the periphery. Expression of TNF-alpha, IL-8, and haemoxygenase-1 in lung biopsies was similar in both groups. CFU counts in bronchoalveolar lavage came to 170 (10; 1388) CFU/ml (median and 25 and 75 percentiles) for the NCT treated pigs, and to 250 (10; 5.5 × 10<sup>5</sup>) CFU/ml for NaCl treated pigs (p = 0.4159).</p> <p>Conclusions</p> <p>Inhaled NCT at a concentration of 1% proved to be very well tolerated also in the infected bronchopulmonary system. This study confirms the tolerability in this delicate body region, which has been proven in healthy pigs previously. Regarding efficacy, no conclusions can be drawn, mainly because of the limited test period of the model.</p
    corecore