141 research outputs found

    Audio Coding Based on Integer Transforms

    In recent years, audio coding has become a very popular field for research and applications. Perceptual audio coding schemes in particular, such as MPEG-1 Layer-3 (MP3) and MPEG-2 Advanced Audio Coding (AAC), are widely used for efficient storage and transmission of music signals. For professional applications, however, such as archiving and transmission in studio environments, lossless audio coding schemes are considered more appropriate. Traditionally, the technical approaches used in perceptual and lossless audio coding have been separate worlds. In perceptual audio coding, filter banks such as the lapped orthogonal transform "Modified Discrete Cosine Transform" (MDCT) have been the approach of choice in many state-of-the-art coding schemes. Lossless audio coding schemes, on the other hand, mostly employ predictive coding of waveforms to remove redundancy. Only a few attempts have been made so far to use transform coding for lossless audio coding. This work presents a new approach that applies the lifting scheme to the lapped transforms used in perceptual audio coding. This allows for an invertible integer-to-integer approximation of the original transform, e.g. the IntMDCT as an integer approximation of the MDCT. The same technique can also be applied to low-delay filter banks. A generalized, multi-dimensional lifting approach and a noise-shaping technique are introduced, which further improve the accuracy of the approximation to the original transform. Based on these new integer transforms, this work presents new audio coding schemes and applications. The audio coding applications cover lossless audio coding, scalable lossless enhancement of a perceptual audio coder, and fine-grain scalable perceptual and lossless audio coding. Finally, an approach to inaudible data hiding with high data rates in uncompressed audio signals, based on integer transforms, is described.
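
    The abstract's central idea, an invertible integer-to-integer approximation obtained through lifting, can be illustrated on a single plane rotation. The sketch below is not the thesis's IntMDCT. It only assumes the standard factorization of a rotation into three lifting (shear) steps, with each step rounded to an integer. The function names `int_rotate` and `int_rotate_inverse` are hypothetical and chosen for illustration.

```python
import math


def _lift_coeffs(theta):
    """Return the shear coefficients (t, s) for a rotation by theta.

    Uses the factorization R(theta) = S(t) * L(s) * S(t) with
    t = (cos(theta) - 1) / sin(theta) and s = sin(theta).
    Assumes theta is not a multiple of pi (sin(theta) != 0).
    """
    s = math.sin(theta)
    if abs(s) < 1e-12:
        raise ValueError("theta must not be a multiple of pi")
    t = (math.cos(theta) - 1.0) / s
    return t, s


def _rnd(v):
    # Deterministic rounding to the nearest integer (ties go up).
    return math.floor(v + 0.5)


def int_rotate(x, y, theta):
    """Integer approximation of (x, y) rotated by theta, via three lifting steps."""
    t, s = _lift_coeffs(theta)
    x = x + _rnd(t * y)  # lifting step 1: update x from y
    y = y + _rnd(s * x)  # lifting step 2: update y from the new x
    x = x + _rnd(t * y)  # lifting step 3: update x from the new y
    return x, y


def int_rotate_inverse(x, y, theta):
    """Exact inverse: undo the lifting steps in reverse order."""
    t, s = _lift_coeffs(theta)
    x = x - _rnd(t * y)  # undo step 3
    y = y - _rnd(s * x)  # undo step 2
    x = x - _rnd(t * y)  # undo step 1
    return x, y


if __name__ == "__main__":
    theta = 0.3
    x, y = 1234, -567
    u, v = int_rotate(x, y, theta)
    c, s = math.cos(theta), math.sin(theta)
    print("integer rotation:", (u, v))
    print("exact rotation:  ", (c * x - s * y, s * x + c * y))
    print("round trip ok:   ", int_rotate_inverse(u, v, theta) == (x, y))
```

    Each lifting step adds a rounded function of one coordinate to the other, so subtracting the same rounded value undoes it exactly. Invertibility therefore does not depend on how accurate the rounding is, while the output stays close to the exact rotation.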

    Unmasking Communication Partners: A Low-Cost AI Solution for Digitally Removing Head-Mounted Displays in VR-Based Telepresence

    Face-to-face conversation in Virtual Reality (VR) is a challenge when participants wear head-mounted displays (HMDs). A significant portion of a participant's face is hidden, and facial expressions are difficult to perceive. Past research has shown that high-fidelity face reconstruction with personal avatars in VR is possible under laboratory conditions with high-cost hardware. In this paper, we propose one of the first low-cost systems for this task, which uses only free, open-source software and affordable hardware. Our approach is to track the user's face underneath the HMD using a Convolutional Neural Network (CNN) and to generate corresponding expressions with Generative Adversarial Networks (GANs), producing RGBD images of the person's face. We use commodity hardware with low-cost extensions such as 3D-printed mounts and miniature cameras. Our approach learns end-to-end without manual intervention, runs in real time, and can be trained and executed on an ordinary gaming computer. We report evaluation results showing that our low-cost system does not achieve the same fidelity as research prototypes using high-end hardware and closed-source software, but it is capable of creating individual facial avatars with person-specific characteristics in movements and expressions. Comment: 9 pages, IEEE 3rd International Conference on Artificial Intelligence & Virtual Reality

    Robust laser frequency stabilization by serrodyne modulation

    We report the relative frequency stabilization of a distributed feedback erbium-doped fiber laser on an optical cavity by serrodyne frequency shifting. A correction bandwidth of 2.3 MHz and a dynamic range of 220 MHz are achieved, which leads to strong robustness against large disturbances up to high frequencies. We demonstrate that serrodyne frequency shifting reaches a higher correction bandwidth and a lower relative frequency noise level than a standard scheme based on an acousto-optical modulator. Our results open up promising applications in the absolute frequency stabilization of lasers on optical cavities. Comment: 3 pages, accepted for publication in Optics Letters

    Plasma homocysteine levels and associated factors in community-dwelling adolescents: the EVA-TYROL study

    Background: Homocysteine (Hcy) has been associated with an adverse cardiovascular risk profile in adolescents. Assessment of the association between plasma Hcy levels and clinical/laboratory factors might improve our understanding of the pathogenesis of cardiovascular disease. Methods: Hcy was measured in 1,900 14- to 19-year-old participants of the prospective population-based EVA-TYROL Study (44.3% males, mean age 16.4 years) between 2015 and 2018. Factors associated with Hcy were assessed by physical examination, standardized interviews, and fasting blood analysis. Results: Mean plasma Hcy was 11.3 ± 4.5 µmol/L. The distribution of Hcy was characterized by an extreme right skew. Males exhibited higher Hcy, and sex differences increased with increasing age. Univariate associations with Hcy emerged for age, sex, body mass index, high-density lipoprotein cholesterol, and for factors pertaining to blood pressure, glucose metabolism, renal function, and diet quality, whereas the most important multivariate predictors of Hcy were sex and creatinine. Discussion: Clinical and laboratory factors associated with Hcy in adolescents were manifold, with sex and high creatinine identified as the strongest independent determinants. These results may aid in interpreting future studies investigating the vascular risk of homocysteine.

    Signatures of Thermal Dilepton Radiation at RHIC

    The properties of thermal dilepton production from heavy-ion collisions in the RHIC energy regime are evaluated for invariant masses ranging from 0.5 to 3 GeV. Using an expanding thermal fireball to model the evolution through both quark-gluon and hadronic phases, various features of the spectra are addressed. In the low-mass region, due to an expected large background, the focus is on possible medium modifications of the narrow resonance structures from ω and ϕ mesons, whereas in the intermediate-mass region the old idea of identifying QGP radiation is reiterated, including effects of chemical under-saturation in the early stages of central Au+Au collisions. Comment: 17 pages ReVTeX including 16 figures

    Tolerability of inhaled N-chlorotaurine in an acute pig streptococcal lower airway inflammation model

    Background: Inhalation of N-chlorotaurine (NCT), an endogenous, new, broad-spectrum, non-antibiotic anti-infective, has recently been shown to be very well tolerated in the pig model. In the present study, inhaled NCT was tested for tolerability and efficacy in the infected bronchopulmonary system using the same model. Methods: Anesthetized pigs were inoculated with 20 ml of a solution containing approximately 10^8 CFU/ml Streptococcus pyogenes strain d68 via a duodenal tube placed through the tracheal tube down to the carina. Two hours later, 5 ml of 1% NCT aqueous solution (test group, n = 15) or 5 ml of 0.9% NaCl (control group, n = 16) was inhaled via the tracheal tube connected to a nebulizer. Inhalation was repeated every hour, four times in total. Lung function and haemodynamics were monitored. Bronchoalveolar lavage samples were taken for determination of colony forming units (CFU), and lung samples for histology. Results: Arterial pressure of oxygen (PaO2) decreased rapidly after instillation of the bacteria in all animals and showed only a slight further decrease at the end of the experiment, with no difference between the two groups. Pulmonary artery pressure increased to a peak 1-1.5 h after application of the bacteria, decreased in the following hour, and remained constant during treatment, again similarly in both groups. Histology demonstrated granulocytic infiltration in the central parts of the lung, while this was absent in the periphery. Expression of TNF-alpha, IL-8, and haemoxygenase-1 in lung biopsies was similar in both groups. CFU counts in bronchoalveolar lavage came to 170 (10; 1388) CFU/ml (median with 25th and 75th percentiles) for the NCT-treated pigs, and to 250 (10; 5.5 × 10^5) CFU/ml for the NaCl-treated pigs (p = 0.4159). Conclusions: Inhaled NCT at a concentration of 1% proved to be very well tolerated in the infected bronchopulmonary system as well. This study confirms the tolerability in this delicate body region that was previously shown in healthy pigs. Regarding efficacy, no conclusions can be drawn, mainly because of the limited test period of the model.

    Actin Fusion Proteins Alter the Dynamics of Mechanically Induced Cytoskeleton Rearrangement

    Mechanical forces can regulate various functions in living cells. The cytoskeleton is a crucial element for the transduction of forces into cell-internal signals and subsequent biological responses. Accordingly, many studies in cellular biomechanics have focused on the role of the contractile acto-myosin system in such processes. A widely used method to observe the dynamic actin network in living cells is the transgenic expression of fluorescent proteins fused to actin. However, adverse effects of GFP-actin fusion proteins on cell spreading, migration, and cell adhesion strength have been reported. These shortcomings were shown to be partly overcome by fusions of actin-binding peptides to fluorescent proteins. Nevertheless, it is not understood whether direct labeling by actin fusion proteins or indirect labeling via these chimaeras alters the biomechanical responses of cells and the cytoskeleton to forces. We investigated the dynamic reorganization of actin stress fibers in cells under cyclic mechanical loading by transiently expressing either egfp-Lifeact or eyfp-actin in rat embryonic fibroblasts and observing them by means of live cell microscopy. Our results demonstrate that mechanically induced actin stress fiber reorganization exhibits very different kinetics in EYFP-actin cells and EGFP-Lifeact cells, the latter showing a remarkable agreement with the reorganization kinetics of non-transfected cells under the same experimental conditions.