63 research outputs found
Recommended from our members
Signal Coding Approaches for Spatial Audio and Unreliable Networks
This dissertation is divided into two parts. The first part is concerned with developing algorithms for the compression of emerging 3D audio format, while the second part investigates optimization techniques for error-resilient predictive compression systems design.In the first part, advances in development of compression algorithms for higher order ambisonics (HOA) data is presented. HOA has proven to be the method of choice in virtual reality applications, given its capability in reproducing spatial audio and its rendering flexibility. Recent standardization for HOA compression adopted a framework wherein HOA data are decomposed into principal components that are then encoded by standard audio coding, i.e., frequency domain quantization and entropy coding to exploit psychoacoustic redundancy. A noted shortcoming of this approach is the occasional mismatch in principal components across blocks, and the resulting suboptimal transitions in the data fed to the audio coder. In this dissertation, we propose a framework where singular value decomposition (SVD) is performed after transformation to the frequency domain via the modified discrete cosine transform (MDCT). This framework not only ensures smooth transition across blocks, but also enables frequency dependent SVD for better energy compaction. Moreover, we introduce a novel noise substitution technique to compensate for suppressed ambient energy in discarded higher order ambisonics channels, which significantly enhances the perceptual quality of the reconstructed HOA signal. In the next step, to reduce the burden of side information, a new encoding architecture is presented, where transform matrices are estimated backward-adaptively. This framework allows a more frequent usage of optimal SVD, thereby approaching the full potential of frequencydomain SVD. Also the division of HOA data into predominant and ambient components in current schemes, is difficult to perceptually optimize and ignores spatial inter channel masking effects. To address this issues, a new encoding framework for compression of HOA data is presented, where a null-space basis vector extension technique enables all compression to be performed in the SVD domain, and a jointly computed common masking threshold accounts for effects of spatial masking across components.The second part is concerned with developing optimization techniques for error-resilient predictive compression systems design. Prediction is used in virtually all compression systems and when such a compressed signal is transmitted over unreliable networks, packet losses can lead to significant error propagation through the prediction loop. Despite this, the conventional design technique completely ignores the effect of packet losses, and estimates the prediction parameters to minimize the mean squared prediction error, and optimizes the quantizer to minimize the reconstruction error at the encoder. While some design techniques have been proposed toaccurately estimate and minimize the end-to-end distortion (EED) at the decoderthat accounts for packet losses, they operate in a closed-loop, which introduces a mismatch between statistics used for design and statistics used in operation, causing a negative impact on convergenceand stability of the design procedure. The first contribution of the dissertation is this part is proposing an effective technique for designing a compression system with a first order linear predictor, that accounts for the instability caused by error propagation due to packet losses, and enjoys stable statistics during design by employing open-loop iterations that on convergence mimic closed loop operation.End-to-end distortion (EED) estimation, accounting for error propagationand concealment at the decoder, has been originally developed for video coding, and enables optimal rate-distortion (RD) decisions at the encoder. However, this approach was limited to the video coder’ssimple setting of a single tap constant coefficient temporal predictor. This thesis considerably generalized the framework to account for: i) high order prediction filters, and ii) filter adaptation to localsignal statistics. We demonstrate how this EED estimatecan be leveraged, by an encoder with short and long term linearprediction, to improve RD decisions and achieve major performance gains. The approach is further extended to estimate EED in speech coders. The error propagation problem is exacerbated in this case, as standard coders not only predict the signal from past frames, but also the parameters (in the line spectral frequency domain) employed for such prediction. Hence, the prediction loop propagates errors in the reconstructed signal as well as errors in the prediction parameters. A recursive algorithm is proposed to estimate, at the encoder, the overall EED, by the subterfuge of parallel tracking of decoder statistics for prediction parameters and signal reconstructions, in their respective domains, which are then combined to obtain the ultimate EED estimate
Life Sciences Program Tasks and Bibliography for FY 1996
This document includes information on all peer reviewed projects funded by the Office of Life and Microgravity Sciences and Applications, Life Sciences Division during fiscal year 1996. This document will be published annually and made available to scientists in the space life sciences field both as a hard copy and as an interactive Internet web page
The 1988 Goddard Conference on Space Applications of Artificial Intelligence
This publication comprises the papers presented at the 1988 Goddard Conference on Space Applications of Artificial Intelligence held at the NASA/Goddard Space Flight Center, Greenbelt, Maryland on May 24, 1988. The purpose of this annual conference is to provide a forum in which current research and development directed at space applications of artificial intelligence can be presented and discussed. The papers in these proceedings fall into the following areas: mission operations support, planning and scheduling; fault isolation/diagnosis; image processing and machine vision; data management; modeling and simulation; and development tools/methodologies
Life Sciences Program Tasks and Bibliography for FY 1997
This document includes information on all peer reviewed projects funded by the Office of Life and Microgravity Sciences and Applications, Life Sciences Division during fiscal year 1997. This document will be published annually and made available to scientists in the space life sciences field both as a hard copy and as an interactive internet web page
Third International Symposium on Space Mission Operations and Ground Data Systems, part 1
Under the theme of 'Opportunities in Ground Data Systems for High Efficiency Operations of Space Missions,' the SpaceOps '94 symposium included presentations of more than 150 technical papers spanning five topic areas: Mission Management, Operations, Data Management, System Development, and Systems Engineering. The papers focus on improvements in the efficiency, effectiveness, productivity, and quality of data acquisition, ground systems, and mission operations. New technology, techniques, methods, and human systems are discussed. Accomplishments are also reported in the application of information systems to improve data retrieval, reporting, and archiving; the management of human factors; the use of telescience and teleoperations; and the design and implementation of logistics support for mission operations
Research and technology, 1992
Selected research and technology activities at Ames Research Center, including the Moffett Field site and the Dryden Flight Research Facility, are summarized. These activities exemplify the Center's varied and productive research efforts for 1992
Combined brain language connectivity and intraoperative neurophysiologic techniques in awake craniotomy for eloquent-area brain tumor resection
Speech processing can be disturbed by primary brain tumors (PBT). Improvement of presurgical planning techniques decrease neurological morbidity associated to tumor resection during awake craniotomy. The aims of this work were: 1. To perform Diffusion Kurtosis Imaging based tractography (DKI-tract) in the detection of brain tracts involved in language; 2. To investigate which factors contribute to functional magnetic resonance imaging (fMRI) maps in predicting eloquent language regional reorganization; 3. To determine the technical aspects of accelerometric (ACC) recording of speech during surgery.
DKI-tracts were streamlined using a 1.5T magnetic resonance scanner. Number of tracts and fiber pathways were compared between DKI and standard Diffusion Tensor Imaging (DTI) in healthy subjects (HS) and PBT patients. fMRI data were acquired using task-specific and resting-state paradigms during language and motor tasks. After testing intraoperative fMRI’s influence on direct cortical stimulation (DCS) number of stimuli, graph-theory measures were extracted and analyzed. Regarding speech recording, ACC signals were recorded after evaluating neck positions and filter bandwidths. To test this method, language disturbances were recorded in patients with dysphonia and after applying DCS in the inferior frontal gyrus. In contrast, HS reaction time was recorded during speech execution.
DKI-tract showed increased number of arcuate fascicle tracts in PBT patients. Lower spurious tracts were identified with DKI-tract. Intraoperative fMRI and DCS showed similar stimuli in comparison with DCS alone. Increased local centrality accompanied language ipsilateral and contralateral reorganization. ACC recordings showed minor artifact contamination when placed at the suprasternal notch using a 20-200 Hz filter bandwidth. Patients with dysphonia showed decreased amplitude and frequency in comparison with HS. ACC detected an additional 11% disturbances after DCS, and a shortening of latency within the presence of a loud stimuli during speech execution. This work improved current knowledge on presurgical planning techniques based on brain structural and functional neuroimaging connectivity, and speech recordingA função linguística do ser humano pode ser afetada pela presença de tumores cerebrais (TC) A melhoria de técnicas de planeamento pré-cirurgico diminui a morbilidade neurológica iatrogénica associada ao seu tratamento cirúrgico. O objetivo deste trabalho é: 1. Testar a fiabilidade da tractografia estimada por difusor de kurtose (tract-DKI), dos feixes cerebrais envolvidos na linguagem 2. Identificar os fatores que contribuem para o mapeamento linguagem por ressonância magnética funcional (RMf) na predição da neuroplasticidade. 3. Identificar aspetos técnicos do registo da linguagem por accelerometria (ACC).
A DKI-tract foi estimada após realização de RM cerebral com 1.5T. O número e percurso das fibras foi avaliado. A RMf foi adquirida durante realização de tarefas linguísticas, motoras, e em repouso. Foi testada influência dos mapas de ativação calculados por RMf, no número de estímulos realizados durante a estimulação direta cortical (EDC) intraoperatória. Medidas de conectividade foram extraídas de regiões cerebrais. A posição e filtragem de sinal ACC foram estudadas após vocalização de palavras. O sinal ACC obtido em voluntários foi comparado com doentes disfónicos, após estimulação do giro inferior frontal, e após a adição de um estímulo sonoro perturbador durante vocalização.
A tract-DKI estimou um elevado número de fascículos do feixe arcuato com menos falsos negativos. Os mapas linguísticos de RMf intraoperatória, não influenciou a EDC. Medidas de centralidade aumentaram após neuroplasticidade ipsilateral e contralateral. A posição supraesternal e a filtragem de sinal ACC entre 20-200Hz demonstrou menor ruido de contaminação. Este método identificou diminuição de frequência e amplitude em doentes com disfonia, 11% de erros linguísticos adicionais após estimulação e diminuição do tempo de latência quando presente o sinal sonoro perturbador.
Este trabalho promoveu a utilização de novas técnicas no planeamento pré-cirúrgico do doente com tumor cerebral e alterações da linguagem através do estudo de conectividade estrutural, funcional e registo da linguagem
- …