8 research outputs found

High-Quality Time Stretch and Pitch Shift Effects for Speech and Audio Using the Instantaneous Harmonic Analysis

Author: A Petrovsky
AS Spanias
B Boashash
D Gabor
E Azarov
F Zhang
L Weruaga
P Maragos
S Levine
T Abe
TF Quatieri
X Serra
Publication venue: SpringerOpen
Publication date: 01/01/2010

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Relationships between Protein Intake and Renal Function in a Japanese General Population: NIPPON DATA90

Author: Beck GJ Berg RL, Coggins CH, Gassm
Brenner BM Meyer TM, Hostetter TH
Hozawa A Okamura T, Kadowaki T, Mu
Ihle BU Becker GJ, Whitworth JA, C
King AJ Levey AS
Kitazato H Fujita H, Shimotomai T,
Klahr S Levey AS, Beck GJ, Caggiul
Kontessis P Jones S, Dodds R, Trev
Levey AS Bosch JP, Lewis JB, Green
Levey AS Coresh J, Balk E, Kausz A
Levey AS Greene T, Sarnak MJ, Wang
McGandy RB Barrows CH Jr, Spanias
Munro HN McGandy RB, Hartz SC, Rus
Nakamura H Ito S, Ebe N, Shibata A
Okuda N Miura K, Yoshita K, Matsum
Publication venue: Japan Epidemiological Association
Publication date: 01/01/2010

Crossref

A New Method to Represent Speech Signals Via Predefined Signature and Envelope Sequences

Author: AJ Newman
AM Karaş
AN Akansu
AS Spanias
BS Yarman
E Oja
G Strang
G Varile
H Gürkan
H Hotelling
IPA
IT Jolliffe
JS Garofolo
K Fukunaga
K Pearson
R Akdeniz
S Watanabe
SR Quackenbush
WD Voiers
Y Linde
Ü Güz
Publication venue: SpringerOpen
Publication date: 01/01/2006

A novel systematic procedure, referred to as "SYMPES", is introduced to model speech signals. The structure of SYMPES is based on the creation of the so-called predefined signature set S = {S_R(n)} and envelope set E = {E_K(n)}. These sets are speaker and language independent. Once the speech signal is divided into frames of a selected length, each frame sequence X_i(n) is reconstructed by means of the mathematical form X_i(n) = C_i E_K(n) S_R(n). In this representation, C_i is called the gain factor, and S_R(n) and E_K(n) are assigned from the predefined signature and envelope sets, respectively. Examples are given to exhibit the implementation of SYMPES. It is shown that, for the same compression ratio or better, SYMPES yields considerably better speech quality than commercially available coders such as G.726 (ADPCM) at 16 kbps and voice-excited LPC-10E (FS1015) at 2.4 kbps.
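The reconstruction formula in the abstract above, a frame formed as gain times envelope times signature, can be illustrated with a minimal Python sketch. The toy signature and envelope sets, the frame length of 8, and the function name `reconstruct_frame` are assumptions made purely for illustration; the abstract does not say how the predefined sets are built or how the gain and the set members are selected for a frame.

```python
import math


def reconstruct_frame(gain, envelope, signature):
    """Return one frame as gain * envelope[n] * signature[n] for every sample n."""
    if len(envelope) != len(signature):
        raise ValueError("envelope and signature must have the same length")
    return [gain * e * s for e, s in zip(envelope, signature)]


# Hypothetical toy sets. In SYMPES the sets are predefined in advance;
# these sinusoids and ramps are placeholders, not the authors' data.
FRAME_LEN = 8
signatures = [
    [math.sin(2 * math.pi * n / FRAME_LEN) for n in range(FRAME_LEN)],
    [math.cos(2 * math.pi * n / FRAME_LEN) for n in range(FRAME_LEN)],
]
envelopes = [
    [1.0] * FRAME_LEN,                               # flat envelope
    [1.0 - n / FRAME_LEN for n in range(FRAME_LEN)],  # decaying envelope
]

# A frame described only by a gain and one entry from each set.
frame = reconstruct_frame(gain=0.5, envelope=envelopes[1], signature=signatures[0])
```

Under these assumptions a frame is described by three values only (the gain and one index into each set), which is one way to read the compression claim made in the abstract.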

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Isik University Academic Open Access

Introduction

Author: AM Kondoz
AS Spanias
AV McCree
C Tsao
CJ Weinstein
DY Wong
F Bimbot
GS Kang
JL Flanagan
KK Paliwal
KS Lee
LJ Fransen
MM Sondhi
RV Cox
S Narayanan
S Parthasarathy
T Dutoit
WB Kleijn
X Dong
Y Li
Y Shiraki
Publication venue: Springer Science and Business Media LLC

Crossref

Design of MELPe-Based Variable-Bit-Rate Speech Coding with Mel Scale Approach Using Low-Order Linear Prediction Filter and Representing Excitation Signal Using Glottal Closure Instants

Author: A Gray
AS Spanias
AV McCree
B Yegnanarayana
BS Atal
C Ma
CM Vikram
D Pravena
D Singh
E Kruger
GJ Lal
HS Pannu
JD Gibson
JM Hillenbrand
JS Garofolo
JW Picone
KK Paliwal
KS Rao
LR Rabiner
M Kaur
T Ananthapadmanabha
WC Chu
Y Hu
Publication venue: Springer Science and Business Media LLC

Crossref

A near-end listening enhancement system by RNN-based noise cancellation and speech modification

Author: A Varga
AB Aicha
ANSI
AS Spanias
C Yan
CH Taal
D Yu
E Jokinen
Gang Li
IB Thomas
KR Rao
L Deng
M Cooke
NV George
P ITU-T
PN Petkov
R Niederjohn
Rui Zhang
Ruimin Hu
S Khademi
SM Kuo
T Painter
TC Zorilă
WB Kleijn
Xiaochen Wang
Publication venue: Springer Science and Business Media LLC

Crossref

A uniform phase representation for the harmonic model in speech synthesis applications

Author: A El-Jaroudi
A Oppenheim
A Sugiyama
AS Spanias
AV Oppenheim
B Bozkurt
B Doval
B Yegnanarayana
C Hamon
D Erro
D Zhu
Daniel Erro
DB Paul
E Banos
E Moulines
E Rodriguez-Banga
G Degottex
G Kafentzis
G Richard
Gilles Degottex
H Banno
H Kawahara
H Zen
HA Murthy
I Sainz
I Saratxaga
J Bonada
J Kominek
J Laroche
J Latorre
JM Scott
K Tokuda
K Yu
M Campedel-Oudot
M Cooke
M Tahon
MJF Gales
NI Fisher
P Lanchantin
P Mowlaee
PA Naylor
R Maia
R McAulay
R Smits
RJ McAulay
RL Miller
SB Davis
SP Lipshitz
SR Schweinberger
T Ananthapadmanabha
T Drugman
T Kinnunen
T Quatieri
TF Quatieri
V Hansen
X Anguera
Y Agiomyrgiannakis
Y Ohtani
Y Pantazis
Y Shiga
Y Stylianou
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of the speech signal in speech transformation and synthesis. For the harmonic model, which provides excellent perceived quality, features for the amplitude parameters already exist (e.g., Line Spectral Frequencies (LSF), Mel-Frequency Cepstral Coefficients (MFCC)). However, because of the wrapping of the phase parameters, phase features are more difficult to design. To randomize the phase of the harmonic model during synthesis, a voicing feature is commonly used, which distinguishes voiced and unvoiced segments. However, voice production allows smooth transitions between voiced and unvoiced states, which sometimes makes voicing segmentation tricky to estimate. In this article, two phase features are suggested to represent the phase of the harmonic model in a uniform way, without a voicing decision. The synthesis quality of the resulting vocoder has been evaluated, using subjective listening tests, in the context of resynthesis, pitch scaling, and Hidden Markov Model (HMM)-based synthesis. The experiments show that the suggested signal model is comparable to STRAIGHT or even better in some scenarios. They also reveal some limitations of the harmonic framework itself in the case of high fundamental frequencies.

G. Degottex has been funded by the Swiss National Science Foundation (SNSF) (grants PBSKP2_134325, PBSKP2_140021), Switzerland, and the Foundation for Research and Technology-Hellas (FORTH), Heraklion, Greece. D. Erro has been funded by the Basque Government (BER2TEK, IE12-333) and the Spanish Ministry of Economy and Competitiveness (SpeechTech4All, TEC2012-38939-C03-03).
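The abstract above states that phase wrapping makes phase features harder to design. The short Python sketch below illustrates that general difficulty and is not taken from the article: two phases that sit on either side of the wrap point are almost the same angle, yet ordinary arithmetic treats them as far apart. The helper names `wrap` and `circular_mean` and the example values are assumptions chosen for illustration.

```python
import cmath
import math


def wrap(phase):
    """Map a phase in radians onto the principal range [-pi, pi]."""
    return math.atan2(math.sin(phase), math.cos(phase))


def circular_mean(phases):
    """Mean direction of the phases, computed on the unit circle."""
    return cmath.phase(sum(cmath.exp(1j * p) for p in phases))


# Two phases that are nearly the same angle, one on each side of the wrap point.
a, b = 3.1, -3.1

naive_mean = (a + b) / 2               # ordinary average of the wrapped values
circ_mean = circular_mean([a, b])      # average taken on the circle instead
circular_distance = abs(wrap(a - b))   # true angular distance between a and b
```

Here the ordinary average lands at zero, on the opposite side of the circle from both inputs, while the circular mean stays next to them at the wrap point. Any feature computed directly from wrapped phase values has to cope with this discontinuity.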

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital para la Docencia y la Investigación

Applications

Author: AK Jain
AS Spanias
B Ji
BG Jo
BG Sherlock
C-M Liu
D Sevic
DF Elliott
DJ Mulvaney
E Castro De
E Viscito
G Strang
GA Davidson
H Hartenstein
H Zhai
HC Andrews
IJ Good
IY Choi
J Ma
JD Johnston
JE Shore
JJK Ó Ruanaidh
JL Starck
JM Boyce
JP Princen
K Brandenburg
K Ito
K Sayood
KR Rao
LC Ludeman
M Borgerding
M Bosi
M Boucheret
M Iwadare
M Ramkumar
M-L Ku
MD Swanson
MF Cátedra
MN Do
N Ahmed
O Urhan
OK Ersoy
P Duhamel
P Noll
PM Shankar
RH Bamberger
RM Jiang
RW Cox
S Shlien
S Venkataraman
S Wendling
S-I Park
S-W Lee
SB Weinstein
SD Stearns
SJ Leon
TD Lookabaugh
TJ Peters
TK Sarkar
V Britanak
W Bender
W Philips
WK Pratt
WM Lawton
WY Zou
Y Tanaka
Y Wu
Publication venue: Springer Science and Business Media LLC

Crossref