搜索结果: 1-15 共查到“speech recognition”相关记录17条 . 查询时间(0.062 秒)
PARAMETRIC REPRESENTATION OF THE SPEAKER’S LIPS FOR MULTIMODAL SIGN LANGUAGE AND SPEECH RECOGNITION
Sign language Gestures Speech recognition Computer Vision Principal Component Analysis Machine learning Face detection Linear contrasting
2017/5/15
In this article, we propose a new method for parametric representation of human’s lips region. The functional diagram of the method is described and implementation details with the explanation of its ...
Speech Feature Denoising and Dereverberation via Deep Autoencoders for Noisy Reverberant Speech Recognition
robust speech recognition feature denoising denoising autoencoder deep neural network
2014/11/27
Denoising autoencoders (DAs) have shown success in generating robust features for images, but there has been limited work in applying DAs for speech. In this paper we present a deep denoising autoenco...
Multi-level Context-dependent Acoustic Modeling for Automatic Speech Recognition
Multi-level Context-dependent Acoustic Modeling Automatic Speech Recognition
2014/11/27
In this paper, we propose a multi-level, contextdependent acoustic modeling framework for automatic speech recognition. For each context-dependent unit considered by the recognizer, we construct a set...
An Efferent-Inspired Auditory Model Front-End for Speech Recognition
efferent auditory model feature extraction
2014/11/27
In this paper, we investigate a closed-loop auditory model and explore its potential as a feature representation for speech recognition. The closed-loop representation consists of an auditory-based, e...
A Back-off Discriminative Acoustic Model for Automatic Speech Recognition
context-dependent acoustic modeling back-off acoustic models discriminative training
2014/11/27
In this paper we propose a back-off discriminative acoustic model for Automatic Speech Recognition (ASR). We use a set of broad phonetic classes to divide the classification problem originating from c...
Research Developments and Directions in Speech Recognition and Understanding, Part 1
Research Developments Speech Recognition Understanding
2014/11/27
To advance research, it isimportant to identify prom-ising future research direc-tions, especially those thathave not been adequately pursued or funded in the past. The working group producing this ar...
SPEECH RECOGNITION WITH LOCALIZED TIME-FR EQUENCY PATTERN DETECTORS
automatic speech recognition acoustic modeling
2014/11/27
SPEECH RECOGNITION WITH LOCALIZED TIME-FR EQUENCY PATTERN DETECTORS.
THE PHASE SPECTRA BASED FEATURE FOR ROBUST SPEECH RECOGNITION
Group delay function Phase Spectrum Robust phoneme recognition
2009/1/1
Speech recognition in adverse environment is one of the major issue in
automatic speech recognition nowadays. While most current speech recognition system
show to be highly efficient for ideal envir...
STATE-DEPENDENT MIXTURE TYING WITH VARIABLE CODEBOOK SIZE FOR ACCENTED SPEECH RECOGNITION
State-dependent tied mixture models variable codebook size
2008/7/27
In this paper, we propose a state-dependent tied mixture (SDTM) models with variable codebook size to improve the model robustness for accented p honetic variations while maintaining model discrimina...
State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition
Speech recognition dialectal Ch inese speech recognition state-dependent phoneme-based model merging acoustic modeling acoustic model distance measure
2006/12/13
Aiming at building a dialectal Chinese speech rec ognizer from a standard Chinese speech recognizer with a small am ount of dialectal Chinese speech, a novel, simple but effective acoustic modeling me...
Weighting Observation Vectors for Robust Speech Recognition in Noisy Environments
Weighting Observation Vectors Robust Speech Recognition Noisy Environments
2004/10/4
In this paper, we propose a novel approach to robust speech recognition in noisy environments by discriminating the observation vectors. In conventional HMM-based speech recognition, all the observ...
Audio-Visual Speech Recognition using Red Exclusion and Neural Networks
Audio-Visual Speech Recognition Feature Extraction Neural Networks Sensor Fusion
2002/5/1
Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrades in noisy environments. Audio-Visual Speech Recognition(AVSR) combats this by incorporating a visu...
A Real-World Speech Recognition System Based on CDCPMs
Speaker-Independent Speech Recognition center-distance continuous probability model (CDCPM) embedded multiple-model (EMM)
1998/3/27
In this paper a real-world continuous-manner 2000-phrase speaker-independent Chinese speech recognition system is introduced, the vocabulary of which consists of 2000 Chinese phrases and each phrase i...
A Log-Index Weighted Cepstral Distance Measure for Speech Recognition
Log-Index Weighted Cepstral Distance Measure Speech Recognition
1997/5/27
A log-index weighted cepstral distance measure is proposed and tested in speaker-independent and speaker-dependent isolated word recognition systems using statistic techniques. The weights for the cep...