An on-line speaker adaptation method for HMM-based speech recognizers

In the past few years numerous techniques have been proposed to improve the efficiency of basic adaptation methods like MLLR and MAP. These adaptation methods have a common aim, which is to increase the likelihood of the phoneme models for a particular speaker. During their operation, these speaker...

Teljes leírás

Elmentve itt :

Bibliográfiai részletek
Szerzők:	Bánhalmi András Kocsor András
Testületi szerző:	Conference for PhD Students in Computer Science (5.) (2006) (Szeged)
Dokumentumtípus:	Cikk
Megjelent:	2008
Sorozat:	Acta cybernetica 18 No. 3
Kulcsszavak:	Számítástechnika, Kibernetika
Tárgyszavak:	Természettudományok Számítás- és információtudomány
Online Access:	http://acta.bibl.u-szeged.hu/12825


LEADER	02293nab a2200241 i 4500
001	acta12825
005	20220616144159.0
008	161015s2008 hu o 0\|\| eng d
022			\|a 0324-721X
040			\|a SZTE Egyetemi Kiadványok Repozitórium \|b hun
041			\|a eng
100	1		\|a Bánhalmi András
245	1	3	\|a An on-line speaker adaptation method for HMM-based speech recognizers \|h [elektronikus dokumentum] / \|c Bánhalmi András
260			\|c 2008
300			\|a 379-390
490	0		\|a Acta cybernetica \|v 18 No. 3
520	3		\|a In the past few years numerous techniques have been proposed to improve the efficiency of basic adaptation methods like MLLR and MAP. These adaptation methods have a common aim, which is to increase the likelihood of the phoneme models for a particular speaker. During their operation, these speaker adaptation methods need precise phonetic segmentation information of the actual utterance, but these data samples are often faulty. To improve the overall performance, only those frames from the spoken sentence which are well segmented should be retained, while the incorrectly segmented data should not be used during adaptation. Several heuristic algorithms have been proposed in the literature for the selection of the reliably segmented data blocks, and here we would like to suggest some new heuristics that discriminate between faulty and well-segmented data. The effect of these methods on the efficiency of speech recognition using speaker adaptation is examined, and conclusions for each will be drawn. Besided post-filtering the set of the segmented adaptation examples, another way of improving the efficiency of the adaptation method might be to create a more precise segmentation, which should then reduce the chance of faulty data samples being included. We suggest a method like this here as well which is based on a scoring procedure for the N-best lists, taking into account phoneme duration.
650		4	\|a Természettudományok
650		4	\|a Számítás- és információtudomány
695			\|a Számítástechnika, Kibernetika
700	0	1	\|a Kocsor András \|e aut
710			\|a Conference for PhD Students in Computer Science (5.) (2006) (Szeged)
856	4	0	\|u http://acta.bibl.u-szeged.hu/12825/1/Banhalmi_2008_ActaCybernetica.pdf \|z Dokumentum-elérés

An on-line speaker adaptation method for HMM-based speech recognizers

Hasonló tételek