This article describes an architecture for translating speech into Spanish Sign Language (SSL). The proposed architecture is made up of four modules: speech recognition, semantic analysis, gesture sequence generation, and gesture animation. For the speech recognition and semantic analysis modules, we use software developed by IBM and by CSLR (the Center for Spoken Language Research at the University of Colorado), respectively. Gesture sequence generation and gesture animation are the modules on which we have focused our main effort. Gesture sequence generation takes the semantic concepts obtained from the semantic analysis and associates each of them with one or more SSL gestures; this association is carried out by a set of generation rules. For gesture animation, we have developed an animated agent (a virtual representation of a human) together with a strategy for reducing the effort of animating gestures: the system automatically generates all the agent positions needed for the animation from a few main positions (two or three per second) and a set of interpolation strategies, both prepared in advance by the service developer (the person who adapts the proposed architecture to a specific domain). For this module, we also propose a distance measure between agent positions and a measure of gesture complexity, which can be used to analyze how well a gesture is perceived relative to its complexity. With the proposed architecture, we are not trying to build a domain-independent translator but a system able to translate speech utterances into gesture sequences within a restricted domain, such as railway, flight, or weather information.
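The rule-based association between semantic concepts and SSL gestures can be sketched roughly as follows. This is a hypothetical illustration, not the paper's implementation: the concept names, rules, and gesture labels are invented for the example.

```python
# Illustrative sketch of rule-based gesture sequence generation.
# Each rule maps a semantic concept (as produced by the semantic
# analysis module) to one or more SSL gesture labels. All names here
# are assumptions for illustration.
GENERATION_RULES = {
    "greeting": ["HELLO"],
    "departure_time": ["TRAIN", "LEAVE", "TIME"],
    "destination": ["GO", "CITY"],
}

def generate_gestures(concepts):
    """Concatenate the gesture labels for each recognized concept.

    Concepts with no matching rule are skipped here; a real system
    might instead fall back to fingerspelling the word.
    """
    sequence = []
    for concept in concepts:
        sequence.extend(GENERATION_RULES.get(concept, []))
    return sequence

print(generate_gestures(["greeting", "departure_time"]))
# → ['HELLO', 'TRAIN', 'LEAVE', 'TIME']
```

In a restricted domain such as railway information, a small rule table like this can cover most of the concepts the semantic analyzer emits, which is what makes the domain restriction practical.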
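The keyframe strategy for gesture animation, as described above, can be sketched as follows: the service developer supplies a few main agent positions (two or three per second) and the system interpolates the intermediate frames. The joint names, frame rate, and the linear interpolation scheme are assumptions for illustration; the paper allows other interpolation strategies.

```python
# Minimal sketch of keyframe expansion for the animated agent, assuming
# agent positions are represented as dicts of joint angles (an assumption
# for this example).

def lerp_position(pos_a, pos_b, t):
    """Linearly interpolate between two agent positions at fraction t."""
    return {joint: (1 - t) * pos_a[joint] + t * pos_b[joint] for joint in pos_a}

def interpolate_frames(keyframes, fps=25, keys_per_second=2):
    """Expand sparse developer-authored keyframes into a full frame
    sequence at the target frame rate."""
    frames_per_key = fps // keys_per_second
    frames = []
    for key_a, key_b in zip(keyframes, keyframes[1:]):
        for i in range(frames_per_key):
            frames.append(lerp_position(key_a, key_b, i / frames_per_key))
    frames.append(keyframes[-1])  # end exactly on the last key position
    return frames

start = {"elbow": 0.0, "wrist": 10.0}
end = {"elbow": 90.0, "wrist": 40.0}
frames = interpolate_frames([start, end], fps=25, keys_per_second=2)
print(len(frames))          # → 13 (12 interpolated steps + final key)
print(frames[6]["elbow"])   # → 45.0 (halfway between the two keys)
```

A per-joint distance between two such positions (for example, a sum of absolute angle differences) gives a natural building block for the gesture complexity measure mentioned above: a gesture whose keyframes are far apart requires larger, faster movements.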