![]() |
Sign Language Processing
University of Colorado at Boulder - Computer Science Department
Collaboration Technology Research Group
Here’s a collection of Sign Language related resources. It focuses mainly on literature on automatic Sign Language processing. Other links were collected as a result of this main search and included whenever found to be potentially interesting. This is of course an ongoing effort. If you have contributions not listed here, please send me a link (your contribution will of course be acknowledged).
Additionally, you may want to look at my Multimodality page.
Sign Languages are visual languages consisting of formalized gestures, facial expressions including lip movement and eye-gaze. Gestures usually involve whole body movements, in particular movements of the hands and arms. Facial expressions are commonly used as part of the language. In American Sign Language (ASL), for instance, determining if an utterance is a statement of fact or a question, depends on eyebrow and facial configurations (raised eyebrows on a tilted forward head are used to mark questions); in some sentences, determining if it is affirmative or negative depends on interpreting head movements. Being full languages, Sign Languages are complex and pose at least similar obstacles to machine processing as spoken languages do.
Sign language processing constitutes a specialization of the broader human movement analysis research and is anchored on similar techniques as the ones employed for, e.g. gesture and facial analysis in general. There are a variety of algorithms that target gesture (both through vision and through instrumented gloves), facial expression and lipreading, as well as other sources of data (modalities) such as eye-gaze detection. Most of the work targets one of the possible modalities independently from the others, particularly in Sign Language processing, where the bulk of the work is done on interpretation of hand-gestures, or even more narrowly on fingerspelling. Existing techniques for analyzing independent human motion modalities are mainly based on traditional processing tools and methods, e.g., Hidden Markov Models (HMMs) (e.g. Starner and Pentland, 1996; Assan and Grobel, 1998; Hienz et al, 1999), neural nets (e.g. Wilson and Anspach, 1993; Erenshteyn et al., 1994; Vamplew, 1996), rules extracted by a temporal concept learner (Kadous, 1998), bayesian nets (Garg et al., 2000), finite state machines (Hong et al., 2000), colored Petri nets coupled to HMMs (Huang and Jeng, 2001).
Listed are centers that did research in the listed areas at some point in time. Some continue their research efforts, others not. Researchers may also have moved to other centers. We list the year the last related publication appeared in press.
|
Center/Project |
Last pub |
People |
|
University of New South Wales |
2002 |
|
|
2002 |
|
|
|
|
2001 |
Zon Wei; Yuan Kui; Liu Jindong; Luo Bencheng |
|
Aachen University of Technology |
2001 |
Hermann Hienz, Britta Bauer, Kirsti Grobel, Karl-Friedrich Kraiss, Marcell Assan |
|
Nagoya Institute of Technology |
2001 |
Marcus Vinicius Lamar
(U Federal do Paraná, Brazil), Md.S.Bhuiyan, A.Iwata |
|
National Tsing-Hua University, Taiwan |
2001 |
Chung-Lin Huang, Ming-Shan Wu, Sheng-Hung Jeng |
|
Paris XI Orsay University |
2001 |
|
|
University
of Macedonia in Thessaloniki, Greece |
2001 |
|
|
2001 |
||
|
2001 |
Eun-Jung
Holden, Robyn Owens,
Geoffrey G. Roy |
|
|
|
2000 |
K.
Imagawa, Shan Lu, Seiji Igi, Hideaki Matsuo, Yuji Nagashima, Yuji Takata, and
Terutaka Teshima |
|
Hitachi Research Laboratory |
2000 |
Hirohiko Sagawa, Masaru Takeuchi, Masaru Ohki, Tomoko Sakiyama, Eiji Oohira, Hisashi Ikeda, Hiromichi Fujisawa |
|
2000 |
Wen Gao, Jiyong Ma, Jiangqin Wu Chunli Wang |
|
|
2000 |
Yuntao Cui, John J. Weng |
|
|
2000 |
Vivek A.
Sujan, Marco A.
Meggiolaro (PUC, Brazil) |
|
|
|
1999 |
Sugiyama, H.; Tanahashi, S.; Aoki, Y. - Seong-Hyo Shin; Sang-Woon Kim |
|
Kyoto Institute of Technology |
1999 |
Takao Kurokawa, Sumihiro Kawano, K. Senba |
|
University of Illinois at Urbana-Champaign Beckman Institute |
1999 |
|
|
1999 |
Franc Solina, Slavko Krapez, Ales Jaklic |
|
|
1998 |
Seiji Igi, Shan Lu, K. Imagawa, H. Matsuo, H. Sakato, Y.
Nagashima |
|
|
DalTech (Technical University of Nova Scotia) |
1998 |
Moussa Habib Abdallah, A.E. Marble, Charoensak
Charayaphan |
|
1998 |
Alex Pentland, Thad Starner, Joshua
Weaver |
|
|
National Taiwan University |
1998 |
Rung-Huei Liang, Ming Ouhyoung (Shih-Chien
University, Taiwan) |
|
Ohio State University |
1998 |
Chung-Lin Huang |
|
|
1997 |
Yamamoto,
Y., Uchida, Masafumi, Ide, Hideto |
|
|
1997 |
Henrik Birk, Thomas B. Moeslund, Claus B. Madsen |
|
KAIST, Korea, Electrical Engineering |
1997 |
Zeungnam Bien; Gyu-Tae Park; Won Jang (Agency for Devence
Development, Korea); Jong-Sung Kim |
|
University of Delaware |
1997 |
Roman Erenshteyn, Pavel Laskov, Richard A.
Foulds, Garland Stern, Lynn Messing |
|
University of
Essex, UK |
1997 |
G.J. Sweeney, Andy C. Dowton |
|
University of Tasmania |
1996 |
Peter Vamplew, Anthony
Adams |
|
|
1995 |
Yamaguchi T., Yoshihara M., Akiba M., Kuga M., Kanazawa N.
and Kamata K |
|
|
1995 |
Chung-Lin Huang; Wen-Yi Huang; Cheng-Chang Lien |
|
Ohio State University, Biomedical Engineering |
1995 |
M.B. Waldron, S. Kim |
|
|
1994 |
Hamilton J. and Micheli-Tzanakou E. |
|
|
1994 |
Geoff D. Roberts |
|
Simon Fraser University, Canada |
1994 |
Brigitte Dorner, Eli Hagen |
|
|
1993 |
A. Sutherland |
|
Raytheon Co |
1993 |
Elizabeth J. Wilson, Gretel Anspach |
|
University of Tokyo |
1993 |
Tosiyasu L. Kunii, Jintae Lee |
|
|
1988 |
S. Tamura, S.
Kawasaki |
|
|
1982 |
R. Harrison |
|
Sign Language |
|
|
American Sign Language Research Project at Boston U |
|
|
National Center for Sign Language and Gesture Resources |
http://www.bu.edu/asllrp/cslgr/ |
|
Communications Research Laboratory, Japan - Research and development on sign language recognition |
http://www2.crl.go.jp/jt/a131/research/univ-e.html |
|
Leiden U’s Sign Phonology Group Sign language sites on the WWW |
|
|
Sign Stream |
|
|
U of Hamburg’s International Bibliography of Sign Language |
http://www.sign-lang.uni-hamburg.de/bibweb/ |
|
Waleed’s page on Machine Gesture and Sign Language Recognition |
|
|
Beckman Institute’s Gesture Interpretation using Spatio-Temporal analysis – GIST |
http://vision.ai.uiuc.edu/mhyang/gist.html |
|
Animation |
|
|
DePaul University |
|
|
Signing avatar |
|
|
Visicast |
|
Transcription / Notation |
|
|
HamNoSys |
|
|
SignWriting |
|
|
U Hamburg Bibliography “Transcription / notation” |
http://www.sign-lang.uni-hamburg.de/bibweb/Lidat.acgi?KEYWORDALTID=459 |
|
Translation |
|
|
Visicast |
|
|
Penn’s TEAM |
|
|
The SASL Project |
|
Linguistics |
|
|
Kearsy Cormier |
|
|
|
|
|
Education |
|
|
ICICLE Project |
|
|
|
|
|
Software |
|
|
Intel OpenCV |
|
|
Entropic’s
HTK |
Computer Vision and Image
Understanding: CVIU
IJCV - International Journal of Computer Vision
IEEE Trans on Pattern Analysis and Machine Intelligence
International
Journal of Pattern Recognition and Artificial Intelligence (IJPRAI)
IEEE International Conference on Automatic Face and Gesture Recognition
IEEE Conference on Computer Vision and Pattern Recognition
IEEE International Conference on Computer Vision
International Workshop on Gesture and Sign Language based Human-Computer Interaction
Workshop on Human Motion
International Conference on Spoken Language Processing
So-qiwoa, a computational sign linguistics discussion list:
Send an email to <listserv@list.unm.edu> saying
"subscribe so-qiwoa-l
firstname lastname", example:
"subscribe so-qiwoa-l Laurent Clerc"
[Aggarwal and Cai(1999)]
J. K. Aggarwal and Q. Cai.
Human motion analysis: A review.
Computer and Vision Image Understanding, 73 (3): 428-440, March 1999.
http://www.idealibrary.com/servlet/doi/10.1006/cviu.1998.0744
Human motion analysis is receiving increasing attention from computer vision researchers. This interest is motivated by a wide spectrum of applications, such as athletic performance analysis, surveillance, man-machine interfaces, content-based image storage and retrieval, and video conferencing. This paper gives an overview of the various tasks involved in motion analysis of the human body. We focus on three major areas related to interpreting human motion: (1) motion analysis involving human body parts, (2) tracking a moving human from a single view or multiple camera perspectives, and (3) recognizing human activities from image sequences. Motion analysis of human body parts involves the low-level segmentation of the human body into segments connected by joints and recovers the 3D structure of the human body using its 2D projections over a sequence of images. Tracking human motion from a single view or multiple perspectives focuses on higher-level processing, in which moving humans are tracked without identifying their body parts. After successfully matching the moving human image from one frame to another in an image sequence, understanding the human movements or activities comes naturally, which leads to our discussion of recognizing human activities.
[Cedras and Shah(1995)]
C. Cedras and M. Shah.
Motion based recognition: A survey.
Image and Vision Computing, 13 (2): 129-155, 1995.
[Gavrila(1999)]
D. Gavrila.
The visual analysis of human movement: A survey.
Computer and Vision Image Understanding, 73 (1): 82-98, January 1999.
http://citeseer.nj.nec.com/400214.html
The ability to recognize humans and their activities by vision is key for a machine to interact intelligently and effortlessly with a human inhabited environment. Because of many potentially important applications, "Looking at People" is currently one of the most active application domains in computer vision. This survey identifies a number of promising applications and provides an overview of recent developments in this domain. The scope of this survey is limited to work on whole-body or hand motion; it does not include work on human faces. The emphasis is on discussing the various methodologies; they are grouped in 2-D approaches with or without explicit shape models and 3-D...
[Kohler and Schroeter(1998)]
M. Kohler and S. Schroeter.
A survey of video-based gesture recognition - stereo and mono systems.
Technical Report 693, Informatik VII, University of Dortmund, Germany, August 1998.
ftp://ls7-ftp.cs.uni-dortmund.de/pub/reports/ls7/1998/rr-693.ps.gz
[LaViola(1999)]
Joseph J.
LaViola, Jr.
A survey of hand posture and gesture recognition techniques and technology.
Technical Report CS-99-11, Department of Computer Science, Brown University, June 1999.
ftp://ftp.cs.brown.edu/pub/techreports/99/cs99-11.ps.Z
[Moeslund and Granum(2001)]
T. B. Moeslund and E. Granum.
A survey of computer vision-based human motion capture.
International Journal of Computer Vision, 81 (3), 2001.
http://citeseer.nj.nec.com/moeslund01survey.html
[Pavlovic et al.(1997)]
V. I. Pavlovic, R. Sharma, and T. S. Huang.
Visual interpretation of hand gestures for human-computer interaction: A review.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 19 (7): 677-695, July 1997.
http://computer.org/tpami/tp1997/i0677abs.htm
The use of hand gestures provides an attractive alternative to cumbersome interface devices for human-computer interaction (HCI). In particular, visual interpretation of hand gestures can help in achieving the ease and naturalness desired for HCI. This has motivated a very active research area concerned with computer vision-based analysis and interpretation of hand gestures. We survey the literature on visual interpretation of hand gestures in the context of its role in HCI. This discussion is organized on the basis of the method used for modeling, analyzing, and recognizing gestures. Important differences in the gesture interpretation approaches arise depending on whether a 3D model of the human hand or an image appearance model of the human hand is used. 3D hand models offer a way of more elaborate modeling of hand gestures but lead to computational hurdles that have not been overcome given the real-time requirements of HCI. Appearance-based models lead to computationally efficient "purposive" approaches that work well under constrained situations but seem to lack the generality desirable for HCI. We also discuss implemented gestural systems as well as other potential applications of vision-based gesture recognition. Although the current progress is encouraging, further theoretical as well as computational advances are needed before gestures can be widely used for HCI. We discuss directions of future research in gesture recognition, including its integration with other natural modes of human-computer interaction.
[Wu and Huang(1999)]
Y. Wu and T. S. Huang.
Vision-based gesture recognition: A review.
In A. Braffort, R. Gherbi, S. Gibet, J. Richardson, and D. Teil, editors, Proceedings of the Gesture Workshop, volume 1739 of Lecture Notes in Computer Science. Springer, 1999.
http://link.springer-ny.com/link/service/series/0558/papers/1739/17390103.pdf
[Abdallah(1998)]
M. Abdallah.
A neuro-hierarchical multilayer network in the translation of the american sign language.
In Proceedings of the IEEE Southeastcon:Region 3, pages 224-227, 1998.
http://ieeexplore.ieee.org/iel4/5503/14789/00673334.pdf
A neuro hierarchial approach based on an adaptive back-propagation algorithm is proposed. In a separate preprocessing step, the input and the output vector are generated using the encoded-sequence representation. The algorithm is applied to translate the American sign language from spoken words. Experimental results indicate that this approach results in fast convergence, stable learning and a relatively small network size when compared to traditional methods.
[Abe et al.(1994)]
M. Abe, H. Sakou, and H. Sagawa.
Sign language translation based on syntactic and semantic analysis.
Systems and Computers in Japan, 25, 1994.
[Akyol et al.(2000)]
S. Akyol, B. Bauer, and U. Canzler.
Gesture and mimic interpretation for sign language recognition.
In Proceedings of the 4th International Student Conference on Electrical Engineering, May 2000.
http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/v001_2000.pdf
[Aoki et al.(1998)]
Y. Aoki, R. Mitsumori, J. Li, and A. Burger.
Sign language communication between japanese-korean and japanese-portuguese using cg animation.
In IEEE International Conference on acoustics speech and signal processing, pages 3765-3768, 1998.
http://ieeexplore.ieee.org/iel4/5518/14898/00679703.pdf
We propose a sign language communication between different languages such as Japanese-Korean and Japanese-Portuguese using computer graphics (CG) animation of sign language based on the intelligent image communication method. For this purpose sign language animation is produced using gesture or text data expressing sign language. In the production process of CG animation of sign language, MATLAB and LIFO language are used, where MATLAB is useful for three-dimensional signal processing of gestures and for displaying animation of sign language. On the other hand the LIFO language, which is a descendant of the LISP and FORTH language families, is developed and used to produce live CG animations, resulting in a high-speed interactive system of designing and displaying sign language animations. A simple experiment was conducted to translate Japanese sign language into Korean and Portuguese sign languages using the developed CG animation system.
[Assan and Grobel(1997)]
M. Assan and K. Grobel.
Video-based sign language recognition using hidden markov models.
In Proceedings of the Gesture Workshop, volume 1371 of Lecture Notes in Artificial Intelligence, pages 97-110. Springer, 1997.
http://link.springer-ny.com/link/service/series/0558/bibs/1371/13710097.htm
This paper is concerned with the video-based recognition of signs. Concentrating on the manual parameters of sign language, the system aims for the signer dependent recognition of 262 different signs taken from Sign Language of the Netherlands . For Hidden Markov Modelling a sign is considered a doubly stochastic process, represented by an unobservable state sequence. The observations emitted by the states are regarded as feature vectors, that are extracted from video frames. This work deals with three topics: Firstly the recognition of isolated signs, secondly the influence of variations of the feature vector on the recognition rate and thirdly an approach for the recognition of connected signs. The system achieves recognition rates up to 94% for isolated signs and 73% for a reduced vocabulary of connected signs.
[Bauer(2001)]
B. Bauer.
Towards an automatic sign language recognition system using subunits.
In Proceedings of the Gesture Workshop, London, 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/bauer_gw01_02.pdf
Draft.
[Bauer and Hienz(2000)]
B. Bauer and H. Hienz.
Relevant features for video-based continuous sign language recognition.
In IEEE International Conference on Automatic Face and Gesture Recognition, Grenoble, France, 2000.
http://computer.org/proceedings/fg/0580/05800440abs.htm
[Bauer et al.(2000)]
B. Bauer, H. Hienz, and K.-F. Kraiss.
Video-based continuous sign language recognition using statistical methods.
In Proceedings of the International Conference on Pattern Recognition, volume II, pages 463-466, 2000.
http://www.computer.org/proceedings/icpr/0750/Volume%202/07502463abs.htm
[Bauer et al.(1999)]
B. Bauer, S. Niessen, and H. Hienz.
Towards an automatic sign language translation system.
In Proc. of the International Workshop on Physicality and Tangibility in Interaction: Towards New Paradigms for Interaction Beyond the Desktop, Siena, Italy, Oct. 1999.
[Birk et al.(1997)]
H. Birk, T. B. Moeslund, and C. B. Madsen.
Real-time recognition of hand alphabet gestures using principal component analysis.
In Scandinavian Conference on Image Analysis, 1997.
http://www.lut.fi/scia97/papers/paper131.htm
[Braffort(1996a)]
A. Braffort.
Argo: An architecture for sign language recognition and interpretation.
In P. Harling and A. Edwards, editors, Proceedings of the Gesture Workshop, pages 17-30. Springer, 1996.
[Braffort(1996b)]
A. Braffort.
A gesture recognition architecture for sign language.
In Second International ACM/SIGCAPH Conference on Assistive Technologies (ASSETS'96), pages 102-109, British Columbia, Canada, 1996.
http://www.acm.org/pubs/citations/proceedings/assets/228347/p102-braffort/
[Braffort(1997)]
A. Braffort.
A computer system dedicated to sign language.
In IEA'97, Tampere, Finland, 1997.
[Braffort et al.(1994)]
A. Braffort, C. Collet, and D. Teil.
Anthropomorphic model for hand gesture interface.
In M. Galer, S. Harker, and J. Ziegler, editors, Proceedings of ACM CHI'94 Conference on Human Factors in Computing Systems, 1994.
[Chapman(1998)]
R. N. Chapman.
Lexicon for computer translation of american sign language.
In Applications in Robotics, User Interfaces, and Natural Language Processing, volume 1458 of Lecture Notes in Computer Science. Springer, 1998.
http://link.springer-ny.com/link/service/series/0558/bibs/1458/14580033.htm
This work presents a method for translation of American Sign Language (ASL) to English using a feature-based lexicon, designed to exploit ASL's phonology by searching the lexicon for the sign's manual and non-manual information. Manual sign information consists of phonemes sig (movement), tab (location), dez (handshape), and ori (hand orientation), which we use as the ASL unit of analysis. Non-manual sign information consists of specific facial and body configurations. A camera acquires the sign and individual frames are analyzed and values assigned to the sig, tab, dez, and ori sign parameters as well as other sign features, for referencing during lexical search. ASL formational constraints are exploited to target specific image segments for analysis and linguistic constraints serve to further reduce the lexical search space. Primary keys for lexical search are sig and tab, the most discriminating sign features, followed by the remaining features, as necessary, until a single lexical entry is extracted from the lexicon. If a single lexical candidate cannot be determined, an exception is raised, signaling search failure. This method of using ASL phonological constraints to aid image analysis and lexical search process simplifies the task of sign identification.
[Charayaphan and Marble(1992)]
C. Charayaphan and A. Marble.
An image processing system for interpreting motion in american sign language.
Journal of Biomedical Engineering, 14 (15): 419-425, 1992.
[Cui et al.(1995)]
Y. Cui, D. L. Swets, and J. J. Weng.
Learning-based hand sign recognition using shoslif-m.
In Proceedings of the IEEE International Conference on Computer Vision, pages 631-636, Boston, MA, June 1995.
http://citeseer.nj.nec.com/cui95learningbased.html
[Cui and Weng(1999)]
Y. Cui and J. Weng.
A learning-based prediction-and-verification segmentation scheme for hand sign image sequence.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 21 (8): 798-804, Aug. 1999.
http://dlib.computer.org/tp/books/tp1999/pdf/i0798.pdf
We present a prediction-and-verification segmentation scheme using attention images from multiple fixations. A major advantage of this scheme is that it can handle a large number of different deformable objects presented in complex backgrounds. The scheme is also relatively efficient. The system was tested to segment hands in sequences of intensity images, where each sequence represents a hand sign in American Sign Language. The experimental result showed a 95 percent correct segmentation rate with a 3 percent false rejection rate.
[Cui and Weng(2000)]
Y. Cui and J. Weng.
Appearance-based hand sign recognition from intensity image sequences.
Computer and Vision Image Understanding, 78: 157-176, 2000.
http://www.cse.msu.edu/~weng/research/SHOSLIF-M-rec.ps
[Cui and Weng(1995)]
Y. Cui and J. J. Weng.
Learning-based hand sign recognition.
In IEEE International Conference on Automatic Face and Gesture Recognition, pages 201-206, Zurich, Switzerland, June 1995.
[Cui and Weng(1996a)]
Y. Cui and J. J. Weng.
Hand segmentation using learning-based prediction and verification for hand sign recognition.
In IEEE Conference on Computer Vision and Pattern Recognition, pages 88-93, San Francisco, CA, June 1996.
http://www.cps.msu.edu/~weng/research/HandCVPR96.ps
[Cui and Weng(1996b)]
Y. Cui and J. J. Weng.
View-based hand segmentation and hand-sequence recognition with complex backgrounds.
In Proceedings of the International Conference on Pattern Recognition, volume III, pages 617-621, Vienna, Austria, Aug. 1996.
http://www.cps.msu.edu/~weng/research/SegICPR96.ps
[Dorner(1993)]
B. Dorner.
Hand shape identification and tracking for sign language interpretation.
In International Joint Conference on Artificial Intelligence, Chambery, 1993.
[Dorner(1994)]
B. Dorner.
Chasing the colour glove: Visual hand tracking.
Master's thesis, Simon Fraser University, 1994.
ftp://fas.sfu.ca/pub/thesis/1994/BrigitteDornerMSc.ps
[Dorner and Hagen(1994)]
B. Dorner and E. Hagen.
Towards an american sign language interface.
Artificial Intelligence Review, 8 (2/3), 1994.
http://www.wkap.nl/oasis.htm/61598
[Erenshteyn et al.(1995)]
R. Erenshteyn, R. Foulds, L. Messing, G. Stern, and S. Galuska.
Handshape recognition-a step toward computer translation of american sign language into english.
In WCNN '95. World Congress on Neural Networks, volume 1, pages 216-19, Mahwah, NJ, USA, 1995. Lawrence Erlbaum Associates.
[Erenshteyn and Laskov(1996)]
R. Erenshteyn and P. Laskov.
A multi-stage approach to fingerspelling and gesture recognition.
In Proceedings of the Workshop on the Integration of Gesture in Language and Speech, pages 185-194, Wilmington, DE, 1996.
http://citeseer.nj.nec.com/73836.html
Human gesture analysis and recognition as many others recognition applications require the development and implementation of multiclass recognition system. We examine the application of two approaches to multiclass recognition: a hierarchical (multi-stage) neural network classifier and error-correcting codes together with collective decision making (mixture of formal "experts"). Applications that have been solved include the recognition of fingerspelled letters from American Sign Language and recognition of static handshapes. These applications are examples of multiclass problem - 26 and 77 classes respectively. We address the problem of finding a hierarchy of classifiers, optimal with respect to recognition accuracy, given a labeled corpus of signing samples. This problem stems from the necessity of recognizing a large number of classes for which a single neural network classifier requires prohibitively large training sets and infeasible training time. We outline the general framework of the problem of finding an optimal hierarchy and explain its relationship to the problems of decision tree inference and object clustering. Another approach involves Golay(23,12) code generation and its use for system outputs encoding. These codes have to satisfy some initial condi tions, such as minimum inter-code Hamming distance, number of classes and inter-digit correlation. Code redundancy digits are used to improve overall recognition accuracy. Coding has been combined with two-output neural networks that produced output codes. Collective decision making procedures have been applied. Higher accuracy and less training time are the main advantage of the developed approach. Both approaches have been compared on two problems of human gesture recognition.
[Erenshteyn et al.(1997)]
R. Erenshteyn, P. Laskov, and R. Foulds.
Human gesture recognition using neural networks and multi-class encoding.
In Proceedings of ANNIE'97 (Artificial Neural Networks in Engineering), volume 7, pages 549-554, St.Louis, MO, 1997.
http://www.cis.udel.edu/~laskov/publications/ANNIE-97.ps.gz
Human gesture analysis and recognition as many others recognition applications require the development and implementation of multiclass recognition system. We examine the application of two approaches to multiclass recognition: a hierarchical (multi-stage) neural network classifier and error-correcting codes together with collective decision making (mixture of formal "experts"). Applications that have been solved include the recognition of fingerspelled letters from American Sign Language and recognition of static handshapes. These applications are examples of multiclass problem - 26 and 77 classes respectively. We address the problem of finding a hierarchy of classifiers, optimal with respect to recognition accuracy, given a labeled corpus of signing samples. This problem stems from the necessity of recognizing a large number of classes for which a single neural network classifier requires prohibitively large training sets and infeasible training time. We outline the general framework of the problem of finding an optimal hierarchy and explain its relationship to the problems of decision tree inference and object clustering. Another approach involves Golay(23,12) code generation and its use for system outputs encoding. These codes have to satisfy some initial condi tions, such as minimum inter-code Hamming distance, number of classes and inter-digit correlation. Code redundancy digits are used to improve overall recognition accuracy. Coding has been combined with two-output neural networks that produced output codes. Collective decision making procedures have been applied. Higher accuracy and less training time are the main advantage of the developed approach. Both approaches have been compared on two problems of human gesture recognition.
[Erenshteyn et al.(1996a)]
R. Erenshteyn, P. Laskov, R. Foulds, L. Messing, and G. Stern.
Recognition approach to gesture language understanding.
In Proceedings of the International Conference on Pattern Recognition, Vienna, 1996.
http://www.cis.udel.edu/~laskov/publications/ICPR-96.ps.gz
We explore recognition implications of understanding gestural communication, having chosen American Sign Language as an example of a gestural language. An instrumented glove and specially developed software have been used for data collection and labeling. We address the problem of recognizing dynamic signing, i.e. signing performed at natural speed. Two neural network architectures have been used for recognition of different types of fingerspelled sentences. Experimental results are presented suggesting that two features of signing affect recognition accuracy: signing frequency which to a large extent can be accounted for by training a network on the samples of the respective frequency, and coarticulation effect which a network fails to identify. As a possible solution to coarticulation problem two post-processing algorithms for temporal segmentation are proposed and experimentally evaluated.
[Erenshteyn et al.(1996b)]
R. Erenshteyn, P. Laskov, R. Foulds, L. Messing, G. Stern, and S. Galuska.
Static and dynamic recognition of fingerspelled sentences.
In World Congress on Neural Networks, International Neural Network Society 1996 Annual Meeting, pages 368-71, Mahwah, NJ, USA, 1996. Lawrence Erlbaum Assoc.
http://www.cis.udel.edu/~laskov/publications/ANNIE-97.ps.gz
[Erenshteyn et al.(1994)]
R. Erenshteyn, L. Messing, R. A. Foulds, G. Stern, and S. Galuska.
Back propagation neural network for american sign language recognition.
In P. Werbos, H. Szu, and B. Widrow, editors, World Congress on Neural Networks, volume I, pages 405-409, San Diego, CA, USA, 1994. Erlbaum.
[Fang et al.(2001a)]
G. Fang, W. Gao, X. Chen, C. Wang, and J. Ma.
Signer-independent continuous sign language recognition based on srn/hmm.
In Proceedings of the Gesture Workshop, London, Apr. 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/SRNpaper_10.pdf
Draft.
[Fang et al.(2001b)]
G. Fang, W. Gao, and J. Ma.
Signer-independent sign language recognition based on sofm/hmm.
In IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pages 90-95, 2001.
http://ieeexplore.ieee.org/iel5/7480/20325/00938915.pdf
The aim of sign language recognition is to provide an efficient and accurate mechanism to transcribe sign language into text or speech. State-of-the-art sign language recognition should be able to solve the signer-independent problem for practical application. In this paper, a hybrid SOFM/HMM system, which combines self-organizing feature maps (SOFMs) with hidden Markov models (HMMs), is presented for signer-independent Chinese sign language recognition. We implement the SOFM/HMM sign recognition system. Meanwhile, results from the HMM-based system are provided as comparison. Experimental results show the SOFM/HMM system increases the recognition accuracy by 5% than the HMM-based one. Furthermore, a self-adjusting recognition algorithm is also proposed for improving the SOFM/HMM discrimination. When it is applied to the SOFM/HMM system it can improve the recognition accuracy by 1.9%. All experiments were performed in real-time with the dictionary size 208.
[Fujishige and Kurokawa(1997)]
E. Fujishige and T. Kurokawa.
Japanese processing for japanese-to-sign language translation via semantic network.
Human interface news and report, 12: 45-50, 1997.
[Gao et al.(2000a)]
W. Gao, J. Ma, S. Shan, X. Chen, W. Zheng, H. Zhang, J. Yan, and J. Wu.
Handtalker: A multimodal dialog system using sign language and 3-d virtual human.
In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third International Conference on Advances in Multimodal Interfaces, volume 1948 of Lecture Notes in Computer Science, pages 582-589, Oct. 2000.
http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480564.htm
In this paper, we describe HandTalker: a system we designed for making friendly communication reality between deaf people and normal hearing society. The system consists of GTS (Gesture/Sign language To Spoken language) part and STG (Spoken language To Gesture/Sign language) part. GTS is based on the technology of sign language recognition, and STG is based on 3D virtual human synthesis. Integration of the sign language recognition and 3D virtual human techniques greatly improves the system performance. The computer interface for deaf people is data-glove, camera and computer display, and the interface for hearing-abled is microphone, keyboard, and display. HandTalker now can support no domain limited and continuously communication between deaf and hearing-abled Chinese people.
[Gao et al.(2000b)]
W. Gao, J. Ma, J. Wu, and C. Wang.
Sign language recognition based on hmm/ann/dp.
International Journal of Pattern Recognition and Artificial Intelligence, 14 (5), Aug. 2000.
http://www.wspc.com.sg/profiles/anncat/anncat/jnlarticle/ijpraiv14n5/S0218001400000386.pdf
In this paper, a system designed for helping the deaf to communicate with others is presented. Some useful new ideas are proposed in design and implementation. An algorithm based on geometrical analysis for the purpose of extracting invariant feature to signer position is presented. An ANN-DP combined approach is employed for segmenting subwords automatically from the data stream of sign signals. To tackle the epenthesis movement problem, a DP-based method has been used to obtain the context-dependent models. Some techniques for system implementation are also given, including fast matching, frame prediction and search algorithms. The implemented system is able to recognize continuous large vocabulary Chinese Sign Language. Experiments show that proposed techniques in this paper are efficient on either recognition speed or recognition performance.
[Gibet et al.(1996)]
S. Gibet, A. Braffort, C. Collet, F. Forest, R. Gherbi, and T. Lebourque.
Gesture in human-machine communication: Capture, analysis-synthesis, recognition, semantics.
In P. Harling and A. Edwards, editors, Proceedings of the Gesture Workshop, pages 89-95. Springer, 1996.
[Gibet et al.(1997)]
S. Gibet, J. Richardson, T. Lebourque, and A. Braffort.
Corpus of 3d natural movements and sign language primitives of movement.
In I. Wachsmuth and M. Fröhlich, editors, Proceedings of the Gesture Workshop, volume 1371 of Lecture Notes in Computer Science, pages 111-121, Bielefeld, Germany, 1997. Springer.
This paper describes the development of a corpus or database of hand-arm pointing gestures, considered as a basic element for gestural communication. The structure of the corpus is defined for natural pointing movements carried out in different directions, heights and amplitudes. It is then extended to movement primitives habitually used in sign language communication. The corpus is based on movements recorded using an optoelectronic recording system that allows the 3D description of movement trajectories in space. The main technical characteristics of the capture and pretreatment system are presented, and perspectives are highlighted for recognition and generation purposes.
[Grobel(1994)]
K. Grobel.
Recognition of fingerspelling from video.
In Proceedings of the Sixth Biennial Conference of the International Society for Augmentative and Alternative Communication, pages 282-286, 1994.
[Grobel and Assan(1997)]
K. Grobel and M. Assan.
Isolated sign language recognition using hidden markov models.
In IEEE International Conference on Systems, Man & Cybernetics, pages 162-167, Piscataway, NJ, 1997.
http://ieeexplore.ieee.org/iel3/4942/13619/00625742.pdf
This paper is concerned with the video-based recognition of isolated signs. Concentrating on the manual parameters of sign language, the system aims for the signer dependent recognition of 262 different signs. For hidden Markov modelling a sign is considered a doubly stochastic process, represented by an unobservable state sequence. The observations emitted by the states are regarded as feature vectors, that are extracted from video frames. The system achieves recognition rates up to 94%.
[Grobel and Hienz(1996a)]
K. Grobel and H. Hienz.
Fuzzy video-based handshape recognition.
In K. e. a. George, editor, Proceedings of the 1996 ACM Symposium on Applied Computing, pages 614-618, Philadelphia, Feb. 1996.
[Grobel and Hienz(1996b)]
K. Grobel and H. Hienz.
Video-based handshape recognition using a handshape structure model in real-time.
In Proceedings of the International Conference on Pattern Recognition, pages 446-450, Wien, 1996.
[Grobel and Hienz(1996c)]
K. Grobel and H. Hienz.
Video-based recognition of fingerspelling in real-time.
In T. Lehmann, I. Scholl, and K. Spitzer, editors, Proceedings des Aachener Workshops am Bildverarbeitung für die Medizin. Algorithmen. Systeme. Anwendungen, pages 197-202, 1996.
http://citeseer.nj.nec.com/118789.html
[Hagen(1994)]
E. Hagen.
A flexible american sign language interface to deductive databases.
Master's thesis, Computer Science, Simon Fraser University, 1994.
ftp://fas.sfu.cs/pub/theses/1994/EliHagenMSc.ps
[Hamilton and Micheli-Tzanakou(1994)]
J. Hamilton and E. Micheli-Tzanakou.
Alopex neural networks for manual alphabet recognition.
In Proceedings of the 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 1994.
http://ieeexplore.ieee.org/iel4/3230/9234/00415347.pdf
Alopex and backpropagation were used to train neural networks to recognize signs from the American Manual Alphabet. In many cases, the resulting networks gave comparable performance. The Alopex optimization technique did not converge to low error percentages during training as well as backpropagation did; backpropagation gave poorer performance on networks with a small number of hidden nodes.
[Harrison(1982)]
R. Harrison.
Computer representation of deaf sign language.
In D. S. Greenaway and E. Warman, editors, Eurographics '82 : proceedings of the international conference and exhibition, pages 303-308, Manchester, UK, 1982. North-Holland Publ. Co.
[Hienz and Bauer(1999)]
H. Hienz and B. Bauer.
Sign language recognition based on statistical methods.
In Bi-Annual Report - Department of Technical Computer Science 1997/1998, pages 34-37. Shaker, 1999.
[Hienz and Grobel(1998)]
H. Hienz and K. Grobel.
An automatic video-based sign language recognition system as part of a sign printing system.
In P. Kopacek, editor, IEEE International Conference on Intelligent Engineering Systems, pages 163-168, 1998.
[Hienz et al.(1996)]
H. Hienz, K. Grobel, and G. Beckers.
Video-based handshape recognition using artificial neural networks.
In H.-J. Zimmermann, editor, Fourth European Congress on Intelligent Techniques and Soft Computing, pages 1659-1663, Aachen, Germany, Sept. 1996. Wissenschaftsverlag.
[Hienz and Kraiss(1999)]
H. Hienz and K.-F. Kraiss.
Hmm-based continuous sign language recognition using stochastic grammars.
In Proceedings of the Gesture Workshop, volume 1739 of Lecture Notes in Artificial Intelligence, Gif-sur-Yvette, France, Mar. 1999. Springer.
http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/V001_1999.pdf
[Hienz et al.(1999)]
H. Hienz, K.-F. Kraiss, and B. Bauer.
Continuous sign language recognition using hidden markov models.
In Y. Tang, editor, The Second International Conference on Multimodal Interface, pages IV10--IV15, Hong Kong, China, 1999.
http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/V002_1999.pdf
[Holden and Roy(1992)]
E. Holden and G. Roy.
The graphical translation of english text into signed english in the hand sign translator system.
Computer Graphics Forum (Eurographics'92), 11 (3): C357--C366, 1992.
http://www.cs.uwa.edu.au/~eunjung/mypubs/GForumpaper.ps.gz
Signed English is a manual interpretation of English using fingerspelling and signs. A prototype of the Hand Sign Translator (HST) system was developed to graphically translate English into Signed English, using two-handed animation. The HST consists of a practical interface that aims to help users learn Signed English, and the translation process where English text is transformed into a series of images that represent corresponding signs. This paper describes the translation process which involves two stages; the input environment and the animation process. The input environment consists of text analysis in order to extract corresponding kinematic data from the database, named English-Sign ...
[Holden(1991)]
E. J. Holden.
Graphical representation of hand movement as in deaf sign language: The hand sign translator system.
Master's thesis, University of Western Australia, 1991.
[Holden(1997)]
E.-J. Holden.
Visual Recognition of Hand Motion.
PhD thesis, University of Western Australia, 1997.
http://www.cs.uwa.edu.au/~eunjung/mypubs/PhDThesis.ps.gz
[Holden and Owens(2001)]
E. J. Holden and R. Owens.
Visual sign language recognition.
In R. Klette, T. Huang, and G. Gimen'garb, editors, 10th International Workshop on Theoretical Foundations of Computer Vision, volume 2032 of Lecture Notes in Computer Science. Springer, Mar. 2001.
http://link.springer.de/link/service/series/0558/bibs/2032/20320270.htm
Also in Proceedings of DICTA'99 (Digital Image Computing: Techniques & Applications), pp. 275-279.
Automatic gesture recognition systems generally require two separate processes: a motion sensing process where some motion features are extracted from the visual input; and a classification process where the features are recognised as gestures. We have developed the Hand Motion Understanding (HMU) system that uses the combination of a 3D model-based hand tracker for motion sensing and an adaptive fuzzy expert system for motion classification. The HMU system understands static and dynamic hand signs of the Australian Sign Language (Auslan). This paper presents the hand tracker that extracts 3D hand configuration data with 21 degrees-of-freedom (DOFs) from a 2D image sequence that is captured from a single viewpoint, with the aid of a colour-coded glove. Then the temporal sequence of 3D hand configurations detected by the tracker is recognised as a sign by an adaptive fuzzy expert system. The HMU system was evaluated with 22 static and dynamic signs. Before training the HMU system achieved 91% recognition, and after training it achieved over 95% recognition.
[Holden et al.(1999a)]
E.-J. Holden, R. Owens, and G. G. Roy.
3d hand tracker for visual sign recognition.
Technical Report 99/2, Department of Computer Science, The University of Western Australia, 1999.
http://citeseer.nj.nec.com/263562.html
[Holden et al.(1999b)]
E.-J. Holden, R. Owens, and G. G. Roy.
Adaptive fuzzy expert system for sign recognition.
Technical Report 99/3, Department of Computer Science, The University of Western Australia, 1999.
http://citeseer.nj.nec.com/holden99adaptive.html
[Holden et al.(1996)]
E. J. Holden, G. G. Roy, and R. Owens.
Hand movement classification using an adaptive fuzzy expert system.
International Journal of Expert Systems, 9 (4): 465-480, 1996.
http://www.cs.uwa.edu.au/~eunjung/mypubs/IJESpaper.ps.gz
[Huang and Huang(1998)]
C. Huang and W. Huang.
Sign language recognition using model-based tracking and a 3d hopfield neural-network.
Machine Vision and Applications, 10 (5/6): 292-307, 1998.
http://link.springer.de/link/service/journals/00138/bibs/8010005/80100292.htm
[Huang et al.(1992)]
C.-L. Huang, C. L. Cheng, and L. L. Pau.
Chinese sign language interpretation through motion and shape analysis.
In V. Srinivasa, O. S. Heng, and A. Y. Hock, editors, Proceedings of the 2nd Singapore International Conference on Image Processing, pages 576-580, 1992.
[Huang et al.(1995)]
C.-L. Huang, W.-Y. Huang, and C.-C. Lien.
Sign language recognition using 3-d hopfield neural network.
In Proceedings International Conference on Image Processing, pages 611-614, 1995.
http://ieeexplore.ieee.org/iel3/4052/11607/00537553.pdf
This paper presents a sign language recognition system which consists of three modules: model-based hand tracking, feature extraction, and gesture recognition using a 3-D Hopfield neural network. In the experiments, we illustrate that this system can recognize 15 different gestures accurately.
[Imagawa et al.(1997)]
K. Imagawa, S. Lu, and S. Igi.
Tracking hands by skin color from sign language image sequences.
In Proc. of 54th IPSJ Annual Conference, pages 249-250, 1997.
[Imagawa et al.(1998a)]
K. Imagawa, S. Lu, and S. Igi.
Color-based hands tracking system for sign language recognition.
In IEEE International Conference on Automatic Face and Gesture Recognition, pages 462-467, 1998.
http://dlib.computer.org/conferen/fg/8344/pdf/83440462.pdf
[Imagawa et al.(1998b)]
K. Imagawa, S. Lu, and S. Igi.
Real-time tracking of human hands from a sign-language image sequence.
In The Third Asian Conference on Computer Vision '98, 1998., 1998.
[Imagawa et al.(2000)]
K. Imagawa, H. Matsuo, R. Taniguchi, D. Arita, S. Lu, and S. Igi.
Recognition of local features for camera-based sign language recognition system.
In Proceedings of the International Conference on Pattern Recognition, pages 849-853, 2000.
http://www.computer.org/proceedings/icpr/0750/Volume 4/07504849abs.htm
[Kadous(1995)]
M. W. Kadous.
Grasp - recognition of australian sign language using instrumented gloves.
Bachelor's thesis, University of New South Wales, Oct. 1995.
http://www.cse.unsw.edu.au/~waleed/thesis/
[Kadous(1996)]
M. W. Kadous.
Machine recognition of auslan signs using powergloves: Towards large-lexicon recognition of sign language.
In L. Messing, editor, Proceedings of WIGLS. The Workshop on the Integration of Gesture in Language and Speech, pages 165-174, Oct. 1996.
http://www.cse.unsw.edu.au/~waleed/paper-wigls/paper-wigls.html
[Kadous(1998a)]
M. W. Kadous.
Auslan sign recognition using computers and gloves.
In Deaf Studies Research Symposium, 1998.
http://www.cse.unsw.edu.au/~waleed/phd/auslrec2.ps
[Kadous(1998b)]
M. W. Kadous.
A general architecture for supervised classification of multivariate time series.
Technical Report UNSW-CSE-TR-9806, Department of Artificial Intelligence, School of Computer Science & Engineering, University of New South Wales, Sept. 1998.
ftp://ftp.cse.unsw.edu.au/pub/users/waleed/tr9806.ps.gz
[Kadous(1999)]
M. W. Kadous.
Learning comprehensible descriptions of multivariate time series.
In Proceedings of the International Conference on Machine Learning, pages 454-463. Morgan Kaufmann, San Francisco, CA, 1999.
http://citeseer.nj.nec.com/kadous99learning.html
[Kadous(2002a)]
M. W. Kadous.
Expanding the scope of concept learning using metafeatures. Submitted paper.
In Proceedings of the International Conference on Machine Learning, 2002.
http://www.cse.unsw.edu.au/~waleed/phd/paper.pdf
We present a general automated preprocessing technique called metafeatures. Using metafeatures, the scope of traditional propositional attribute-value learning is expanded to domains that do not normally fit in the propositional model. These are domains that contain instances that have some kind of recurring substructure, such as strokes in handwriting recognition, or local maxima in time series data. Metafeatures are applied to three domains: sign language recognition, ECG classification and Chinese handwriting recognition. Using metafeatures we are able to generate classifiers that are both comprehensible and accurate, producing results that are comparable to hand-crafted feature extraction and in one case comparable to human experts.
[Kadous(2002b)]
M. W. Kadous.
Temporal Classification: Extending the Classifiation Paradigm to Multivariate Time Series.
PhD thesis, The University of New South Wales, School of Computer Science and Engineering, Jan. 2002.
http://www.cse.unsw.edu.au/~waleed/phd/phd.pdf
[Kamata et al.(1989)]
K. Kamata, T. Yoshida, M. Watanabe, and Y. Usui.
An approach to japanese-sign language translation system.
In IEEE International Conference on Systems, Man and Cybernetics, volume 3, pages 1089-1090, 1989.
http://ieeexplore.ieee.org/iel2/861/2458/00071466.pdf
A method was developed for translating written (spoken) language (text in Japanese) into a sequence of sign words of Japanese sign language called Simultaneous Japanese Sign Language (SJSL). The Japanese-to-Sign translation system is described that uses a Japanese-to-Sign translation dictionary with about 250 signs.
[Kang and Park()]
S. H. Kang and S. H. Park.
Toward korean text-to-sign language translation system (test).
http://citeseer.nj.nec.com/34981.html
[Kim et al.(1999a)]
J. Kim, A. Hasimoto, Y. Aoki, and A. Burger.
Design of a sign-language translation system between the japanese-korean by java-lifo language.
In Proceedings of the IEEE Region 10 Conference TENCON 99, volume 1, pages 423-426, 1999.
[Kim et al.(1996)]
J.-S. Kim, W. Jang, and Z. Bien.
A dynamic gesture recognition system for the korean sign language (ksl).
IEEE Transactions on Systems, Man and Cybernetics, Part B, 26 (2): 354-359, Apr. 1996.
http://ieeexplore.ieee.org/iel1/3477/10374/00485888.pdf
The sign language is a method of communication for the deaf-mute. Articulated gestures and postures of hands and fingers are commonly used for the sign language. This paper presents a system which recognizes the Korean sign language (KSL) and translates into a normal Korean text. A pair of data-gloves are used as the sensing device for detecting motions of hands and fingers. For efficient recognition of gestures and postures, a technique of efficient classification of motions is proposed and a fuzzy min-max neural network is adopted for on-line pattern recognition.
[Kim et al.(1999b)]
S.-W. Kim, J.-W. Lee, J.-Y. Oh, and Y. Aoki.
Development of a sign-language communication system between korea and japan through a 3d character model on internet.
In International Conference on Image Processing, 1999.
[Klima and Bellugi(1979)]
E. Klima and U. Bellugi.
The Signs of Language.
Harvard University Press, 1979.
[Kurokawa et al.(1993)]
T. Kurokawa, T. Morichi, and S. Watanabe.
Bidirectional translation between sign language and japanese for communication with deaf-mute people.
In G. Salvendy and
M. J. Smith, editors, Human-Computer Interaction: A: Applications & Case
Studies; B: Software & Hardware
Interfaces., pages 1109-1114. Elsevier, 1993.
[Lamar(2001)]
M. Lamar.
Hand Gesture Recognition using T-CombNET - A Neural Network dedicated to Temporal Information Processing.
PhD thesis, Nagoya Institute of Technology, Japan, Mar. 2001.
http://www.eletrica.ufpr.br/lamar/public/mvlamar.pdf
[Lamar et al.(1998)]
M. Lamar, M. Bhuiyan, and A. Iwata.
From hand sign to japanese hiragana alphabet recognition using principal component analysis and neural networks.
In Proc. of 12th Conference of the Japan Biomedical Society, pages 162-165, Niigata, Japan, 1998.
http://www.eletrica.ufpr.br/lamar/public/cjbs98.pdf
[Lamar et al.(1999a)]
M. Lamar, M. Bhuiyan, and A. Iwata.
Hand alphabet recognition using principal component analysis and neural networks.
In Proc. of International Joint Conference on Neural Networks, volume 4, pages 2838-2844, Washington, USA, July 1999.
http://www.eletrica.ufpr.br/lamar/public/ijcnn99.pdf
[Lamar et al.(1999b)]
M. Lamar, M. Bhuiyan, and A. Iwata.
Hand gesture recognition using morphological principal component analysis and an improved combnet-ii.
In Proc. of IEEE International Conference on System, Man, and Cybernetics, volume IV, pages 57-62, Tokyo, Japan, Oct. 1999.
http://www.eletrica.ufpr.br/lamar/public/smc99.pdf
[Lamar et al.(1999c)]
M. Lamar, M. Bhuiyan, and A. Iwata.
Japanese finger spelling recognition using t-combnet: A new neural network model.
In Proc. of 13th Conference of the Japan Biomedical Society, page 103, Osaka, Japan, Oct. 1999.
http://www.eletrica.ufpr.br/lamar/public/cjbs99.pdf
[Lamar et al.(1999d)]
M. Lamar, M. Bhuiyan, and A. Iwata.
Temporal series recognition using a new neural network structure t-combnet.
In Proc. of IEEE International Conference on Neural Information Processing, volume III, pages 1112-117, Perth, Australia, Nov. 1999.
http://www.eletrica.ufpr.br/lamar/public/iconip99.pdf
[Lamar et al.(2000a)]
M. Lamar, M. Bhuiyan, and A. Iwata.
Hand gesture recognition using t-combnet: A new neural network model.
IEICE Trans. on Information and Systems, E83-D (11): 1986-1995, Nov. 2000.
http://www.eletrica.ufpr.br/lamar/public/e83-d_11_1986.pdf
[Lamar et al.(2000b)]
M. Lamar, M. S. Bhuiyan, and A. Iwata.
T-combnet - a neural network dedicated to hand gesture recognition.
In S.-W. Lee, H. Buelthoff, and T. Poggio, editors, Proceedings First IEEE International Workshop, volume 1811 of Lecture Notes in Computer Science, pages 613-622, Seoul, Korea, May 2000.
http://www.eletrica.ufpr.br/lamar/public/bmcv2000.pdf
[Lamar et al.(2001)]
M. Lamar, S. Wysoski, and A. Iwata.
User independent japanese kana finger spelling translation using neural networks.
In 40th Conference of the Japan Society of Medical Electronics and Biological Engineering, Nagoya, Japan, 2001.
http://www.eletrica.ufpr.br/lamar/public/ME2001_4451.pdf
[Lee et al.(1997)]
C.-S. Lee, Z. Bien, G.-T. Park, W. Jang, J.-S. Kim, and S.-K. Kim.
Real-time recognition system of korean sign language based on elementary components.
In IEEE International Conference on Fuzzy Systems Proceedings, pages 1463-1468, 1997.
http://ieeexplore.ieee.org/iel3/4864/13484/00619759.pdf
Sign language is a method of communication for deaf persons. In communication using hand gesture, sign words and manual alphabets are used together. In this paper a system is proposed, which recognizes Korean sign Language (KSL). KSL is composed of Korean sign words and Korean manual alphabets continuously. To recognize meanings of continuous gestures which have no token of beginning and end, this system segments current motion states using speed and change of speed in motions and state automata. To understand the meaning of a gesture, basic component classifiers using fuzzy min-max neural network and fuzzy logic are used. Basic elements of meaning used in this system are 14 hand directions, 23 hand postures, and 14 hand orientations. Meaning of signed gesture is interpreted using basic elements which were recognized by 3 classifiers. This system recognizes 31 Korean manual alphabets and 131 Korean signs in real-time with recognition rate 96.7% for Korean manual alphabets and 94.3% for Korean sign words, excluding no recognition case.
[Lee and Kunii(1992)]
J. Lee and T. L. Kunii.
Visual translation: From native language to sign language.
In Proc. of the 1992 IEEE Workshop on Visual Languages, pages 103-109, Seattle, WA, 1992.
[Lee and Kunii(1993)]
J. Lee and T. L. Kunii.
Computer animated visual translation from natural language to sign language.
The Journal of Visualization and Computer Animation, 4 (2): 63-78, April - June 1993.
[Lee and T.L.(1994)]
J. Lee and K. T.L.
Generation and recognition of sign language using graphic models.
In H. Yamada, Y. Kambayashi, and S. Ohta, editors, Proceedings of the IISF/ACM Japan International Symposium. Computers as our Better Partners, pages 96-103, Tokyo, Japan, 1994. World Scientific.
[Lejeune et al.(2001)]
F. Lejeune, A. Braffort, and J.-P. Desclés.
Study on semantic representations of french sign language sentences.
In Proceedings of the Gesture Workshop, London, Apr. 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/GW%20Lejeune.pdf
Draft
[Liang and Ouhyoung(1997)]
R. Liang and M. Ouhyoung.
A real-time continuous gesture interface for taiwanese sign language.
Submitted to UIST, 1997.
http://www.cmlab.csie.ntu.edu.tw/~f1506028/publication/uist97.ps.gz
[Liang and Ouhyoung(1998)]
R. Liang and M. Ouhyoung.
A real-time continuous gesture recognition system for sign language.
In IEEE International Conference on Automatic Face and Gesture Recognition, pages 558-567, 1998.
http://ieeexplore.ieee.org/iel4/5501/14786/00671007.pdf
A large vocabulary sign language interpreter is presented with real-time continuous gesture recognition of sign language using a data glove. Sign language, which is usually known as a set of natural language with formal semantic definitions and syntactic rules, is a large set of hand gestures that are daily used to communicate with the hearing impaired. The most critical problem, end-point detection in a stream of gesture input is first solved and then statistical analysis is done according to four parameters in a gesture: posture, position, orientation, and motion. The authors have implemented a prototype system with a lexicon of 250 vocabularies and collected 196 training sentences in Taiwanese Sign Language (TWL). This system uses hidden Markov models (HMMs) for 51 fundamental postures, 6 orientations, and 8 motion primitives. In a signer-dependent way, a sentence of gestures based on these vocabularies can be continuously recognized in real-time and the average recognition rate is 80.4%.
[Liang and Ouhyoung(1995)]
R.-H. Liang and M. Ouhyoung.
A real-time continuous alphabetic sign language to speech conversion vr system.
Computer Graphics Forum - EUROGRAPHICS '95, 14 (3): 67-76, 1995.
[Liang and Ouhyoung(1996)]
R.-H. Liang and M. Ouhyoung.
A sign language recognition system using hidden markov model and context sensive search.
In Proc. of the ACM Symposium on Virtual Reality and Software Technology, pages 59-66, Hong Kong, July 1996.
http://www.cmlab.csie.ntu.edu.tw/~f1506028/publication/vrst96.ps.gz
[Lu et al.(1996)]
S. Lu, S. Igi, and H. Matsuo.
A jsl conversational character system with nonverbal information - recognition of jsl, detection of gazing line and user position.
In Proc. of 2nd Symposium on Intelligent Information Media, pages 69-74, Dec. 1996.
[Lu et al.(1997a)]
S. Lu, S. Igi, and H. Matsuo.
Development of sign language-voice conversation support system.
Human with Technology, (15): 85-94, Dec. 1997.
[Lu et al.(1997b)]
S. Lu, S. Igi, H. Matsuo, and Y. Nagashima.
Towards a dialogue system based on recognition and synthesis of japanese sign language.
In I. Wachsmuth and M. Fröhlich, editors, Proceedings of Gesture Workshop, number 1371 in Lecture Notes in Computer Science, Bielefeld, Germany, 1997. Springer.
http://link.springer-ny.com/link/service/series/0558/bibs/1371/13710259.htm
[Lu et al.(1997c)]
S. Lu, K. Imagawa, and S. Igi.
An active gazing-line generation system for improving sign-language conversation.
In Proceedings of the Seventh International Conference on Human-Computer Interaction (HCI International '97), volume 2, pages 283-6, Amsterdam, Netherlands, 1997. Elsevier.
[Lu et al.(1997d)]
S. Lu, K. Imagawa, and S. Igi.
An gazing-line generation system for improving sign language conversation.
In Proc. of 7th International Conference on
Human-Computer Interaction, volume
2, pages 283-286, California, USA, Aug. 1997.
[Ma et al.(2000a)]
J. Ma, W. Gao, and R. Wang.
A parallel multistream model for integration of sign language recognition and lip motion.
In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third International Conference on Advances in Multimodal Interfaces, volume 1948 of Lecture Notes in Computer Science, pages 582-589, Oct. 2000.
http://link.springer.de/link/service/series/0558/bibs/1948/19480582.htm
[Ma et al.(2000b)]
J. Ma, W. Gao, J. Wu, and C. Wang.
A continuous chinese sign language recognition system.
In IEEE International Conference on Automatic Face and Gesture Recognition, 2000.
http://computer.org/proceedings/fg/0580/05800428abs.htm
In this paper, we describe a system for recognizing both the isolated and continuous Chinese Sign Language (CSL) using two Cybergloves and two 3SAPCE-position trackers as gesture input devices. To get robust gesture features, each joint-angle collected by Cybergloves is normalized. The relative position and orientation of the left hand to those of the right hand are proposed as the signer position independent features. To speed up the recognition process, a fast match and a frame predicting techniques are proposed. To tackle epenthesis movement problem, context-dependent models are obtained by the Dynamic Programming (DP) technique. HMMs are utilized to model basic word units. Then we describe training techniques of the bigram language model and the search algorithm used in our baseline system. The baseline system converts sentence level gestures into synthesis speech and gestures of 3D virtual human synchronously. Experiments show that these techniques are efficient both in recognition speed and recognition performance.
[Matsuo et al.(1997)]
H. Matsuo, S. Igi, S. Lu, Y. Nagashima, Y. Takata, and T. Teshima.
The recognition algorithm with non-contact for japanese sign language using morphological analysis.
In I. Wachsmuth and M. Fröhlich, editors, Proceedings of Gesture Workshop, number 1371 in Lecture Notes in Computer Science, Bielefeld, Germany, 1997. Springer.
http://link.springer.de/link/service/series/0558/bibs/1371/13710273.htm
This paper documents the recognition method of deciphering Japanese sign language(JSL) using projected images. The goal of the movement recognition is to foster communication between hearing impaired and people capable of normal speech. We uses a stereo camera for recording three-dimensional movements, a image processing board for tracking movements, and a personal computer for an image processor charting the recognition of JSL patterns. This system works by formalizing the space area of the signers according to the characteristics of the human body, determining components such as location and movements, and then recognizing sign language patterns. The system is able to recognize JSL by determining the extent of similarities in the sign field, and does so even when vibrations in hand movements occur and when there are differences in body build. We obtained useful results from recognition experiments in 38 different JSL in two signers.
[Messing et al.(1994)]
L. Messing, R. Erenshteyn, R. Foulds, S. Galuska, and G. Stern.
American sign language computer recognition: Its present and its promise.
In Conf. of the Intl. Society for Augmentative and Alternative Communication, pages 289-291, 1994.
[Ohira et al.(1995)]
E. Ohira, H. Sagawa, T. Sakiyama, and M. Ohki.
A segmentation method for sign language recognition.
IEICE Transactions on Information and Systems, E78-D (1): 49-57, 1995.
[Ohki et al.(1995)]
M. Ohki, H. Sagawa, and N. Hataoka.
Sign language translation system using pattern recognition and synthesis.
Hitachi Review, 44 (4), 1995.
[Ohki et al.(1994)]
M. Ohki, H. Sagawa, T. Sakiyama, E. Oohira, H. Ikeda, and H. Fujisawa.
Pattern recognition and synthesis for sign language translation system.
In First Annual ACM Conference on Assistive Technologies, Hearing Impairments, pages 1-8, 1994.
http://www.acm.org/pubs/citations/proceedings/assets/191028/p1-ohki/
[Pashaloudi(2001)]
V. N. Pashaloudi.
On greek sign language alphabet character recognition: Using back-propagation neural networks.
In Proccedings of the 5th Hellenic European Research on Computer Mathematics & its Applications (HERCMA) Conference, Athens, Greece, Sept. 2001.
[Prime(1995)]
M. Prime.
Integrating sign language into a virtual reality environments.
In IEE Colloquium on Visualisation of Three-Dimensional Fields, pages 5/1-5/4, 1995.
http://ieeexplore.ieee.org/iel3/3672/10797/00498895.pdf
In a multi-user distributed virtual reality environment, such as DIVE (Distributed Interactive Virtual Environment) where the various participants each have a 3-D representation, a model for interaction is necessary. The Spatial Interaction Model describes how objects should interact with each other in the virtual environment. This model has since been extended to cover more interactions and is described in the paper. Language and gesture can be coordinated to form a single communication system more powerful than either alone. Hand gesture in particular is critical for fast interactive human to human communication. A VR system that allows a sign language channel and thus also a gestural channel will have an enriched communication medium. The main concept of this paper is that in a virtual reality environment it is possible to support users with different capabilities for interaction. The factors considered are whether the user can hear or not; their ability to use sign language; and the technology they have at their disposal on entry into the VR environment. A user is thus represented in the virtual environment as an entity with those capabilities available to it.
[Roberts(1994)]
G. D. Roberts.
Statistical pettern classification in computer recognition of sign language.
Master's thesis, Department of Computer Science, Royal Melbourne Institute of Technology, May 1994.
[Safar and Marshall(2001)]
E. Safar and I. Marshall.
The architecture of an english-text-to-sign-languages translation system.
In G. A. et al, editor, Recent Advances in Natural Language Processing (RANLP), pages 223-228, Tzigov Chark Bulgaria, 2001.
http://www.visicast.sys.uea.ac.uk/Papers/confbulgarianew.pdf
[Sagawa et al.(1994)]
H. Sagawa, T. Sakiyama, E. Oohira, H. Sakou, and M. Abe.
Prototype sign language translation system.
In Proceedings of the IISF/ACM Japan International Symposium. Computers as our Better Partners, pages 152-3, Singapore, 1994. World Scientific.
[Sagawa and Takeuchi(1999)]
Hirohiko Sagawa and Masaru Takeuchi.
A method for analyzing spatial relationships between words in sign language recognition.
In Proceedings of the Gesture Workshop, volume 1739 of Lecture Notes in Computer Science, pages 197-209, 1999.
http://link.springer-ny.com/link/service/series/0558/bibs/1739/17390197.htm
There are expressions using spatial relationships in sign language that are called directional verbs. To understand a sign-language sentence that includes a directional verb, it is necessary to analyze the spatial relationship between the recognized sign-language words and to find the proper combination of a directional verb and the sign-language words related to it. In this paper, we propose an analysis method for evaluating the spatial relationship between a directional verb and other sign-language words according to the distribution of the parameters representing the spatial relationship.
[Sagawa and Takeuchi(2000a)]
H. Sagawa and M. Takeuchi.
Development of an information kiosk with a sign language recognition system.
In Proceedings of the 2000 International Conference on Intelligent User Interfaces, Posters, pages 149-150, 2000.
http://www.acm.org/pubs/articles/proceedings/chi/355460/p149-sagawa/p149-sagawa.pdf
[Sagawa and Takeuchi(2000b)]
H. Sagawa and M. Takeuchi.
A method for recognizing a sequence of sign language words represented in a japanese sign language sentence.
In IEEE International Conference on Automatic Face and Gesture Recognition, Grenoble, France, Mar. 2000.
http://computer.org/proceedings/fg/0580/05800434abs.htm
A JSL (Japanese sign language) sentence is represented by connecting several sign-language words continuously. And there are transitions between the sign-language words that have no meaning in the sign-language sentence. Therefore, to translate JSL into Japanese, first, it is necessary to detect each sign-language word from the inputted gesture of a JSL sentence with high accuracy. And then, a proper sequence of the recognized sign-language words must be generated. To achieve this, we have developed (1) a method for effectively detecting the borders of the sign-language words from ordinary sign-language gestures and segmenting the sign-language gestures, (2) a method for detecting whether the sign-language gesture is represented by one hand or both hands, and (3) a method for distinguishing the segments representing the sing-language words from the segments representing the transitions. We have carried out an experiment with 200 samples of 10 JSL sentences to recognize the sequences of the sign language words using the developed methods. As the result, The accuracy for the word is improved from 77.6% to 86.6%, and the accuracy for the sentence is improved from 46.0% to 58.0% by using the developed methods. These results indicate that the developed methods are effective.
[Sagawa et al.(1997)]
H. Sagawa, M. Takeuchi, and M. Ohki.
Description and recognition methods for sign language based on gesture components.
In Proceedings of the 1997 International Conference on Intelligent User Interfaces, I/O Support/Spatial Awareness, pages 97-104, 1997.
http://www.acm.org/pubs/citations/proceedings/uist/238218/p97-sagawa/
[Sagawa et al.(1998)]
H. Sagawa, M. Takeuchi, and M. Ohki.
Methods to describe and recognize sign language based on gesture components represented by symbols and numerical values.
Knowledge-based Systems 10 (5):287-294, March 1998.
http://www.elsevier.com/gej-ng//10/30/56/12/11/14/abstract.html
Sign-language gestures inflect according to the context. To recognize such sign language properly, the structure of sign language must be made clear. It is well known that the structure of sign language is represented as a combination of gesture components. In this paper, methods for the description and recognition of sign-language gestures based on the gesture components are discussed. In these methods, which we have developed, a sign-language gesture is recognized by integrating the recognized gesture components according to the structure of the gesture. The results of an experiment on recognizing sign-language gestures are also examined and it is shown that the developed methods are effective.
[Shin et al.(1999)]
S.-H. Shin, S.-W. Kim, and Y. Aoki.
A structural learning of mlp classifiers using pfsga and its application to korean sign language recognition.
In Proceedings of the IEEE Region 10 Conference TENCON 99, pages 190-193, 1999.
http://ieeexplore.ieee.org/iel5/6630/17685/00818382.pdf
We present experimental results for a structural learning of multilayered perceptron (MLP) classifiers using PfSGA (Parameter-free Species Genetic Algorithm) and its application to the recognition of Korean sign language. The PfSGA is a combined method of the SGA (Species Genetic Algorithm) and PfSGA (Parameter-free Genetic Algorithm). The SGA is a modified GA for reducing the search space based on species concepts and PfGA is another modified GA to reduce the learning time without determining the learning parameters. Experimental results show that the proposed method could be a useful tool for choosing an appropriate architecture for high dimensions.
[Starner and Pentland(1995a)]
T. Starner and A. Pentland.
Real-time american sign language recognition from video using hidden markov models.
In Proceedings of the IEEE International Conference on Computer Vision, Coral Gables, FL, 1995.
http://citeseer.nj.nec.com/starner96realtime.html
[Starner and Pentland(1995b)]
T. Starner and A. Pentland.
Visual recognition of american sign language using hidden markov models.
In International Workshop on Automatic Face and Gesture Recognition (IWAFGR), Zurich, Switzerland, 1995.
http://vismod.www.media.mit.edu/cgi-bin/tr_pagemaker#TR306
[Starner et al.(1997)]
T. Starner, J. Weaver, and A. Pentland.
A wearable computing based american sign language recognizer.
Personal Technologies, 1 (4), 1997.
ftp://whitechapel.media.mit.edu/pub/tech-reports/TR-425.ps.Z
Also in: International Symposium on Wearable Computers 1997.
[Starner et al.(1998a)]
T. Starner, J. Weaver, and A. Pentland.
Real-time american sign language recognition using desk and wearable computer based video.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 20 (12): 1371-137, Dec. 1998.
http://computer.org/tpami/tp1998/i1371abs.htm
[Starner et al.(1998b)]
T. Starner, J. Weaver, and A. Pentland.
A wearable computer based american sign language recognizer.
In IEEE International Symposium on Wearable Computing, volume 1458 of Lecture Notes in Computer Science, 1998.
http://link.springer-ny.com/link/service/series/0558/bibs/1458/14580084.htm
Modern wearable computer designs package workstation level performance in systems small enough to be worn as clothing. These machines enable technology to be brought where it is needed the most for the handicapped: everyday mobile environments. This paper describes a research effort to make a wearable computer that can recognize (with the possible goal of translating) sentence level American Sign Language (ASL) using only a baseball cap mounted camera for input. Current accuracy exceeds 97% per word on a 40 word lexicon.
[Sugiyama et al.(1997)]
H. Sugiyama, S. Tanahashi, and Y. Aoki.
Recovering three dimensional hand motions of sign language from monocular image sequence.
In Proceedings of 1997 International Conference on Information, Communications and Signal Processing, volume 2, pages 1098-1101, 1997.
This paper proposes a method to recover three dimensional hand motions of Japanese sign language. This method uses skin-color information and edge and difference images. In this method, joint points are determined by considering a human body consisting of hierarchical structure. Also, we attempt to spot each word from a sentence by using estimated hand motion transitions. The experimental results show the validity of the proposed method.
[Sujan and Meggiolaro(2000)]
V. A. Sujan and M. A. Meggiolaro.
Sign language recognition using competitive learning in the havnet neural network.
In N. M. Nasrabadi and A. K. Katsaggelos, editors, Applications of Artificial Neural Networks in Imaging V (Electronic Imaging 2000), volume 3962 of Proc. SPIE, pages 2-12, San Jose, CA, 2000.
[Sutherland(1996)]
A. Sutherland.
Real-time video-based recognition of sign language gestures using guided template matching.
In P. A. Harling and A. D. Edwards, editors, Proceedings of Gesture Workshop, pages 31-38, 1996.
[Sweeney and Downton(1996)]
G. Sweeney and A. Downton.
Towards appearance-based multi-channel gesture recognition.
In P. A. Harling and A. D. Edwards, editors, Proceedings of Gesture Workshop, pages 7-16. Springer, 1996.
[Sweeney and Downton(1997)]
G. Sweeney and A. Downton.
Sign language recognition using a cheremic architecture.
In International Conference on Image Processing and Its Applications, volume 2, pages 483-486, 1997.
http://ieeexplore.ieee.org/iel3/4857/13412/00615572.pdf
In this paper we analyse the gestures that compose British Sign Language (BSL), where an immediate response or detection of a single gesture is not required but instead a sequence must be translated with a small delay to allow for contextual information to be considered. BSL is a natural language that uses two communication channels comprising hand/body postures (manual channel) and facial expressions (non-manual channel) to communicate information. In this paper, analysis is restricted to the manual channel only.
[Tamura and Kawasaki(1988)]
S. Tamura and S. Kawasaki.
Recognition of sign language motion images.
Pattern Recognition, 21 (4): 343-353, 1988.
[Tokuda and Okumura(1998a)]
M. Tokuda and M. Okumura.
Towards automatic translation from japanese into japanese sign language.
In Applications in Robotics, User Interfaces, and Natural Language Processing, volume 1458 of Lecture Notes in Computer Science. Springer, 1998.
http://link.springer.de/link/service/series/0558/bibs/1458/14580097.htm
In this paper, we present a prototype translation system named SYUWAN which translates Japanese into Japanese sign language. One of the most important problems in this task is that there are very few entries in a sign language dictionary compared with a Japanese one. To solve this problem, when the original input word does not exist in a sign language dictionary SYUWAN applies several techniques to find a similar word from a Japanese dictionary and substitutes this word for the original word. As the result, SYUWAN can translate up to 82% of words which are morphologically analyzed.
[Tokuda and Okumura(1998b)]
M. Tokuda and M. Okumura
Automatic complement of sign language dictionary in japanese-sign language machine translation.
Transactions of the Information Processing Society of Japan, 39, Mar. 1998.
[Uchida et al.(1994)]
M. Uchida, K. Ishikawa, and H. Ide.
Finger character recognition system and application to sign language.
Transactions of the Institute of Electrical Engineers of Japan, 114-C (10): 995-1000, Oct. 1994.
[Vamplew(1996)]
P. Vamplew.
Recognition of Sign Language Using Neural Networks.
PhD thesis, Department of Computer Science, University of Tasmania, 1996.
[Vamplew and Adams(1992)]
P. Vamplew and A. Adams.
The slarti system: Applying artificial neural networks to sign language recognition.
In H. Murphy, editor, Proceedings of the Seventh Anual Conference on Technology and persons with disabilities, pages 575-579, Northridge, CA, 1992.
[Vamplew and Adams(1995)]
P. Vamplew and A. Adams.
Recognition and anticipation of hand motions using a neural network.
In Proceedings of IEEE International Conference on Neural Networks, volume 3, pages 2904-2907, 1995.
http://citeseer.nj.nec.com/vamplew95recognition.html
[Veale et al.(1998)]
Tony Veale, Alan Conway and Brona Collins.
The challenges of cross-modal translation: English-to-sign-language translation in the zardoz system.
Machine translation, 13 (1): 81-106, 1998.
http://www.wkap.nl/oasis.htm/180039
[Vogler and Metaxas(1997)]
C. Vogler and D. Metaxas.
Adapting hidden markov models for asl recognition by using three-dimensional computer vision methods.
In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pages 156-161, Orlando, FL, Oct. 1997.
http://citeseer.nj.nec.com/271989.html
[Vogler and Metaxas(1998)]
C. Vogler and D. Metaxas.
Asl recognition based on a coupling between hmms and 3d motion analysis.
In Proceedings of the IEEE International Conference on Computer Vision, pages 363-369, Mumbai, India, Jan. 1998.
http://citeseer.nj.nec.com/vogler98asl.html
Also as Technical report MS-CIS-98-21.
[Vogler and Metaxas(1999a)]
C. Vogler and D. Metaxas.
Parallel hidden markov models for american sign language recognition.
In Proceedings of the IEEE International Conference on Computer Vision, Kerkyra, Greece, Sept. 1999.
ftp://ftp.cis.upenn.edu/pub/cvogler/iccv99.pdf
[Vogler and Metaxas(1999b)]
C. Vogler and D. Metaxas.
Toward scalability in asl recognition: Breaking down signs into phonemes.
In Proceedings of the Gesture Workshop, volume 1739 of Lecture Notes in Computer Science, Gif-sur-Yvette, France, Mar. 1999. Springer.
ftp://ftp.cis.upenn.edu/pub/cvogler/gw99.pdf
[Vogler and Metaxas(2001)]
C. Vogler and D. Metaxas.
A framework for recognizing the simultaneous aspects of american sign language.
Computer and Vision Image Understanding, (3): 358-384, Mar. 2001.
http://www.idealibrary.com/links/doi/10.1006/cviu.2000.0895
[Vogler et al.(2000)]
C. Vogler, H. Sun, and D. Metaxas.
A framework for motion recognition with applications to american sign language and gait recognition.
In Proceedings of the Workshop on Human Motion, Austin, TX, 2000.
http://www.computer.org/proceedings/humo/0939/09390033abs.htm
Human motion recognition has many important applications, such as improved human-computer interaction and surveillance. A big problem that plagues this research area is that human movements can be very complex. Managing this complexity is difficult. We turn to American sign language (ASL) recognition to identify general methods that reduce the complexity of human motion recognition. We present a framework for continuous 3D ASL recognition based on linguistic principles, especially the phonology of ASL. This framework is based on parallel hidden Markov models (HMMs), which are able to capture both the sequential and the simultaneous aspects of the language. Each HMM is based on a single phoneme of ASL. Because the phonemes are limited in number, as opposed to the virtually unlimited number of signs that can be composed from them, we expect this framework to scale well to larger applications. We then demonstrate the general applicability of this framework to other human motion recognition tasks by extending it to gait recognition.
[Waldron and Kim(1994)]
M. B. Waldron and S. Kim.
Increasing manual sign recognition vocabulary through relabeling.
In International Joint Conference on Neural Networks, pages 2885-2889, June 1994.
http://ieeexplore.ieee.org/iel2/3013/8560/00374689.pdf
In this paper we present the results of relabelling a self organizing map (SOM) to increase the dynamic manual signs it can recognize. Relabelling exploits the global ordering of self organizing map and abrogates the need for retraining, thereby reducing the computational costs and increasing the recognition ability of the network. This relabelling technique was applied to a dynamic sign recognition system to increase the recognition vocabulary from 10 to 14 signs. The data was collected from a person wearing a DataGlove with a Polhemus sensor and signing the 14 signs. The sampled hand data over the duration of sign was fed to phonemic recognition modules and the collective outputs of these modules were fed to the sign recognition module consisting of a relabelled self organizing network. The results showed that the overall recognition rate of the relabelled network was 84% as compared to 86% for the retrained network. Further, it was found that the dynamic sampling of the signs made the movement phoneme module unnecessary.
[Waldron and Kim(1995)]
M. B. Waldron and S. Kim.
Isolated asl sign recognition system for deaf persons.
IEEE Transactions on Rehabilitation Engineering, 3 (3), Sept. 1995.
http://ieeexplore.ieee.org/iel4/86/9209/00413199.pdf
The design and evaluation of a two-stage neural network which can recognize isolated ASL signs is given. The input to this network is the hand shape and position data obtained from a DataGlove mounted with a Polhemus sensor. The first level consists of four backpropagation neural networks which can recognize the sign language phonology, namely, the 36 hand shapes, 10 locations, 11 orientations, and 11 hand movements. The recognized phonemes from the beginning, middle, and end of the sign are fed to the second stage which recognizes the actual signs. Both backpropagation and Kohonen's self-organizing neural work was used to compare the performance and the expandability of the learned vocabulary. In the current work, six signers with differing hand sizes signed 14 signs which included hand shape, position, and motion fragile and triple robust signs. When a backpropagation network was used for the second stage, the results show that the network was able to recognize these signs with an overall accuracy of 86%. Further, the recognition results were linearly dependent on the size of the finger in relation to the metacarpophalangeal joint and the total length of the hand. When the second stage was a Kohonen's self-organizing network, the network could not only recognize the signs with 84% accuracy, but also expand its learned vocabulary through relabeling.
[Wang et al.(2000)]
C. Wang, W. Gao, and J. Ma.
An approach to automatically extracting the basic units in chinese sign language recognition.
In Proceedings 5th International Conference on Signal Processing, volume 2, pages 855 --858, 2000.
http://ieeexplore.ieee.org/iel5/7173/19290/00891645.pdf
The recognition of large vocabulary continuous Chinese sign language (CSL) is a challenging problem. It is effective to use phonemes instead of whole signs as the basic units. In this paper, an approach to extracting the basic units in CSL automatically is described. In order to find subwords in each data streams in sign signals, dynamic programming (DP) is proposed to segment the data streams, and then ANN approach combining k-means is used to classify these segments. 71 hand postures are automatically extracted from 1063 words and 200 continuous sentences. These postures accompanied with locations and orientations are used as basic units in large vocabulary continuous CSL recognition.
[Wang et al.(2001a)]
C. L. Wang, W. Gao, and Z. G. Xuan.
A real-time large vocabulary continuous recognition system for chinese sign language.
In IEEE Pacific Rim Conference on Multimedia 2001, volume 2195 of Lecture Notes in Computer Science, Bejing, China, 2001.
[Wang et al.(2001b)]
A Real-Time Large Vocabulary Recognition System for Chinese Sign Language.
C.L. Wang, W. Gao, and J. Ma.
In Proceedings of the Gesture Workshop,
London, Apr. 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/GWpaper_15.pdf
Draft.
[Wei et al.(2001)]
Z. Wei, Y. Kui, L. Jindong, and L. Bencheng.
A method for hand tracking and motion recognizing in chinese sign language.
In Proceedings 2001 International Conferences on Info-tech and Info-net, volume 3, pages 543-549, Beijing, China, 2001.
[Wilson and Anspach(1993a)]
E. Wilson and G. Anspach.
Neural networks for sign language translation.
In S. K. Rogers, editor, Applications of Artificial Neural Networks IV, volume 1965 of Proceedings of the SPIE - The International Society for Optical Engineering, pages 589-599, 1993.
[Wilson and Anspach(1993b)]
E. Wilson and G. Anspach.
Wavelets for sign language translation.
In B. G. Haskell and H.-M. Hang, editors, Visual Communications and Image Processing '93, volume 2094 of Proceedings of the SPIE - The International Society for Optical Engineering, pages 1300-1308, 1993.
http://spie.org/scripts/abstract.pl?bibcode=1993SPIE%2e2094%2e1300W&db_key=INST&qs=spie&s_type=paper
[Wilson and Gretel(1993)]
E. J. Wilson and A. Gretel.
Applying neural network developments to sign language translation.
In C. A. e. a. Kamm, editor, Proceedings of the 1993 IEEE-SP Workshop on Neural Networks for signal processing III, New York, NY, 1993.
http://ieeexplore.ieee.org/iel2/3312/9944/00471858.pdf
Neural networks are used to extract relevant features of sign language from video images of a person communicating in American Sign Language or Signed English. The key features are hand motion, hand location with respect to the body, and handshape. A modular design is under way to apply various techniques, including neural networks, in the development of a translation system that will facilitate communication between deaf and hearing people. Signal processing techniques developed for defense-related programs have been adapted and applied to this project. Algorithm development and transition using neural network architectures has been encouraging. The results of the feasibility study for this project are described.
[Wu and Gao(2000)]
J. Wu and W. Gao.
A fast sign word recognition method for chinese sign language.
In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third International Conference on Advances in Multimodal Interfaces, volume 1948 of Lecture Notes in Computer Science, pages 582-589, Oct. 2000.
http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480599.htm
Sign language is the language used by the deaf, which is a comparatively steadier expressive system composed of signs corresponding to postures and motions assisted by facial expression. The objective of sign language recognition research is to ``see'' the language of deaf. The integration of sign language recognition and sign language synthesis jointly comprise a ``human-computer sign language interpreter'', which facilitates the interaction between deaf and their surroundings. Considering the speed and performance of the recognition system, Cyberglove is selected as gesture input device in our sign language recognition system, Semi-Continuous Dynamic Gaussian Mixture Model (SCDGMM) is used as recognition technique, and a search scheme based on relative entropy is proposed and is applied to SCDGMM-based sign word recognition. Comparing with SCDGMM recognizer without searching scheme, the recognition time of SCDGMM recognizer with searching scheme reduces almost 15 times.
[Wu and Gao(2001)]
J. Wu and W. Gao.
The recognition of finger-spelling for chinese sign language.
In Proceedings of the Gesture Workshop, 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/gw2001_29.pdf
Draft.
In this paper 3-layer feedforward network is introduced to recognize Chinese manual alphabet. As it is plagued by a large amount of significant time required for training, a kind of fast learning algorithm - Single Parameter Dynamic Search Algorithm (SPDS) - is used to learn net parameters. In addition, multi-classifiers based on multi-features are constructed and a recognition algorithm for recognizing manual alphabets based on multi-features and multi-classifiers is proposed to promote the recognition performance of finger-spelling. From experiment result, it is shown that Chinese finger-spelling recognition based on multi-features and multi-classifiers outperforms its recognition based on single-classifier.
[Wu et al.(1998)]
J. Wu, W. Gao, Y. Song, W. Liu, and B. Pang.
A simple sign language recognition system based on data glove.
In Proceedings Fourth International Conference on Signal Processing, volume 2, pages 1257-1260, 1998.
http://ieeexplore.ieee.org/iel5/6237/16697/00770847.pdf
Human beings usually interact with each other either by via a natural language channel such as speech and writing, or by body languag, e.g. hand gestures, head gestures, facial expression, lip motion and so on. As a part of natural language understanding, sign language recognition is very important. On one hand, it is one of the main methods of human-computer interaction in virtual reality; on the other hand, it is an auxiliary tool for a deaf-mute to communicate with ordinary people through computer. In this paper the process of building a simple word-level sign language recognition system is presented, and the method for recognizing sign language words is also proposed.
[Wu et al. (2002)]
Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng.
Sign language translation using an error tolerant retrieval algorithm.
In Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2002.
[Xu et al.(2000)]
M. Xu, B. Raytchev, K. Sakaue, O. Hasegawa, A. Koizumi, M. Takeuchi, and H. Sagawa.
A vision-based method for recognizing non-manual information in japanese sign language.
In Third International Conference on Multimodal Interfaces, volume 1948 of Lecture Notes in Computer Science, pages 572-581, Beijing, China, Oct. 2000. Springer.
http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480572.htm
[Yamaguchi et al.(1995)]
T. Yamaguchi, M. Yoshihara, M. Akiba, M. Kuga, N. Kanazawa, and K. Kamata.
Japanese sign language recognition system using information infrastructure.
In Proceedings of 1995 IEEE International Conference on Fuzzy Systems, 1995.
http://ieeexplore.ieee.org/iel2/3225/9179/00410043.pdf
The paper proposes a Japanese sign language recognition system that makes associative inferences from video information. The proposed system uses two associative memory features: association robustness and associative memory combination. A test of the actual system was made using sixteen words and the recognition system correctly distinguished all words.
[Yamamoto et al.(1997)]
Y. Yamamoto, M. Uchida, and H. Ide.
Sign language recognition by statistics method.
The Transactions of the Institute of Electrical Engineers of Japan, 117 (3), 1997.
[Yang and Ahuja(1998)]
M.-H. Yang and N. Ahuja.
Extraction and classification of visual motion patterns for hand gesture recognition.
In IEEE Conference on Computer Vision and Pattern Recognition, pages 892-897, Santa Barbara, CA, 1998.
http://citeseer.nj.nec.com/yang98extraction.html
We present a new method for extracting and classifying motion patterns to recognize hand gestures. First, motion segmentation of the image sequence is generated based on a multiscale transform and attributed graph matching of regions across frames. This produces region correspondences and their affine transformations. Second, color information of motion regions is used to determine skin regions. Third, human head and palm regions are identified based on the shape and size of skin areas in motion. Finally, affine transformations defining a region's motion between successive frames are concatenated to construct the region's motion trajectory. Gestural motion trajectories are then classified...
[Yang and Ahuja(1999)]
M.-H. Yang and N. Ahuja.
Recognizing hand gesture using motion trajectories.
In IEEE Conference on Computer Vision and Pattern Recognition, pages 466-472, Ft. Collins, CO, June 1999.
http://ieeexplore.ieee.org/iel5/6370/17045/00786979.pdf
We present an algorithm for extracting and classifying two-dimensional motion in an image sequence based on motion trajectories. First, a multiscale segmentation is performed to generate homogeneous regions in each frame. Regions between consecutive frames are then matched to obtain 2-view correspondences. Affine transformations are computed from each pair of corresponding regions to define pixel matches. Pixels matches over consecutive images pairs are concatenated to obtain pixel-level motion trajectories across the image sequence. Motion patterns are learned from the extracted trajectories using a time-delay neural network. We apply the proposed method to recognize 40 hand gestures of American Sign Language. Experimental results show that motion patterns in hand gestures can be extracted and recognized with high recognition rate using motion trajectories.
[Yoshino et al.(1996)]
K. Yoshino, T. Kawashima, and Y. Aoki.
Recognition of japanese sign language from image sequence using color combination.
In Proceedings of the IEEE International Conference on Image Processing, volume 3, pages 511-14, 1996.
This paper proposes a method of recognising Japanese sign language indirectly using a colored glove which has multiple color patches on its surface. Japanese sign language has finger alphabets and words which include a hand motion. In the proposed method, the finger alphabets are estimated from three parameters: the color combination of the visible patches, the dispersion of the visible patches, and the hand direction. The color combination is represented by the mean of the color histogram of the visible patches in an input image. Words which include a hand motion are estimated from the transition of characteristic hand structures which are detected from an image sequence by changes in the color combination of the visible patches. The experimental results show the validity of the proposed method.
[Zhang et al. (2002)]
Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel
Automatic Sign Translation
In Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2002.
[Zhao et al.(2000)]
L. Zhao, K. Kipper, W. Schuler, C. Vogler, N. Badler, and M. Palmer.
A machine translation system from english to american sign language.
In Proceedings of the Association for Machine Translation in the Americas, 2000.
http://www.cis.upenn.edu/~lwzhao/papers/amta00.pdf
Abramatic, J.F., Letellier, P., Nadler, M.,
A Narrow-Band Video Communication System for the Transmission of Sign Language over Ordinary Telephone Lines,
ISPDSA83(314-346).
Letellier, P., Nadler, M., Abramatic, J.F.
The Telesign Project
PIEEE(73), 1985, pp. 813-827. BibRef 8500
Mozelle, Gerard; Preteux, Francoise; Viallet, Jean-Emmanuel;
Telesign: a videophone system for sign language distant communication
http://spie.org/scripts/abstract.pl?bibcode=1998SPIE%2e3457%2e%2e157M&db_key=INST&qs=spie&s_type=paper
Sperling, G., Landy, M.S., Cohen, Y., and Pavel, M.,
Intelligible Encoding of ASL Image Sequences at Extremely Low Information Rates,
CVGIP(31), No. 3, September 1985, pp. 335-391. BibRef 8509 And: Correction: CVGIP(35), No. 2, August 1986, pp. 274. Application, Sign Language. NYU, The various methods get to 80% intelligibility at 9600Baud.
Woelders W., Frowein H.W., Nielsen J., Questa P. and Sandini G.
New developments in low-bit rate videotelephony for people who are deaf
Journal of Speech, Language, and Hearing Research, Vol: 40 Iss: 6 p. 1425-33, American Speech-Language-Hearing Assoc., Dec. 1997
Higashino
S. and Kasahara H.
A hierarchical browsing system for sign language video
Design of Computing Systems: Cognitive Considerations. Proceedings of the Seventh International Conference on Human-Computer Interaction (HCI International '97), Elsevier Amsterdam, Netherlands, 1997.
Zaharia, Titus; Preda, Marius; Preteux, Francoise
Sign language indexation within the MPEG-7 framework
Pook, Polly K.; Ballard, Dana H.
Sign language for telemanipulation
Proc.
SPIE Vol. 2351, p. 13-24, Telemanipulator and Telepresence Technologies, Hari
Das; Ed., 1995
http://spie.org/scripts/abstract.pl?bibcode=1995SPIE%2e2351%2e%2e%2e13P&db_key=INST&qs=spie&s_type=paper
Encoding of sign language image sequences at very low rate
http://spie.org/scripts/abstract.pl?bibcode=1990SPIE%2e1360%2e1140H&db_key=INST&qs=spie&s_type=paper
Tomohiro Kuroda, Kosuke Sato, Kunihiro Chihara
S-TEL: An Avatar Based Sign Language Telecommunication System
International Journal of Virtual Reality, Vol.3, No.4, pp.21-27, (1998)
S.Malassiotis and M.G.Strintzis
Motion Estimation Based on Spatio-Temporal Warping for Very Low Bit-Rate Coding
IEEE Trans. on Communications, Vol.45, No.10, pp.1172-1176, October 1997.
http://www.iti.gr/db.php/en/publications/details/9.html
Richard P. Schumeyer, Kenneth E. Barner
Gesture Color-Based Content Coding with Applications to Sign Language Video Communications (1997)
http://citeseer.nj.nec.com/1091.html
J. Loomis and H. Poizner and U. Bellugi and A. Blakemore and J. Hollerbach
Computer graphic modeling of American sign language
Computer
Graphics, 17(3), pp. 105-114, July 1983.
An interface for transcribing American sign language
SIGGRAPH 99. Proceedings of the 1999 SIGGRAPH annual conference: Conference abstracts and applications, Computer Graphics, pp. 229-229, ACM Press, 1999.
http://www.acm.org/pubs/citations/proceedings/graph/311625/p229-wolfe/
Eric Sedgwick and Karen Alkoby and Mary Jo Davidson and Roymieco Carter and Juliet Christopher and Brock Craft and Jacob Furst and Damien Hinkle and Brian Konie andGlenn Lancaster and Steve Luecking and Ashley Morris and John McDonald and Noriko Tomuro and Jorge Toro and Rosalee Wolfe
Toward the Effective Animation of American Sign Language
WSCG 2001 Conference Proceedings, 2001.
http://visinfo.zib.de/EVlib/Show?EVL-2001-50
Jorge Toro, Jacob Furst, Karen Alkoby, Roymieco Carter, Juliet Christopher, Brock Craft, Mary Jo Davidson, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.
A Graphical Environment for Transcription of American Sign Language.
Proceedings of the ISCA 16th International Conference on Computers and Their Applications (CATA-2001). To appear.
Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Roymieco
Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian
Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko
Tomuro, Jorge Toro, Rosalee Wolfe.
Improved Hand Animation for
American Sign Language.
“Technology and Persons with Disabilities” Conference 2001. California State University at Northridge, Los Angeles, CA. March 19-24, 2001.
Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
Toward the effective animation of American Sign Language.
Proceedings of the 9th International Conference in Central Europe on Computer Graphics, Visualization and Interactive Digital Media. To appear.
Brock Craft, Damien Hinkle, Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Jacob Furst, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
An Approach To Modeling Facial Expressions Used In American Sign Language.
Presented At The 2000 DePaul CTI Research Conference, Chicago, IL November 4, 2000.
Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Andre Berthiaume, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
Usability Testing of Computer Animation of Fingerspelling for American Sign Language.
Presented at the 2000 DePaul CTI Research Conference, Chicago, IL, November 4, 2000.
John McDonald, Jorge Toro, Karen Alkoby, Andre Berthiaume, Pattaraporn Chomwong, Juliet Christopher, Mary Jo Davidson, Jacob Furst, Brian Konie, Glenn Lancaster, Steven Lytinen, Lopa Roychoudhuri, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.
An Improved Articulated Model of the Human Hand.
Proceedings of the 8th International Conference in Central
Europe on Computer Graphics, Visualization and Interactive Digital Media. 2000.
306 - 313.
Karen Alkoby and Eric Sedgwick,
Using a Computer to Fingerspell.
DeafExpo 99, San Diego, CA, November 19-22, 1999.
Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
Improved Hand Animation for American Sign Language.
“Technology and Persons with Disabilities” Conference 2001. California State University at Northridge, Los Angeles, CA. March 19-24, 2001.
Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
Toward the effective animation of American Sign Language.
Proceedings of the 9th International Conference in Central Europe on Computer Graphics, Visualization and Interactive Digital Media. To appear.
Brock Craft, Damien Hinkle, Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Jacob Furst, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
An Approach To Modeling Facial Expressions Used In American Sign Language.
Presented At The 2000 DePaul CTI Research Conference, Chicago, IL November 4, 2000.
Jorge Toro, Jacob Furst, Karen Alkoby, Roymieco Carter, Juliet Christopher, Brock Craft, Mary Jo Davidson, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.
A Graphical Environment for Transcription of American Sign Language.
Proceedings of the ISCA 16th International Conference on Computers and Their Applications (CATA-2001). To appear
http://asl.cs.depaul.edu/Toro01.pdf
Angus B. Grieve-Smith
SignSynth: A Sign Language Synthesis Application Using Web3D and Perl
Gesture Workshop 2001
Sign synthesis (also known as text-to-sign) has recently seen a large increase in the number of projects under development. Many of these focus on translation from spoken languages, but other applications include dictionaries and language learning. I will discuss the architecture of typical sign synthesis applications and mention some of the applications and prototypes currently available. I will focus on SignSynth, a CGI-based articulatory sign synthesis prototype I am developing at the University of New Mexico. SignSynth takes as its input a sign language text in ASCII-Stokoe notation (chosen as a simple starting point) and converts it to an internal feature tree. This underlying linguistic representation is then converted into a three-dimensional animation sequence in Virtual Reality Modeling Language (VRML or Web3D), which is automatically rendered by a Web3D browser.
Richard Kennaway
Synthetic Animation of Deaf Signing Gestures
Gesture Workshop 2001
We describe a method for automatically synthesizing deaf signing animations from a high-level description of signs in terms of the HamNoSys transcription system. Lifelike movement is achieved by combining a simple control model of hand movement with inverse kinematic calculations for placement of the arms. The realism can be further enhanced by mixing the synthesized animation with motion capture data for the spine and neck, to add natural "ambient motion".
A linguistic approach to sign language synthesis
People and Computers (1994), 211-222 p.
Franc Solina, Slavko Krapez, Vito Komac, Ales Jaklic
Multimedia Dictionary and Synthesis of Sign Language
http://citeseer.nj.nec.com/308411.html
Krapez, S. and Solina, F. (1999).
Synthesis of the sign language of the deaf from the sign video clips
Electrotechnical Review, 66(4--5):260--265
Kawano, S. and Kurokawa, Takao
The effects of facial expression on understanding Japanese Sign Language animation.
Bullinger, H.-J. / Ziegler, Jürgen (eds): Human-Computer Interaction: Ergonomics and User Interfaces. Proceedings of HCI International '99 (8th International Conference on Human-Computer Interaction) 2 Vols. Mahwah, NJ : Erlbaum (1999) - pp. 783-787
Kurokawa, Takao
Gesture coding and a gesture dictionary for a nonverbal interface.
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E75-A:2 E75-A: 2 (1992) - pp. 112-121
Kurokawa, Takao and Ono, Makoto and Kamiya, Hiroyuki
Coding of hand shapes and reduction of their semantic redundancy fundamental studies on noonverbal communication between man and machinery.
Proceedings of the 4th Symposion on Human Interface. (Hyumanu-Intafesu-Shinpojiumu-Ronbunshu; 4). (1988) - pp. 167-172
Ohashi, Kazuaki and Shibata, Akihiro and Kurokawa, Takao
3-D reconstruction of human body pose from a single view for the purpose of gesture interpretation.
Proceedings of the 2nd Symposion on Human Interface. (Hyumanu-Intafesu-Shinpojiumu-Ronbunshu; 2). (1986) - pp. 345-350
Kato, Yushi et al
A present overview and future problems on sign language engineering.
Human interface news and report 12 (1997) - pp. 37-44
Kawano, S. and Senba, K. and Kurokawa, Takao
Introduction of facial movement into sign language animation and its effects.
Human interface news and report 12 (1997) - pp. 357-362
Kurokawa, Takao and Senba, K. and Kawano, S.
Synthesis of facial movement images for sign language animation and their effects.
Advances in human factors / ergonomics 21 (1997) - pp. 5-8
Kurokawa, Takao and Takahashi, Y.
An analysis of blinking, gazing, nodding and head movement of an interpreter in signed TV news and rules of their occurrence.
Research Report of Cooperative Research Center, Kyoto Institute of Technology 6 (1997) - pp. 9-16
Kurokawa, Takao and Senba, K. and Kawano, S.
Synthesis of facial movement images for sign language animation and their effects.
Research Report of Cooperative Research Center, Kyoto Institute of Technology 6 (1997) - pp. 17-22
Hong, Mun-Ho; Choi, Chang-Seok; Kim, Chang-Seok; Jeon, Joon-Hyeon;
Synthesis of image sequences for Korean sign language using 3D shape model
H. Sakato, S. Lu, S. Igi
Development and Experiment of Generating JSL Animation System Based on Motion-Primitives
IPSJ SIG Notes, 98-HI-77, pp.81-86, 1998.
H. Sakato, S. Lu, S. Igi
Japanese Sign-Language Animation based on Motion-Primitives
Proc. of the Thirteenth Symposium on Human Interface, pp.243-248, Oct., 1997.
H. Sakato, S. Lu, and S. Igi
A Study of Generating JSL Annimation Based on Motion-Primitives
Proc. of 1997 EICE General Conference, No.1, pp.3271997
S Lu, K. Imagawa, S. Igi
A study to Motion Control of Sign Language Animation Character
Proc. of the 12th Symposium on Human Interface, pp.79-86, Oct.22-25, 1996.
Grobel, Kirsti and Hienz, Hermann
The video input system
Soede, M. (ed): SignPS - a System for Sign Writing, Final Report of the EU TIDE 1202 Project. Hoensbroek (The Netherlands) (1997), pp. 7/61- 7/67
Hienz, Hermann (editor)
Sign writing using a video-based input system
The practicability and avantages of writing and printing natural signs
American Annals of the Deaf and Dumb, Volume 14 (1869), 157-182 p.
Ascii-Stokoe notation: A computer-writeable transliteration system for Stokoe notation of American Sign Language
Unpubl. Manuscript, 993 - 12 p.
Dubuisson, Colette and Leclerc, Sylvie and DeMaisonneuve, Serge
Les graphes conceptuels : un outil de représentation des langues signées, ACL 95
Unpubl. Manuscript,Montréal 1995
Prillwitz, Siegmund and Leven, Regina and Zienert, Heiko and Hanke, Thomas and Henning, Jan
HamNoSys. Version 2.0; Hamburg Notation System for Sign Languages. An introductory guide
International Studies on Sign Language and Communication of the Deaf; 5, Hamburg : Signum 1989 - 46 p. ISBN: 3-927731-01-3
Hanke, Thomas and Prillwitz, Siegmund
syncWRITER: Integrating video into the transcription and analysis of sign language
Ikehara, Wako and Kamikubo, Emiko and Hiki, Shizuo
A new descriptive system for hand shapes used in signing: A proposal based on anatomical structure modeling of finger actions
Unpubl. Manuscript,1995 - 12 p.
Miller, Christopher and Radutzky, Elena
Toward a common notation standard for sign language phonology. Paper presented at the 2nd Intersign Workshop Leiden, December
Manuscript, 1998
Notational representation of sign language: A structural description of hand configurations
Lecture Notes in Computer Science, Vol. 860, p. 38-??, 1994
Brown, Maxine D. and Smoliar, Stephen W.
A graphics editor for Labanotation
Computer Graphics 10:2 (1976), 60-65 p.
Coding and transcription for BSL acquisition
Manuscript, Bristol : Centre for Deaf Studies 1996 - 43 p.
SignGlyphics: How to sketch sign pictures
Flint, MI : SignQuest Publishers 1993 - 16 p., ISBN: 1-878819-31-3
Antônio
Carlos da Rocha Costa, Graçaliz Pereira Dimuro
SignWriting-based Sign Language Processing
Gesture Workshop 2001
This paper proposes an approach to the computer processing of deaf sign languages that uses SignWriting as the writing system for deaf sign languages, and SWML (SignWriting Markup Language) as its computer encoding. Every kind of language and document processing (storage and retrieval, analysis and generation, translation, spell-checking, search, animation, dictionary automation, etc.) can be applied to sign language texts and phrases when they are written in SignWriting and encoded in SWML. This opens the whole area of deaf sign languages to the methods and techniques of text-oriented computational linguistics.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/gw38.pdf (draft)
Kathleen F. McCoy Lisa
A Tutor for Teaching English as a Second Language for Deaf Users of American Sign Language (1997)
http://citeseer.nj.nec.com/mccoy97tutor.html
Lisa N. Michaud and Kathleen F. McCoy and Christopher A. Pennington
An Intelligent Tutoring System for Deaf Learners of Written English
Fourth Annual ACM Conference on Assistive Technologies, pp. 92-100, ACM, 2000.
http://www.acm.org/pubs/articles/proceedings/assets/354324/p92-michaud/p92-michaud.pdf
Fernando
Alonso, Angélica de Antonio, José L. Fuertes, César Montes
Teaching Communication Skills to Hearing-Impaired Children
http://computer.org/multimedia/mu1995/u4055abs.htm
Sutherland, A. and Padden, Tessa
Videoconferencing for deaf people: a case study of on-line education for deaf people
Deafness and education international 1:2 (1999), 114-120 p.
Sign language structure: An outline of the visual communication systems of the American deaf
Studies in Linguistics. Occasional Paper; 8, Buffalo, NY : Univ. of Buffalo 1960 - 78 p.
Sign language structure: An outline of the visual communication systems of the American deaf
Edition: 2. ed, Silver Spring, Md. : Linstok Pr. 1993 - 94 p., ISBN: 0-932130-03-8
Ascension Technology Corporation -
Motion Sensors
Waleed’s data
Consisting of 95 different AUSLAN signs, with 27 samples per sign, for a total of 2565 signs, signed by a native signer volunteer.
Collected using :
http://www.cse.unsw.edu.au/~waleed/phd/
Purdue ASL database
This database consists of several videos of a set of ASL (American Sign Language) signs. The database is divided in three main parts: i) a set of videos of signs with distinct motion patterns, ii) a set of words with distinct handshapes, and iii) several ASL sentences to study prosody and sentence structure.
Fourteen native signers participated in the acquisition of the data. This data, for ten of the participants, will be made public available in Spring 2002. Due to its large size, the database will only be available on a set of 30 DVD. The DVDs will be made available at production cost
http://rvl1.ecn.purdue.edu/~aleix/ASLdatabase.htm