Click To Preview
 

 


Sign Language Processing

Paulo Barthelmess

University of Colorado at Boulder - Computer Science Department

Collaboration Technology Research Group

 

 

 

Introduction

Here’s a collection of Sign Language related resources. It focuses mainly on literature on automatic Sign Language processing. Other links were collected as a result of this main search and included whenever found to be potentially interesting. This is of course an ongoing effort. If you have contributions not listed here, please send me a link (your contribution will of course be acknowledged).

 

Additionally, you may want to look at my Multimodality page.

 

Sign Languages are visual languages consisting of formalized gestures, facial expressions including lip movement and eye-gaze. Gestures usually involve whole body movements, in particular movements of the hands and arms. Facial expressions are commonly used as part of the language. In American Sign Language (ASL), for instance, determining if an utterance is a statement of fact or a question, depends on eyebrow and facial configurations (raised eyebrows on a tilted forward head are used to mark questions); in some sentences, determining if it is affirmative or negative depends on interpreting head movements. Being full languages, Sign Languages are complex and pose at least similar obstacles to machine processing as spoken languages do.

 

Sign language processing constitutes a specialization of the broader human movement analysis research  and is anchored on similar techniques as the ones employed for, e.g. gesture and facial analysis in general. There are a variety of algorithms that target gesture (both through vision and through instrumented gloves), facial expression and lipreading, as well as other sources of data (modalities) such as eye-gaze detection. Most of the work targets one of the possible modalities independently from the others, particularly in Sign Language processing, where the bulk of the work is done on interpretation of hand-gestures, or even more narrowly on fingerspelling. Existing techniques for analyzing independent human motion modalities are mainly based on traditional processing tools and methods, e.g., Hidden Markov Models (HMMs) (e.g. Starner and Pentland, 1996; Assan and Grobel, 1998; Hienz et al, 1999), neural nets (e.g. Wilson and Anspach, 1993; Erenshteyn et al., 1994; Vamplew, 1996), rules extracted by a temporal concept learner  (Kadous, 1998), bayesian nets (Garg et al., 2000), finite state machines (Hong et al., 2000), colored Petri nets coupled to HMMs (Huang and Jeng, 2001).

 

Research Centers / Projects

Listed are centers that did research in the listed areas at some point in time. Some continue their research efforts, others not. Researchers may also have moved to other centers. We list the year the last related publication appeared in press.

 

Center/Project

Last pub

People

University of New South Wales

2002

Mohammed Waleed Kadous

Visicast Project

2002

 

 

2001

Zon Wei; Yuan Kui; Liu Jindong; Luo Bencheng

Aachen University of Technology

2001

Hermann Hienz, Britta Bauer, Kirsti Grobel, Karl-Friedrich Kraiss, Marcell Assan

Nagoya Institute of Technology

2001

Marcus Vinicius Lamar (U Federal do Paraná, Brazil), Md.S.Bhuiyan, A.Iwata

National Tsing-Hua University, Taiwan

2001

Chung-Lin Huang, Ming-Shan Wu, Sheng-Hung Jeng

Paris XI Orsay University

2001

Annelies Braffort

University of Macedonia in Thessaloniki, Greece

2001

Vassilia N. Pashaloudi

University of Pennsylvania

2001

Christian Vogler, Dimitris Metaxas

University of Western Australia

2001

Eun-Jung Holden, Robyn Owens, Geoffrey G. Roy

 

2000

K. Imagawa, Shan Lu, Seiji Igi, Hideaki Matsuo, Yuji Nagashima, Yuji Takata, and Terutaka Teshima

Hitachi Research Laboratory

2000

Hirohiko Sagawa, Masaru Takeuchi, Masaru Ohki, Tomoko Sakiyama, Eiji Oohira, Hisashi Ikeda, Hiromichi Fujisawa

Institute of Computing Tech, Chinese Academy of Sciences

2000

Wen Gao, Jiyong Ma, Jiangqin Wu Chunli Wang

Michigan State University

2000

Yuntao Cui, John J. Weng

MIT · Mechanical Engineering Department

2000

Vivek A. Sujan, Marco A. Meggiolaro (PUC, Brazil)

 

1999

Sugiyama, H.; Tanahashi, S.; Aoki, Y. - Seong-Hyo Shin; Sang-Woon Kim

Kyoto Institute of Technology

1999

Takao Kurokawa, Sumihiro Kawano, K. Senba

University of Illinois at Urbana-Champaign Beckman Institute

1999

Ming-Hsuan Yang, Narendra Ahuja

University of Ljubljana

1999

Franc Solina, Slavko Krapez, Ales Jaklic

Communications Research Laboratory, Japan

1998

Seiji Igi, Shan Lu, K. Imagawa, H. Matsuo, H. Sakato, Y. Nagashima

DalTech (Technical University of Nova Scotia)

1998

Moussa Habib Abdallah, A.E. Marble, Charoensak Charayaphan

MIT Media Lab

1998

Alex Pentland, Thad Starner, Joshua Weaver

National Taiwan University

1998

Rung-Huei Liang, Ming Ouhyoung (Shih-Chien University, Taiwan)

Ohio State University

1998

Chung-Lin Huang

 

1997

Yamamoto, Y., Uchida, Masafumi, Ide, Hideto

 

1997

Henrik Birk, Thomas B. Moeslund, Claus B. Madsen

KAIST, Korea, Electrical Engineering

1997

Zeungnam Bien; Gyu-Tae Park; Won Jang (Agency for Devence Development, Korea); Jong-Sung Kim

University of Delaware

1997

Roman Erenshteyn, Pavel Laskov, Richard A. Foulds, Garland Stern, Lynn Messing

University of Essex, UK

1997

G.J. Sweeney, Andy C. Dowton

University of Tasmania

1996

Peter Vamplew, Anthony Adams

 

1995

Yamaguchi T., Yoshihara M., Akiba M., Kuga M., Kanazawa N. and Kamata K

 

1995

Chung-Lin Huang; Wen-Yi Huang; Cheng-Chang Lien

Ohio State University, Biomedical Engineering

1995

M.B. Waldron, S. Kim

 

1994

Hamilton J. and Micheli-Tzanakou E.

 

1994

Geoff D. Roberts

Simon Fraser University, Canada

1994

Brigitte Dorner, Eli Hagen

 

1993

A. Sutherland

Raytheon Co

1993

Elizabeth J. Wilson, Gretel Anspach

University of Tokyo

1993

Tosiyasu L. Kunii, Jintae Lee

 

1988

S. Tamura, S. Kawasaki

 

1982

R. Harrison

 

Web resources

Sign Language

 

American Sign Language Research Project at Boston U

 http://www.bu.edu/asllrp/

National Center for Sign Language and Gesture Resources

http://www.bu.edu/asllrp/cslgr/

 

Communications Research Laboratory, Japan - Research and development on sign language recognition

http://www2.crl.go.jp/jt/a131/research/univ-e.html

 

Leiden U’s Sign Phonology Group Sign language sites on the WWW

http://www.leidenuniv.nl/hil/sign-lang/sl-sites.html

Sign Stream

http://www.bu.edu/asllrp/SignStream/

U of Hamburg’s International Bibliography of Sign Language

http://www.sign-lang.uni-hamburg.de/bibweb/

 

Waleed’s page on Machine Gesture and Sign Language Recognition

http://www.cse.unsw.edu.au/~waleed/gsl-rec/

Beckman Institute’s Gesture Interpretation using Spatio-Temporal analysis – GIST

http://vision.ai.uiuc.edu/mhyang/gist.html

 

 

 

Animation

 

DePaul University

http://asl.cs.depaul.edu/relwork.html

Signing avatar

http://www.signingavatar.com/

Visicast

http://www.visicast.sys.uea.ac.uk

 

Transcription / Notation

 

HamNoSys 

http://www.sign-lang.uni-hamburg.de/Projects/HamNoSys.html

SignWriting

http://www.SignWriting.org/

U Hamburg Bibliography “Transcription / notation”

http://www.sign-lang.uni-hamburg.de/bibweb/Lidat.acgi?KEYWORDALTID=459

 

Translation

 

Visicast 

http://www.visicast.sys.uea.ac.uk

Penn’s TEAM

http://www.cis.upenn.edu/~lwzhao/research/TEAM.html

The SASL Project

http://www.cs.sun.ac.za/~lynette/signlang.html

 

Linguistics

 

Kearsy Cormier

http://ccwf.cc.utexas.edu/~kearsy/

 

 

 

Education

 

ICICLE Project

http://www.asel.udel.edu/nli/nlp/icicle.html

 

 

 

Software

 

Intel OpenCV

http://www.intel.com/research/mrl/research/opencv/

Entropic’s HTK

http://htk.eng.cam.ac.uk/index.shtml

 

Journals

Computer Vision and Image Understanding: CVIU

IJCV - International Journal of Computer Vision

IEEE Trans on Pattern Analysis and Machine Intelligence

International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI)

 

Conferences

IEEE International Conference on Automatic Face and Gesture Recognition

 

IEEE Conference on Computer Vision and Pattern Recognition

 

IEEE International Conference on Computer Vision

 

International Workshop on Gesture and Sign Language based Human-Computer Interaction

 

Workshop on Human Motion

 

International Conference on Spoken Language Processing

News lists

So-qiwoa, a computational sign linguistics discussion list:

Send an email to <listserv@list.unm.edu> saying

"subscribe so-qiwoa-l firstname lastname", example:

"subscribe so-qiwoa-l Laurent Clerc"

Bibliography

Surveys / Overviews

 

[Aggarwal and Cai(1999)]

J. K. Aggarwal and Q. Cai.

Human motion analysis: A review.

Computer and Vision Image Understanding, 73  (3): 428-440, March 1999.

http://www.idealibrary.com/servlet/doi/10.1006/cviu.1998.0744

Human motion analysis is receiving increasing attention from computer  vision researchers. This interest is motivated by a wide spectrum of  applications, such as athletic performance analysis, surveillance,  man-machine interfaces, content-based image storage and retrieval, and video  conferencing. This paper gives an overview of the various tasks involved in  motion analysis of the human body. We focus on three major areas related to  interpreting human motion: (1) motion analysis involving human body parts,  (2) tracking a moving human from a single view or multiple camera  perspectives, and (3) recognizing human activities from image sequences.  Motion analysis of human body parts involves the low-level segmentation of  the human body into segments connected by joints and recovers the 3D  structure of the human body using its 2D projections over a sequence of  images. Tracking human motion from a single view or multiple perspectives  focuses on higher-level processing, in which moving humans are tracked  without identifying their body parts. After successfully matching the moving  human image from one frame to another in an image sequence, understanding the  human movements or activities comes naturally, which leads to our discussion  of recognizing human activities.

 

[Cedras and Shah(1995)]

C. Cedras and M. Shah.

Motion based recognition: A survey.

Image and Vision Computing, 13 (2): 129-155,  1995.

 

[Gavrila(1999)]

D. Gavrila.

The visual analysis of human movement: A survey.

Computer and Vision Image Understanding, 73  (1): 82-98, January 1999.

http://citeseer.nj.nec.com/400214.html

The ability to recognize humans and their activities by vision is key for a machine to interact intelligently and effortlessly with a human inhabited environment. Because of many potentially important applications, "Looking at People" is currently one of the most active application domains in computer vision. This survey identifies a number of promising applications and provides an overview of recent developments in this domain. The scope of this survey is limited to work on whole-body or hand motion; it does not include work on human faces. The emphasis is on discussing the various methodologies; they are grouped in 2-D approaches with or without explicit shape models and 3-D...

 

 [Kohler and Schroeter(1998)]

M. Kohler and S. Schroeter.

A survey of video-based gesture recognition - stereo and mono  systems.

Technical Report 693, Informatik VII, University of Dortmund,  Germany, August 1998.

ftp://ls7-ftp.cs.uni-dortmund.de/pub/reports/ls7/1998/rr-693.ps.gz

 

[LaViola(1999)]

Joseph J. LaViola, Jr.

A survey of hand posture and gesture recognition techniques and technology.

Technical Report CS-99-11, Department of Computer Science, Brown University, June 1999.

ftp://ftp.cs.brown.edu/pub/techreports/99/cs99-11.ps.Z

 

 [Moeslund and Granum(2001)]

T. B. Moeslund and E. Granum.

A survey of computer vision-based human motion capture.

International Journal of Computer Vision, 81 (3),  2001.

http://citeseer.nj.nec.com/moeslund01survey.html

 

 [Pavlovic et al.(1997)]

V. I. Pavlovic, R. Sharma, and T. S. Huang.

Visual interpretation of hand gestures for human-computer  interaction: A review.

IEEE Transactions on Pattern Analysis and Machine Intelligence,  19 (7): 677-695, July 1997.

http://computer.org/tpami/tp1997/i0677abs.htm

The use of hand gestures provides an attractive alternative to  cumbersome interface devices for human-computer interaction (HCI). In  particular, visual interpretation of hand gestures can help in achieving the  ease and naturalness desired for HCI. This has motivated a very active  research area concerned with computer vision-based analysis and  interpretation of hand gestures. We survey the literature on visual  interpretation of hand gestures in the context of its role in HCI. This  discussion is organized on the basis of the method used for modeling,  analyzing, and recognizing gestures. Important differences in the gesture  interpretation approaches arise depending on whether a 3D model of the human  hand or an image appearance model of the human hand is used. 3D hand models  offer a way of more elaborate modeling of hand gestures but lead to  computational hurdles that have not been overcome given the real-time  requirements of HCI. Appearance-based models lead to computationally  efficient "purposive" approaches that work well under constrained situations  but seem to lack the generality desirable for HCI. We also discuss  implemented gestural systems as well as other potential applications of  vision-based gesture recognition. Although the current progress is  encouraging, further theoretical as well as computational advances are needed  before gestures can be widely used for HCI. We discuss directions of future  research in gesture recognition, including its integration with other natural  modes of human-computer interaction.

 

[Wu and Huang(1999)]

Y. Wu and T. S. Huang.

Vision-based gesture recognition: A review.

In A. Braffort, R. Gherbi, S. Gibet, J. Richardson, and D. Teil,  editors, Proceedings of the Gesture Workshop, volume 1739 of Lecture  Notes in Computer Science. Springer, 1999.

http://link.springer-ny.com/link/service/series/0558/papers/1739/17390103.pdf

 

Recognition / Translation

 

[Abdallah(1998)]

M. Abdallah.

A neuro-hierarchical multilayer network in the translation of the  american sign language.

In Proceedings of the IEEE Southeastcon:Region 3, pages  224-227, 1998.

http://ieeexplore.ieee.org/iel4/5503/14789/00673334.pdf

A neuro hierarchial approach based on an adaptive back-propagation  algorithm is proposed. In a separate preprocessing step, the input and the  output vector are generated using the encoded-sequence representation. The  algorithm is applied to translate the American sign language from spoken  words. Experimental results indicate that this approach results in fast  convergence, stable learning and a relatively small network size when  compared to traditional methods.

 

 [Abe et al.(1994)]

M. Abe, H. Sakou, and H. Sagawa.

Sign language translation based on syntactic and semantic analysis.

Systems and Computers in Japan, 25, 1994.

 

[Akyol et al.(2000)]

S. Akyol, B. Bauer, and U. Canzler.

Gesture and mimic interpretation for sign language recognition.

In Proceedings of the 4th International Student Conference on  Electrical Engineering, May 2000.

http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/v001_2000.pdf

 

[Aoki et al.(1998)]

Y. Aoki, R. Mitsumori, J. Li, and A. Burger.

Sign language communication between japanese-korean and  japanese-portuguese using cg animation.

In IEEE International Conference on acoustics speech and signal  processing, pages 3765-3768, 1998.

http://ieeexplore.ieee.org/iel4/5518/14898/00679703.pdf

We propose a sign language communication between different languages  such as Japanese-Korean and Japanese-Portuguese using computer graphics (CG)  animation of sign language based on the intelligent image communication  method. For this purpose sign language animation is produced using gesture or  text data expressing sign language. In the production process of CG animation  of sign language, MATLAB and LIFO language are used, where MATLAB is useful  for three-dimensional signal processing of gestures and for displaying  animation of sign language. On the other hand the LIFO language, which is a  descendant of the LISP and FORTH language families, is developed and used to  produce live CG animations, resulting in a high-speed interactive system of  designing and displaying sign language animations. A simple experiment was  conducted to translate Japanese sign language into Korean and Portuguese sign  languages using the developed CG animation system.

 

[Assan and Grobel(1997)]

M. Assan and K. Grobel.

Video-based sign language recognition using hidden markov models.

In Proceedings of the Gesture Workshop, volume 1371 of Lecture  Notes in Artificial Intelligence, pages 97-110. Springer, 1997.

http://link.springer-ny.com/link/service/series/0558/bibs/1371/13710097.htm

This paper is concerned with the video-based recognition of signs.  Concentrating on the manual parameters of sign language, the system aims for  the signer dependent recognition of 262 different signs taken from Sign  Language of the Netherlands . For Hidden Markov Modelling a sign is  considered a doubly stochastic process, represented by an unobservable state  sequence. The observations emitted by the states are regarded as feature  vectors, that are extracted from video frames. This work deals with three  topics: Firstly the recognition of isolated signs, secondly the influence of  variations of the feature vector on the recognition rate and thirdly an  approach for the recognition of connected signs. The system achieves  recognition rates up to 94% for isolated signs and 73% for a reduced  vocabulary of connected signs.

 

[Bauer(2001)]

B. Bauer.

Towards an automatic sign language recognition system using subunits.

In Proceedings of the Gesture Workshop, London, 2001.

http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/bauer_gw01_02.pdf

Draft.

 

[Bauer and Hienz(2000)]

B. Bauer and H. Hienz.

Relevant features for video-based continuous sign language  recognition.

In IEEE International Conference on Automatic Face and Gesture  Recognition, Grenoble, France, 2000.

http://computer.org/proceedings/fg/0580/05800440abs.htm

 

[Bauer et al.(2000)]

B. Bauer, H. Hienz, and K.-F. Kraiss.

Video-based continuous sign language recognition using statistical  methods.

In Proceedings of the International Conference on Pattern  Recognition, volume II, pages 463-466, 2000.

http://www.computer.org/proceedings/icpr/0750/Volume%202/07502463abs.htm

 

[Bauer et al.(1999)]

B. Bauer, S. Niessen, and H. Hienz.

Towards an automatic sign language translation system.

In Proc. of the International Workshop on Physicality and  Tangibility in Interaction: Towards New Paradigms for Interaction Beyond the  Desktop, Siena, Italy, Oct. 1999.

http://www-i6.informatik.rwth-aachen.de/PostScript/InterneArbeiten/Niessen_SignTranslation_Physicality_Workshop_99.ps

 

[Birk et al.(1997)]

H. Birk, T. B. Moeslund, and C. B. Madsen.

Real-time recognition of hand alphabet gestures using principal component analysis.

In Scandinavian Conference on Image Analysis, 1997.

http://www.lut.fi/scia97/papers/paper131.htm

 

[Braffort(1996a)]

A. Braffort.

Argo: An architecture for sign language recognition and interpretation.

In P. Harling and A. Edwards, editors, Proceedings of the Gesture  Workshop, pages 17-30. Springer, 1996.

 

[Braffort(1996b)]

A. Braffort.

A gesture recognition architecture for sign language.

In Second International ACM/SIGCAPH Conference on Assistive  Technologies (ASSETS'96), pages 102-109, British Columbia, Canada,  1996.

http://www.acm.org/pubs/citations/proceedings/assets/228347/p102-braffort/

 

[Braffort(1997)]

A. Braffort.

A computer system dedicated to sign language.

In IEA'97, Tampere, Finland, 1997.

 

[Braffort et al.(1994)]

A. Braffort, C. Collet, and D. Teil.

Anthropomorphic model for hand gesture interface.

In M. Galer, S. Harker, and J. Ziegler, editors, Proceedings of  ACM CHI'94 Conference on Human Factors in Computing Systems, 1994.

 

[Chapman(1998)]

R. N. Chapman.

Lexicon for computer translation of american sign language.

In Applications in Robotics, User Interfaces, and Natural  Language Processing, volume 1458 of Lecture Notes in Computer Science.  Springer, 1998.

http://link.springer-ny.com/link/service/series/0558/bibs/1458/14580033.htm

This work presents a method for translation of American Sign Language  (ASL) to English using a feature-based lexicon, designed to exploit ASL's  phonology by searching the lexicon for the sign's manual and non-manual  information. Manual sign information consists of phonemes sig (movement), tab  (location), dez (handshape), and ori (hand orientation), which we use as the  ASL unit of analysis. Non-manual sign information consists of specific facial  and body configurations. A camera acquires the sign and individual frames are  analyzed and values assigned to the sig, tab, dez, and ori sign parameters as  well as other sign features, for referencing during lexical search. ASL  formational constraints are exploited to target specific image segments for  analysis and linguistic constraints serve to further reduce the lexical  search space. Primary keys for lexical search are sig and tab, the most  discriminating sign features, followed by the remaining features, as  necessary, until a single lexical entry is extracted from the lexicon. If a  single lexical candidate cannot be determined, an exception is raised,  signaling search failure. This method of using ASL phonological constraints  to aid image analysis and lexical search process simplifies the task of sign  identification.

 

[Charayaphan and Marble(1992)]

C. Charayaphan and A. Marble.

An image processing system for interpreting motion in american sign  language.

Journal of Biomedical Engineering, 14 (15):  419-425, 1992.

 

[Cui et al.(1995)]

Y. Cui, D. L. Swets, and J. J. Weng.

Learning-based hand sign recognition using shoslif-m.

In Proceedings of the IEEE International Conference on Computer  Vision, pages 631-636, Boston, MA, June 1995.

http://citeseer.nj.nec.com/cui95learningbased.html

 

[Cui and Weng(1999)]

Y. Cui and J. Weng.

A learning-based prediction-and-verification segmentation scheme for  hand sign image sequence.

IEEE Transactions on Pattern Analysis and Machine Intelligence,  21 (8): 798-804, Aug. 1999.

http://dlib.computer.org/tp/books/tp1999/pdf/i0798.pdf

We present a prediction-and-verification segmentation scheme using  attention images from multiple fixations. A major advantage of this scheme is  that it can handle a large number of different deformable objects presented  in complex backgrounds. The scheme is also relatively efficient. The system  was tested to segment hands in sequences of intensity images, where each  sequence represents a hand sign in American Sign Language. The experimental  result showed a 95 percent correct segmentation rate with a 3 percent false  rejection rate.

 

[Cui and Weng(2000)]

Y. Cui and J. Weng.

Appearance-based hand sign recognition from intensity image  sequences.

Computer and Vision Image Understanding, 78: 157-176,  2000.

http://www.cse.msu.edu/~weng/research/SHOSLIF-M-rec.ps

 

[Cui and Weng(1995)]

Y. Cui and J. J. Weng.

Learning-based hand sign recognition.

In IEEE International Conference on Automatic Face and Gesture  Recognition, pages 201-206, Zurich, Switzerland, June 1995.

 

[Cui and Weng(1996a)]

Y. Cui and J. J. Weng.

Hand segmentation using learning-based prediction and verification  for hand sign recognition.

In IEEE Conference on Computer Vision and Pattern Recognition,  pages 88-93, San Francisco, CA, June 1996.

http://www.cps.msu.edu/~weng/research/HandCVPR96.ps

 

[Cui and Weng(1996b)]

Y. Cui and J. J. Weng.

View-based hand segmentation and hand-sequence recognition with  complex backgrounds.

In Proceedings of the International Conference on Pattern  Recognition, volume III, pages 617-621, Vienna, Austria, Aug.  1996.

http://www.cps.msu.edu/~weng/research/SegICPR96.ps

 

[Dorner(1993)]

B. Dorner.

Hand shape identification and tracking for sign language  interpretation.

In International Joint Conference on Artificial Intelligence,  Chambery, 1993.

 

[Dorner(1994)]

B. Dorner.

Chasing the colour glove: Visual hand tracking.

Master's thesis, Simon Fraser University, 1994.

ftp://fas.sfu.ca/pub/thesis/1994/BrigitteDornerMSc.ps

 

[Dorner and Hagen(1994)]

B. Dorner and E. Hagen.

Towards an american sign language interface.

Artificial Intelligence Review, 8 (2/3), 1994.

http://www.wkap.nl/oasis.htm/61598

 

[Erenshteyn et al.(1995)]

R. Erenshteyn, R. Foulds, L. Messing, G. Stern, and S. Galuska.

Handshape recognition-a step toward computer translation of american  sign language into english.

In WCNN '95. World Congress on Neural Networks, volume 1, pages  216-19, Mahwah, NJ, USA, 1995. Lawrence Erlbaum Associates.

 

[Erenshteyn and Laskov(1996)]

R. Erenshteyn and P. Laskov.

A multi-stage approach to fingerspelling and gesture recognition.

In Proceedings of the Workshop on the Integration of Gesture in  Language and Speech, pages 185-194, Wilmington, DE, 1996.

http://citeseer.nj.nec.com/73836.html

Human gesture analysis and recognition as many others recognition  applications require the development and implementation of multiclass  recognition system. We examine the application of two approaches to  multiclass recognition: a hierarchical (multi-stage) neural network  classifier and error-correcting codes together with collective decision  making (mixture of formal "experts"). Applications that have been solved  include the recognition of fingerspelled letters from American Sign Language  and recognition of static handshapes. These applications are examples of  multiclass problem - 26 and 77 classes respectively. We address the problem  of finding a hierarchy of classifiers, optimal with respect to recognition  accuracy, given a labeled corpus of signing samples. This problem stems from  the necessity of recognizing a large number of classes for which a single  neural network classifier requires prohibitively large training sets and  infeasible training time. We outline the general framework of the problem of  finding an optimal hierarchy and explain its relationship to the problems of  decision tree inference and object clustering. Another approach involves  Golay(23,12) code generation and its use for system outputs encoding. These  codes have to satisfy some initial condi tions, such as minimum inter-code  Hamming distance, number of classes and inter-digit correlation. Code  redundancy digits are used to improve overall recognition accuracy. Coding  has been combined with two-output neural networks that produced output codes.  Collective decision making procedures have been applied. Higher accuracy and  less training time are the main advantage of the developed approach. Both  approaches have been compared on two problems of human gesture recognition.

 

[Erenshteyn et al.(1997)]

R. Erenshteyn, P. Laskov, and R. Foulds.

Human gesture recognition using neural networks and multi-class  encoding.

In Proceedings of ANNIE'97 (Artificial Neural Networks in  Engineering), volume 7, pages 549-554, St.Louis, MO, 1997.

http://www.cis.udel.edu/~laskov/publications/ANNIE-97.ps.gz

Human gesture analysis and recognition as many others recognition  applications require the development and implementation of multiclass  recognition system. We examine the application of two approaches to  multiclass recognition: a hierarchical (multi-stage) neural network  classifier and error-correcting codes together with collective decision  making (mixture of formal "experts"). Applications that have been solved  include the recognition of fingerspelled letters from American Sign Language  and recognition of static handshapes. These applications are examples of  multiclass problem - 26 and 77 classes respectively. We address the problem  of finding a hierarchy of classifiers, optimal with respect to recognition  accuracy, given a labeled corpus of signing samples. This problem stems from  the necessity of recognizing a large number of classes for which a single  neural network classifier requires prohibitively large training sets and  infeasible training time. We outline the general framework of the problem of  finding an optimal hierarchy and explain its relationship to the problems of  decision tree inference and object clustering. Another approach involves  Golay(23,12) code generation and its use for system outputs encoding. These  codes have to satisfy some initial condi tions, such as minimum inter-code  Hamming distance, number of classes and inter-digit correlation. Code  redundancy digits are used to improve overall recognition accuracy. Coding  has been combined with two-output neural networks that produced output codes.  Collective decision making procedures have been applied. Higher accuracy and  less training time are the main advantage of the developed approach. Both  approaches have been compared on two problems of human gesture recognition.

 

[Erenshteyn et al.(1996a)]

R. Erenshteyn, P. Laskov, R. Foulds, L. Messing, and G. Stern.

Recognition approach to gesture language understanding.

In Proceedings of the International Conference on Pattern  Recognition, Vienna, 1996.

http://www.cis.udel.edu/~laskov/publications/ICPR-96.ps.gz

We explore recognition implications of understanding gestural  communication, having chosen American Sign Language as an example of a  gestural language. An instrumented glove and specially developed software  have been used for data collection and labeling. We address the problem of  recognizing dynamic signing, i.e. signing performed at natural speed. Two  neural network architectures have been used for recognition of different  types of fingerspelled sentences. Experimental results are presented  suggesting that two features of signing affect recognition accuracy: signing  frequency which to a large extent can be accounted for by training a network  on the samples of the respective frequency, and coarticulation effect which a  network fails to identify. As a possible solution to coarticulation problem  two post-processing algorithms for temporal segmentation are proposed and  experimentally evaluated.

 

[Erenshteyn et al.(1996b)]

R. Erenshteyn, P. Laskov, R. Foulds, L. Messing, G. Stern, and S. Galuska.

Static and dynamic recognition of fingerspelled sentences.

In World Congress on Neural Networks, International Neural  Network Society 1996 Annual Meeting, pages 368-71, Mahwah, NJ, USA,  1996. Lawrence Erlbaum Assoc.

http://www.cis.udel.edu/~laskov/publications/ANNIE-97.ps.gz

 

[Erenshteyn et al.(1994)]

R. Erenshteyn, L. Messing, R. A. Foulds, G. Stern, and S. Galuska.

Back propagation neural network for american sign language  recognition.

In P. Werbos, H. Szu, and B. Widrow, editors, World Congress on  Neural Networks, volume I, pages 405-409, San Diego, CA, USA, 1994.  Erlbaum.

 

[Fang et al.(2001a)]

G. Fang, W. Gao, X. Chen, C. Wang, and J. Ma.

Signer-independent continuous sign language recognition based on  srn/hmm.

In Proceedings of the Gesture Workshop, London, Apr.  2001.

http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/SRNpaper_10.pdf

Draft.

 

[Fang et al.(2001b)]

G. Fang, W. Gao, and J. Ma.

Signer-independent sign language recognition based on sofm/hmm.

In IEEE ICCV Workshop on Recognition, Analysis, and Tracking of  Faces and Gestures in Real-Time Systems, pages 90-95, 2001.

http://ieeexplore.ieee.org/iel5/7480/20325/00938915.pdf

The aim of sign language recognition is to provide an efficient and  accurate mechanism to transcribe sign language into text or speech.  State-of-the-art sign language recognition should be able to solve the  signer-independent problem for practical application. In this paper, a hybrid  SOFM/HMM system, which combines self-organizing feature maps (SOFMs) with  hidden Markov models (HMMs), is presented for signer-independent Chinese sign  language recognition. We implement the SOFM/HMM sign recognition system.  Meanwhile, results from the HMM-based system are provided as comparison.  Experimental results show the SOFM/HMM system increases the recognition  accuracy by 5% than the HMM-based one. Furthermore, a self-adjusting  recognition algorithm is also proposed for improving the SOFM/HMM  discrimination. When it is applied to the SOFM/HMM system it can improve the  recognition accuracy by 1.9%. All experiments were performed in real-time  with the dictionary size 208.

 

[Fujishige and Kurokawa(1997)]

E. Fujishige and T. Kurokawa.

Japanese processing for japanese-to-sign language translation via semantic network.

Human interface news and report, 12: 45-50, 1997.

 

[Gao et al.(2000a)]

W. Gao, J. Ma, S. Shan, X. Chen, W. Zheng, H. Zhang, J. Yan, and J. Wu.

Handtalker: A multimodal dialog system using sign language and 3-d  virtual human.

In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third  International Conference on Advances in Multimodal Interfaces, volume 1948  of Lecture Notes in Computer Science, pages 582-589, Oct.  2000.

http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480564.htm

In this paper, we describe HandTalker: a system we designed for  making friendly communication reality between deaf people and normal hearing  society. The system consists of GTS (Gesture/Sign language To Spoken  language) part and STG (Spoken language To Gesture/Sign language) part. GTS  is based on the technology of sign language recognition, and STG is based on  3D virtual human synthesis. Integration of the sign language recognition and  3D virtual human techniques greatly improves the system performance. The  computer interface for deaf people is data-glove, camera and computer  display, and the interface for hearing-abled is microphone, keyboard, and  display. HandTalker now can support no domain limited and continuously  communication between deaf and hearing-abled Chinese people.

 

[Gao et al.(2000b)]

W. Gao, J. Ma, J. Wu, and C. Wang.

Sign language recognition based on hmm/ann/dp.

International Journal of Pattern Recognition and Artificial  Intelligence, 14 (5), Aug. 2000.

http://www.wspc.com.sg/profiles/anncat/anncat/jnlarticle/ijpraiv14n5/S0218001400000386.pdf

In this paper, a system designed for helping the deaf to communicate  with others is presented. Some useful new ideas are proposed in design and  implementation. An algorithm based on geometrical analysis for the purpose of  extracting invariant feature to signer position is presented. An ANN-DP  combined approach is employed for segmenting subwords automatically from the  data stream of sign signals. To tackle the epenthesis movement problem, a  DP-based method has been used to obtain the context-dependent models. Some  techniques for system implementation are also given, including fast matching,  frame prediction and search algorithms. The implemented system is able to  recognize continuous large vocabulary Chinese Sign Language. Experiments show  that proposed techniques in this paper are efficient on either recognition  speed or recognition performance.

 

[Gibet et al.(1996)]

S. Gibet, A. Braffort, C. Collet, F. Forest, R. Gherbi, and T. Lebourque.

Gesture in human-machine communication: Capture, analysis-synthesis,  recognition, semantics.

In P. Harling and A. Edwards, editors, Proceedings of the Gesture  Workshop, pages 89-95. Springer, 1996.

 

[Gibet et al.(1997)]

S. Gibet, J. Richardson, T. Lebourque, and A. Braffort.

Corpus of 3d natural movements and sign language primitives of  movement.

In I. Wachsmuth and M. Fröhlich, editors, Proceedings of the Gesture Workshop, volume 1371 of Lecture Notes in Computer Science,  pages 111-121, Bielefeld, Germany, 1997. Springer.

This paper describes the development of a corpus or database of  hand-arm pointing gestures, considered as a basic element for gestural  communication. The structure of the corpus is defined for natural pointing  movements carried out in different directions, heights and amplitudes. It is  then extended to movement primitives habitually used in sign language  communication. The corpus is based on movements recorded using an  optoelectronic recording system that allows the 3D description of movement  trajectories in space. The main technical characteristics of the capture and  pretreatment system are presented, and perspectives are highlighted for  recognition and generation purposes.

 

[Grobel(1994)]

K. Grobel.

Recognition of fingerspelling from video.

In Proceedings of the Sixth Biennial Conference of the  International Society for Augmentative and Alternative Communication, pages  282-286, 1994.

 

[Grobel and Assan(1997)]

K. Grobel and M. Assan.

Isolated sign language recognition using hidden markov models.

In IEEE International Conference on Systems, Man &  Cybernetics, pages 162-167, Piscataway, NJ, 1997.

http://ieeexplore.ieee.org/iel3/4942/13619/00625742.pdf

This paper is concerned with the video-based recognition of isolated  signs. Concentrating on the manual parameters of sign language, the system  aims for the signer dependent recognition of 262 different signs. For hidden  Markov modelling a sign is considered a doubly stochastic process,  represented by an unobservable state sequence. The observations emitted by  the states are regarded as feature vectors, that are extracted from video  frames. The system achieves recognition rates up to 94%.

 

[Grobel and Hienz(1996a)]

K. Grobel and H. Hienz.

Fuzzy video-based handshape recognition.

In K. e. a. George, editor, Proceedings of the 1996 ACM  Symposium on Applied Computing, pages 614-618, Philadelphia, Feb.  1996.

 

[Grobel and Hienz(1996b)]

K. Grobel and H. Hienz.

Video-based handshape recognition using a handshape structure model  in real-time.

In Proceedings of the International Conference on Pattern  Recognition, pages 446-450, Wien, 1996.

 

[Grobel and Hienz(1996c)]

K. Grobel and H. Hienz.

Video-based recognition of fingerspelling in real-time.

In T. Lehmann, I. Scholl, and K. Spitzer, editors, Proceedings  des Aachener Workshops am Bildverarbeitung für die Medizin. Algorithmen.  Systeme. Anwendungen, pages 197-202, 1996.

http://citeseer.nj.nec.com/118789.html

 

[Hagen(1994)]

E. Hagen.

A flexible american sign language interface to deductive databases.

Master's thesis, Computer Science, Simon Fraser University, 1994.

ftp://fas.sfu.cs/pub/theses/1994/EliHagenMSc.ps

 

[Hamilton and Micheli-Tzanakou(1994)]

J. Hamilton and E. Micheli-Tzanakou.

Alopex neural networks for manual alphabet recognition.

In Proceedings of the 16th Annual International Conference of  the IEEE Engineering in Medicine and Biology Society, 1994.

http://ieeexplore.ieee.org/iel4/3230/9234/00415347.pdf

Alopex and backpropagation were used to train neural networks to  recognize signs from the American Manual Alphabet. In many cases, the  resulting networks gave comparable performance. The Alopex optimization  technique did not converge to low error percentages during training as well  as backpropagation did; backpropagation gave poorer performance on networks  with a small number of hidden nodes.

 

[Harrison(1982)]

R. Harrison.

Computer representation of deaf sign language.

In D. S. Greenaway and E. Warman, editors, Eurographics '82 :  proceedings of the international conference and exhibition, pages 303-308,  Manchester, UK, 1982. North-Holland Publ. Co.

 

[Hienz and Bauer(1999)]

H. Hienz and B. Bauer.

Sign language recognition based on statistical methods.

In Bi-Annual Report - Department of Technical Computer Science  1997/1998, pages 34-37. Shaker, 1999.

 

[Hienz and Grobel(1998)]

H. Hienz and K. Grobel.

An automatic video-based sign language recognition system as part of  a sign printing system.

In P. Kopacek, editor, IEEE International Conference on  Intelligent Engineering Systems, pages 163-168, 1998.

 

[Hienz et al.(1996)]

H. Hienz, K. Grobel, and G. Beckers.

Video-based handshape recognition using artificial neural networks.

In H.-J. Zimmermann, editor, Fourth European Congress on  Intelligent Techniques and Soft Computing, pages 1659-1663, Aachen,  Germany, Sept. 1996. Wissenschaftsverlag.

 

[Hienz and Kraiss(1999)]

H. Hienz and K.-F. Kraiss.

Hmm-based continuous sign language recognition using stochastic  grammars.

In Proceedings of the Gesture Workshop, volume 1739 of Lecture  Notes in Artificial Intelligence, Gif-sur-Yvette, France, Mar. 1999.  Springer.

http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/V001_1999.pdf

 

[Hienz et al.(1999)]

H. Hienz, K.-F. Kraiss, and B. Bauer.

Continuous sign language recognition using hidden markov models.

In Y. Tang, editor, The Second International Conference on  Multimodal Interface, pages IV10--IV15, Hong Kong, China, 1999.

http://www.techinfo.rwth-aachen.de/Veroeffentlichungen/V002_1999.pdf

 

[Holden and Roy(1992)]

E. Holden and G. Roy.

The graphical translation of english text into signed english in the  hand sign translator system.

Computer Graphics Forum (Eurographics'92), 11  (3): C357--C366, 1992.

http://www.cs.uwa.edu.au/~eunjung/mypubs/GForumpaper.ps.gz

Signed English is a manual interpretation of English using  fingerspelling and signs. A prototype of the Hand Sign Translator (HST)  system was developed to graphically translate English into Signed English,  using two-handed animation. The HST consists of a practical interface that  aims to help users learn Signed English, and the translation process where  English text is transformed into a series of images that represent  corresponding signs. This paper describes the translation process which  involves two stages; the input environment and the animation process. The  input environment consists of text analysis in order to extract corresponding  kinematic data from the database, named English-Sign ...

 

[Holden(1991)]

E. J. Holden.

Graphical representation of hand movement as in deaf sign language:  The hand sign translator system.

Master's thesis, University of Western Australia, 1991.

 

[Holden(1997)]

E.-J. Holden.

Visual Recognition of Hand Motion.

PhD thesis, University of Western Australia, 1997.

http://www.cs.uwa.edu.au/~eunjung/mypubs/PhDThesis.ps.gz

 

[Holden and Owens(2001)]

E. J. Holden and R. Owens.

Visual sign language recognition.

In R. Klette, T. Huang, and G. Gimen'garb, editors, 10th  International Workshop on Theoretical Foundations of Computer Vision, volume  2032 of Lecture Notes in Computer Science. Springer, Mar. 2001.

http://link.springer.de/link/service/series/0558/bibs/2032/20320270.htm

Also in Proceedings of DICTA'99 (Digital Image Computing: Techniques  & Applications), pp. 275-279.

Automatic gesture recognition systems generally require two separate  processes: a motion sensing process where some motion features are extracted  from the visual input; and a classification process where the features are  recognised as gestures. We have developed the Hand Motion Understanding (HMU)  system that uses the combination of a 3D model-based hand tracker for motion  sensing and an adaptive fuzzy expert system for motion classification. The  HMU system understands static and dynamic hand signs of the Australian Sign  Language (Auslan). This paper presents the hand tracker that extracts 3D hand  configuration data with 21 degrees-of-freedom (DOFs) from a 2D image sequence  that is captured from a single viewpoint, with the aid of a colour-coded  glove. Then the temporal sequence of 3D hand configurations detected by the  tracker is recognised as a sign by an adaptive fuzzy expert system. The HMU  system was evaluated with 22 static and dynamic signs. Before training the  HMU system achieved 91% recognition, and after training it achieved over 95%  recognition.

 

[Holden et al.(1999a)]

E.-J. Holden, R. Owens, and G. G. Roy.

3d hand tracker for visual sign recognition.

Technical Report 99/2, Department of Computer Science, The University  of Western Australia, 1999.

http://citeseer.nj.nec.com/263562.html

 

[Holden et al.(1999b)]

E.-J. Holden, R. Owens, and G. G. Roy.

Adaptive fuzzy expert system for sign recognition.

Technical Report 99/3, Department of Computer Science, The University  of Western Australia, 1999.

http://citeseer.nj.nec.com/holden99adaptive.html

 

[Holden et al.(1996)]

E. J. Holden, G. G. Roy, and R. Owens.

Hand movement classification using an adaptive fuzzy expert system.

International Journal of Expert Systems, 9 (4): 465-480, 1996.

http://www.cs.uwa.edu.au/~eunjung/mypubs/IJESpaper.ps.gz

 

[Huang and Huang(1998)]

C. Huang and W. Huang.

Sign language recognition using model-based tracking and a 3d  hopfield neural-network.

Machine Vision and Applications, 10 (5/6):  292-307, 1998.

http://link.springer.de/link/service/journals/00138/bibs/8010005/80100292.htm

 

[Huang et al.(1992)]

C.-L. Huang, C. L. Cheng, and L. L. Pau.

Chinese sign language interpretation through motion and shape  analysis.

In V. Srinivasa, O. S. Heng, and A. Y. Hock, editors,  Proceedings of the 2nd Singapore International Conference on Image  Processing, pages 576-580, 1992.

 

[Huang et al.(1995)]

C.-L. Huang, W.-Y. Huang, and C.-C. Lien.

Sign language recognition using 3-d hopfield neural network.

In Proceedings International Conference on Image Processing,  pages 611-614, 1995.

http://ieeexplore.ieee.org/iel3/4052/11607/00537553.pdf

This paper presents a sign language recognition system which consists  of three modules: model-based hand tracking, feature extraction, and gesture  recognition using a 3-D Hopfield neural network. In the experiments, we  illustrate that this system can recognize 15 different gestures accurately.

 

[Imagawa et al.(1997)]

K. Imagawa, S. Lu, and S. Igi.

Tracking hands by skin color from sign language image sequences.

In Proc. of 54th IPSJ Annual Conference, pages 249-250, 1997.

 

[Imagawa et al.(1998a)]

K. Imagawa, S. Lu, and S. Igi.

Color-based hands tracking system for sign language recognition.

In IEEE International Conference on Automatic Face and Gesture  Recognition, pages 462-467, 1998.

http://dlib.computer.org/conferen/fg/8344/pdf/83440462.pdf

 

[Imagawa et al.(1998b)]

K. Imagawa, S. Lu, and S. Igi.

Real-time tracking of human hands from a sign-language image  sequence.

In The Third Asian Conference on Computer Vision '98, 1998.,  1998.

 

[Imagawa et al.(2000)]

K. Imagawa, H. Matsuo, R. Taniguchi, D. Arita, S. Lu, and S. Igi.

Recognition of local features for camera-based sign language  recognition system.

In Proceedings of the International Conference on Pattern  Recognition, pages 849-853, 2000.

http://www.computer.org/proceedings/icpr/0750/Volume  4/07504849abs.htm

 

[Kadous(1995)]

M. W. Kadous.

Grasp - recognition of australian sign language using instrumented  gloves.

Bachelor's thesis, University of New South Wales, Oct. 1995.

http://www.cse.unsw.edu.au/~waleed/thesis/

 

[Kadous(1996)]

M. W. Kadous.

Machine recognition of auslan signs using powergloves: Towards  large-lexicon recognition of sign language.

In L. Messing, editor, Proceedings of WIGLS. The Workshop on the  Integration of Gesture in Language and Speech, pages 165-174, Oct. 1996.

http://www.cse.unsw.edu.au/~waleed/paper-wigls/paper-wigls.html

 

[Kadous(1998a)]

M. W. Kadous.

Auslan sign recognition using computers and gloves.

In Deaf Studies Research Symposium, 1998.

http://www.cse.unsw.edu.au/~waleed/phd/auslrec2.ps

 

[Kadous(1998b)]

M. W. Kadous.

A general architecture for supervised classification of multivariate  time series.

Technical Report UNSW-CSE-TR-9806, Department of Artificial  Intelligence, School of Computer Science & Engineering, University of New  South Wales, Sept. 1998.

ftp://ftp.cse.unsw.edu.au/pub/users/waleed/tr9806.ps.gz

 

[Kadous(1999)]

M. W. Kadous.

Learning comprehensible descriptions of multivariate time series.

In Proceedings of the International Conference on Machine  Learning, pages 454-463. Morgan Kaufmann, San Francisco, CA, 1999.

http://citeseer.nj.nec.com/kadous99learning.html

 

[Kadous(2002a)]

M. W. Kadous.

Expanding the scope of concept learning using metafeatures. Submitted  paper.

In Proceedings of the International Conference on Machine  Learning, 2002.

http://www.cse.unsw.edu.au/~waleed/phd/paper.pdf

We present a general automated preprocessing technique called  metafeatures. Using metafeatures, the scope of traditional propositional  attribute-value learning is expanded to domains that do not normally fit in  the propositional model. These are domains that contain instances that have  some kind of recurring substructure, such as strokes in handwriting  recognition, or local maxima in time series data. Metafeatures are applied to  three domains: sign language recognition, ECG classification and Chinese  handwriting recognition. Using metafeatures we are able to generate  classifiers that are both comprehensible and accurate, producing results that  are comparable to hand-crafted feature extraction and in one case comparable  to human experts.

 

[Kadous(2002b)]

M. W. Kadous.

Temporal Classification: Extending the Classifiation Paradigm to  Multivariate Time Series.

PhD thesis, The University of New South Wales, School of Computer  Science and Engineering, Jan. 2002.

http://www.cse.unsw.edu.au/~waleed/phd/phd.pdf

 

[Kamata et al.(1989)]

K. Kamata, T. Yoshida, M. Watanabe, and Y. Usui.

An approach to japanese-sign language translation system.

In IEEE International Conference on Systems, Man and  Cybernetics, volume 3, pages 1089-1090, 1989.

http://ieeexplore.ieee.org/iel2/861/2458/00071466.pdf

A method was developed for translating written (spoken) language  (text in Japanese) into a sequence of sign words of Japanese sign language  called Simultaneous Japanese Sign Language (SJSL). The Japanese-to-Sign  translation system is described that uses a Japanese-to-Sign translation  dictionary with about 250 signs.

 

[Kang and Park()]

S. H. Kang and S. H. Park.

Toward korean text-to-sign language translation system (test).

http://citeseer.nj.nec.com/34981.html

 

[Kim et al.(1999a)]

J. Kim, A. Hasimoto, Y. Aoki, and A. Burger.

Design of a sign-language translation system between the  japanese-korean by java-lifo language.

In Proceedings of the IEEE Region 10 Conference TENCON 99,  volume 1, pages 423-426, 1999.

 

[Kim et al.(1996)]

J.-S. Kim, W. Jang, and Z. Bien.

A dynamic gesture recognition system for the korean sign language  (ksl).

IEEE Transactions on Systems, Man and Cybernetics, Part B,  26 (2): 354-359, Apr. 1996.

http://ieeexplore.ieee.org/iel1/3477/10374/00485888.pdf

The sign language is a method of communication for the deaf-mute.  Articulated gestures and postures of hands and fingers are commonly used for  the sign language. This paper presents a system which recognizes the Korean  sign language (KSL) and translates into a normal Korean text. A pair of  data-gloves are used as the sensing device for detecting motions of hands and  fingers. For efficient recognition of gestures and postures, a technique of  efficient classification of motions is proposed and a fuzzy min-max neural  network is adopted for on-line pattern recognition.

 

[Kim et al.(1999b)]

S.-W. Kim, J.-W. Lee, J.-Y. Oh, and Y. Aoki.

Development of a sign-language communication system between korea and  japan through a 3d character model on internet.

In International Conference on Image Processing,  1999.

 

[Klima and Bellugi(1979)]

E. Klima and U. Bellugi.

The Signs of Language.

Harvard University Press, 1979.

 

[Kurokawa et al.(1993)]

T. Kurokawa, T. Morichi, and S. Watanabe.

Bidirectional translation between sign language and japanese for  communication with deaf-mute people.

In G. Salvendy and M. J. Smith, editors, Human-Computer  Interaction: A: Applications & Case Studies; B: Software & Hardware  Interfaces., pages 1109-1114. Elsevier, 1993.

 

[Lamar(2001)]

M. Lamar.

Hand Gesture Recognition using T-CombNET - A Neural Network  dedicated to Temporal Information Processing.

PhD thesis, Nagoya Institute of Technology, Japan, Mar. 2001.

http://www.eletrica.ufpr.br/lamar/public/mvlamar.pdf

 

[Lamar et al.(1998)]

M. Lamar, M. Bhuiyan, and A. Iwata.

From hand sign to japanese hiragana alphabet recognition using  principal component analysis and neural networks.

In Proc. of 12th Conference of the Japan Biomedical Society,  pages 162-165, Niigata, Japan, 1998.

http://www.eletrica.ufpr.br/lamar/public/cjbs98.pdf

 

[Lamar et al.(1999a)]

M. Lamar, M. Bhuiyan, and A. Iwata.

Hand alphabet recognition using principal component analysis and  neural networks.

In Proc. of International Joint Conference on Neural Networks,  volume 4, pages 2838-2844, Washington, USA, July 1999.

http://www.eletrica.ufpr.br/lamar/public/ijcnn99.pdf

 

[Lamar et al.(1999b)]

M. Lamar, M. Bhuiyan, and A. Iwata.

Hand gesture recognition using morphological principal component  analysis and an improved combnet-ii.

In Proc. of IEEE International Conference on System, Man, and  Cybernetics, volume IV, pages 57-62, Tokyo, Japan, Oct. 1999.

http://www.eletrica.ufpr.br/lamar/public/smc99.pdf

 

[Lamar et al.(1999c)]

M. Lamar, M. Bhuiyan, and A. Iwata.

Japanese finger spelling recognition using t-combnet: A new neural  network model.

In Proc. of 13th Conference of the Japan Biomedical Society,  page 103, Osaka, Japan, Oct. 1999.

http://www.eletrica.ufpr.br/lamar/public/cjbs99.pdf

 

[Lamar et al.(1999d)]

M. Lamar, M. Bhuiyan, and A. Iwata.

Temporal series recognition using a new neural network structure  t-combnet.

In Proc. of IEEE International Conference on Neural Information  Processing, volume III, pages 1112-117, Perth, Australia, Nov.  1999.

http://www.eletrica.ufpr.br/lamar/public/iconip99.pdf

 

[Lamar et al.(2000a)]

M. Lamar, M. Bhuiyan, and A. Iwata.

Hand gesture recognition using t-combnet: A new neural network model.

IEICE Trans. on Information and Systems, E83-D  (11): 1986-1995, Nov. 2000.

http://www.eletrica.ufpr.br/lamar/public/e83-d_11_1986.pdf

 

[Lamar et al.(2000b)]

M. Lamar, M. S. Bhuiyan, and A. Iwata.

T-combnet - a neural network dedicated to hand gesture recognition.

In S.-W. Lee, H. Buelthoff, and T. Poggio, editors, Proceedings  First IEEE International Workshop, volume 1811 of Lecture Notes in  Computer Science, pages 613-622, Seoul, Korea, May 2000.

http://www.eletrica.ufpr.br/lamar/public/bmcv2000.pdf

 

[Lamar et al.(2001)]

M. Lamar, S. Wysoski, and A. Iwata.

User independent japanese kana finger spelling translation using  neural networks.

In 40th Conference of the Japan Society of Medical Electronics  and Biological Engineering, Nagoya, Japan, 2001.

http://www.eletrica.ufpr.br/lamar/public/ME2001_4451.pdf

 

[Lee et al.(1997)]

C.-S. Lee, Z. Bien, G.-T. Park, W. Jang, J.-S. Kim, and S.-K. Kim.

Real-time recognition system of korean sign language based on  elementary components.

In IEEE International Conference on Fuzzy Systems Proceedings,  pages 1463-1468, 1997.

http://ieeexplore.ieee.org/iel3/4864/13484/00619759.pdf

Sign language is a method of communication for deaf persons. In  communication using hand gesture, sign words and manual alphabets are used  together. In this paper a system is proposed, which recognizes Korean sign  Language (KSL). KSL is composed of Korean sign words and Korean manual  alphabets continuously. To recognize meanings of continuous gestures which  have no token of beginning and end, this system segments current motion  states using speed and change of speed in motions and state automata. To  understand the meaning of a gesture, basic component classifiers using fuzzy  min-max neural network and fuzzy logic are used. Basic elements of meaning  used in this system are 14 hand directions, 23 hand postures, and 14 hand  orientations. Meaning of signed gesture is interpreted using basic elements  which were recognized by 3 classifiers. This system recognizes 31 Korean  manual alphabets and 131 Korean signs in real-time with recognition rate  96.7% for Korean manual alphabets and 94.3% for Korean sign words,  excluding no recognition case.

 

[Lee and Kunii(1992)]

J. Lee and T. L. Kunii.

Visual translation: From native language to sign language.

In Proc. of the 1992 IEEE Workshop on Visual Languages, pages  103-109, Seattle, WA, 1992.

 

[Lee and Kunii(1993)]

J. Lee and T. L. Kunii.

Computer animated visual translation from natural language to sign  language.

The Journal of Visualization and Computer Animation, 4  (2): 63-78, April - June 1993.

 

[Lee and T.L.(1994)]

J. Lee and K. T.L.

Generation and recognition of sign language using graphic models.

In H. Yamada, Y. Kambayashi, and S. Ohta, editors, Proceedings  of the IISF/ACM Japan International Symposium. Computers as our Better  Partners, pages 96-103, Tokyo, Japan, 1994. World Scientific.

 

[Lejeune et al.(2001)]

F. Lejeune, A. Braffort, and J.-P. Desclés.

Study on semantic representations of french sign language sentences.

In Proceedings of the Gesture Workshop, London, Apr. 2001.

http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/GW%20Lejeune.pdf

Draft

 

[Liang and Ouhyoung(1997)]

R. Liang and M. Ouhyoung.

A real-time continuous gesture interface for taiwanese sign language.

Submitted to UIST, 1997.

http://www.cmlab.csie.ntu.edu.tw/~f1506028/publication/uist97.ps.gz

 

[Liang and Ouhyoung(1998)]

R. Liang and M. Ouhyoung.

A real-time continuous gesture recognition system for sign language.

In IEEE International Conference on Automatic Face and Gesture  Recognition, pages 558-567, 1998.

http://ieeexplore.ieee.org/iel4/5501/14786/00671007.pdf

A large vocabulary sign language interpreter is presented with  real-time continuous gesture recognition of sign language using a data glove.  Sign language, which is usually known as a set of natural language with  formal semantic definitions and syntactic rules, is a large set of hand  gestures that are daily used to communicate with the hearing impaired. The  most critical problem, end-point detection in a stream of gesture input is  first solved and then statistical analysis is done according to four  parameters in a gesture: posture, position, orientation, and motion. The  authors have implemented a prototype system with a lexicon of 250  vocabularies and collected 196 training sentences in Taiwanese Sign Language  (TWL). This system uses hidden Markov models (HMMs) for 51 fundamental  postures, 6 orientations, and 8 motion primitives. In a signer-dependent way,  a sentence of gestures based on these vocabularies can be continuously  recognized in real-time and the average recognition rate is 80.4%.

 

[Liang and Ouhyoung(1995)]

R.-H. Liang and M. Ouhyoung.

A real-time continuous alphabetic sign language to speech conversion  vr system.

Computer Graphics Forum - EUROGRAPHICS '95, 14  (3): 67-76, 1995.

 

[Liang and Ouhyoung(1996)]

R.-H. Liang and M. Ouhyoung.

A sign language recognition system using hidden markov model and  context sensive search.

In Proc. of the ACM Symposium on Virtual Reality and Software  Technology, pages 59-66, Hong Kong, July 1996.

http://www.cmlab.csie.ntu.edu.tw/~f1506028/publication/vrst96.ps.gz

 

[Lu et al.(1996)]

S. Lu, S. Igi, and H. Matsuo.

A jsl conversational character system with nonverbal information - recognition of jsl, detection of gazing line and user position.

In Proc. of 2nd Symposium on Intelligent Information Media, pages 69-74, Dec. 1996.

 

[Lu et al.(1997a)]

S. Lu, S. Igi, and H. Matsuo.

Development of sign language-voice conversation support system.

Human with Technology,  (15): 85-94, Dec.  1997.

 

[Lu et al.(1997b)]

S. Lu, S. Igi, H. Matsuo, and Y. Nagashima.

Towards a dialogue system based on recognition and synthesis of  japanese sign language.

In I. Wachsmuth and M. Fröhlich, editors, Proceedings of  Gesture Workshop, number 1371 in Lecture Notes in Computer Science,  Bielefeld, Germany, 1997. Springer.

http://link.springer-ny.com/link/service/series/0558/bibs/1371/13710259.htm

 

[Lu et al.(1997c)]

S. Lu, K. Imagawa, and S. Igi.

An active gazing-line generation system for improving sign-language conversation.

In Proceedings of the Seventh International Conference on Human-Computer Interaction (HCI International '97), volume 2, pages 283-6, Amsterdam, Netherlands, 1997. Elsevier.

 

[Lu et al.(1997d)]

S. Lu, K. Imagawa, and S. Igi.

An gazing-line generation system for improving sign language  conversation.

In Proc. of 7th International Conference on Human-Computer  Interaction, volume 2, pages 283-286, California, USA, Aug.  1997.

 

[Ma et al.(2000a)]

J. Ma, W. Gao, and R. Wang.

A parallel multistream model for integration of sign language  recognition and lip motion.

In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third  International Conference on Advances in Multimodal Interfaces, volume 1948  of Lecture Notes in Computer Science, pages 582-589, Oct.  2000.

http://link.springer.de/link/service/series/0558/bibs/1948/19480582.htm

 

[Ma et al.(2000b)]

J. Ma, W. Gao, J. Wu, and C. Wang.

A continuous chinese sign language recognition system.

In IEEE International Conference on Automatic Face and Gesture  Recognition, 2000.

http://computer.org/proceedings/fg/0580/05800428abs.htm

In this paper, we describe a system for recognizing both the isolated  and continuous Chinese Sign Language (CSL) using two Cybergloves and two  3SAPCE-position trackers as gesture input devices. To get robust gesture  features, each joint-angle collected by Cybergloves is normalized. The  relative position and orientation of the left hand to those of the right hand  are proposed as the signer position independent features. To speed up the  recognition process, a fast match and a frame predicting techniques are  proposed. To tackle epenthesis movement problem, context-dependent models are  obtained by the Dynamic Programming (DP) technique. HMMs are utilized to  model basic word units. Then we describe training techniques of the bigram  language model and the search algorithm used in our baseline system. The  baseline system converts sentence level gestures into synthesis speech and  gestures of 3D virtual human synchronously. Experiments show that these  techniques are efficient both in recognition speed and recognition  performance.

 

[Matsuo et al.(1997)]

H. Matsuo, S. Igi, S. Lu, Y. Nagashima, Y. Takata, and T. Teshima.

The recognition algorithm with non-contact for japanese sign language  using morphological analysis.

In I. Wachsmuth and M. Fröhlich, editors, Proceedings of  Gesture Workshop, number 1371 in Lecture Notes in Computer Science,  Bielefeld, Germany, 1997. Springer.

http://link.springer.de/link/service/series/0558/bibs/1371/13710273.htm

This paper documents the recognition method of deciphering Japanese  sign language(JSL) using projected images. The goal of the movement  recognition is to foster communication between hearing impaired and people  capable of normal speech. We uses a stereo camera for recording  three-dimensional movements, a image processing board for tracking movements,  and a personal computer for an image processor charting the recognition of  JSL patterns. This system works by formalizing the space area of the signers  according to the characteristics of the human body, determining components  such as location and movements, and then recognizing sign language patterns.  The system is able to recognize JSL by determining the extent of similarities  in the sign field, and does so even when vibrations in hand movements occur  and when there are differences in body build. We obtained useful results from  recognition experiments in 38 different JSL in two signers.

 

[Messing et al.(1994)]

L. Messing, R. Erenshteyn, R. Foulds, S. Galuska, and G. Stern.

American sign language computer recognition: Its present and its  promise.

In Conf. of the Intl. Society for Augmentative and Alternative  Communication, pages 289-291, 1994.

 

[Ohira et al.(1995)]

E. Ohira, H. Sagawa, T. Sakiyama, and M. Ohki.

A segmentation method for sign language recognition.

IEICE Transactions on Information and Systems, E78-D  (1): 49-57, 1995.

 

[Ohki et al.(1995)]

M. Ohki, H. Sagawa, and N. Hataoka.

Sign language translation system using pattern recognition and  synthesis.

Hitachi Review, 44 (4), 1995.

 

[Ohki et al.(1994)]

M. Ohki, H. Sagawa, T. Sakiyama, E. Oohira, H. Ikeda, and H. Fujisawa.

Pattern recognition and synthesis for sign language translation  system.

In First Annual ACM Conference on Assistive Technologies,  Hearing Impairments, pages 1-8, 1994.

http://www.acm.org/pubs/citations/proceedings/assets/191028/p1-ohki/

 

[Pashaloudi(2001)]

V. N. Pashaloudi.

On greek sign language alphabet character recognition: Using  back-propagation neural networks.

In Proccedings of the 5th Hellenic European Research on Computer  Mathematics & its Applications (HERCMA) Conference, Athens, Greece, Sept.  2001.

 

[Prime(1995)]

M. Prime.

Integrating sign language into a virtual reality environments.

In IEE Colloquium on Visualisation of Three-Dimensional Fields,  pages 5/1-5/4, 1995.

http://ieeexplore.ieee.org/iel3/3672/10797/00498895.pdf

In a multi-user distributed virtual reality environment, such as DIVE  (Distributed Interactive Virtual Environment) where the various participants  each have a 3-D representation, a model for interaction is necessary. The  Spatial Interaction Model describes how objects should interact with each  other in the virtual environment. This model has since been extended to cover  more interactions and is described in the paper. Language and gesture can be  coordinated to form a single communication system more powerful than either  alone. Hand gesture in particular is critical for fast interactive human to  human communication. A VR system that allows a sign language channel and thus  also a gestural channel will have an enriched communication medium. The main  concept of this paper is that in a virtual reality environment it is possible  to support users with different capabilities for interaction. The factors  considered are whether the user can hear or not; their ability to use sign  language; and the technology they have at their disposal on entry into the VR  environment. A user is thus represented in the virtual environment as an  entity with those capabilities available to it.

 

[Roberts(1994)]

G. D. Roberts.

Statistical pettern classification in computer recognition of sign  language.

Master's thesis, Department of Computer Science, Royal Melbourne  Institute of Technology, May 1994.

 

[Safar and Marshall(2001)]

E. Safar and I. Marshall.

The architecture of an english-text-to-sign-languages translation  system.

In G. A. et al, editor, Recent Advances in Natural Language  Processing (RANLP), pages 223-228, Tzigov Chark Bulgaria, 2001.

http://www.visicast.sys.uea.ac.uk/Papers/confbulgarianew.pdf

 

[Sagawa et al.(1994)]

H. Sagawa, T. Sakiyama, E. Oohira, H. Sakou, and M. Abe.

Prototype sign language translation system.

In Proceedings of the IISF/ACM Japan International Symposium.  Computers as our Better Partners, pages 152-3, Singapore, 1994. World  Scientific.

 

[Sagawa and Takeuchi(1999)]

Hirohiko Sagawa and Masaru Takeuchi.

A method for analyzing spatial relationships between words in sign  language recognition.

In Proceedings of the Gesture Workshop, volume 1739 of Lecture  Notes in Computer Science, pages 197-209, 1999.

http://link.springer-ny.com/link/service/series/0558/bibs/1739/17390197.htm

There are expressions using spatial relationships in sign language  that are called directional verbs. To understand a sign-language sentence  that includes a directional verb, it is necessary to analyze the spatial  relationship between the recognized sign-language words and to find the  proper combination of a directional verb and the sign-language words related  to it. In this paper, we propose an analysis method for evaluating the  spatial relationship between a directional verb and other sign-language words  according to the distribution of the parameters representing the spatial  relationship.

 

[Sagawa and Takeuchi(2000a)]

H. Sagawa and M. Takeuchi.

Development of an information kiosk with a sign language recognition  system.

In Proceedings of the 2000 International Conference on  Intelligent User Interfaces, Posters, pages 149-150, 2000.

http://www.acm.org/pubs/articles/proceedings/chi/355460/p149-sagawa/p149-sagawa.pdf

 

[Sagawa and Takeuchi(2000b)]

H. Sagawa and M. Takeuchi.

A method for recognizing a sequence of sign language words  represented in a japanese sign language sentence.

In IEEE International Conference on Automatic Face and Gesture  Recognition, Grenoble, France, Mar. 2000.

http://computer.org/proceedings/fg/0580/05800434abs.htm

A JSL (Japanese sign language) sentence is represented by connecting  several sign-language words continuously. And there are transitions between  the sign-language words that have no meaning in the sign-language sentence.  Therefore, to translate JSL into Japanese, first, it is necessary to detect  each sign-language word from the inputted gesture of a JSL sentence with high  accuracy. And then, a proper sequence of the recognized sign-language words  must be generated. To achieve this, we have developed (1) a method for  effectively detecting the borders of the sign-language words from ordinary  sign-language gestures and segmenting the sign-language gestures, (2) a  method for detecting whether the sign-language gesture is represented by one  hand or both hands, and (3) a method for distinguishing the segments  representing the sing-language words from the segments representing the  transitions. We have carried out an experiment with 200 samples of 10 JSL  sentences to recognize the sequences of the sign language words using the  developed methods. As the result, The accuracy for the word is improved from  77.6% to 86.6%, and the accuracy for the sentence is improved from 46.0% to  58.0% by using the developed methods. These results indicate that the  developed methods are effective.

 

[Sagawa et al.(1997)]

H. Sagawa, M. Takeuchi, and M. Ohki.

Description and recognition methods for sign language based on  gesture components.

In Proceedings of the 1997 International Conference on  Intelligent User Interfaces, I/O Support/Spatial Awareness, pages 97-104,  1997.

http://www.acm.org/pubs/citations/proceedings/uist/238218/p97-sagawa/

 

[Sagawa et al.(1998)]

H. Sagawa, M. Takeuchi, and M. Ohki.

Methods to describe and recognize sign language based on gesture components represented by symbols and numerical values.

Knowledge-based Systems 10 (5):287-294, March 1998.

http://www.elsevier.com/gej-ng//10/30/56/12/11/14/abstract.html

Sign-language gestures inflect according to the context. To recognize such sign language properly, the structure of sign language must be made clear. It is well known that the structure of sign language is represented as a combination of gesture components. In this paper, methods for the description and recognition of sign-language gestures based on the gesture components are discussed. In these methods, which we have developed, a sign-language gesture is recognized by integrating the recognized gesture components according to the structure of the gesture. The results of an experiment on recognizing sign-language gestures are also examined and it is shown that the developed methods are effective.

 

 

[Shin et al.(1999)]

S.-H. Shin, S.-W. Kim, and Y. Aoki.

A structural learning of mlp classifiers using pfsga and its  application to korean sign language recognition.

In Proceedings of the IEEE Region 10 Conference TENCON 99,  pages 190-193, 1999.

http://ieeexplore.ieee.org/iel5/6630/17685/00818382.pdf

We present experimental results for a structural learning of  multilayered perceptron (MLP) classifiers using PfSGA (Parameter-free Species  Genetic Algorithm) and its application to the recognition of Korean sign  language. The PfSGA is a combined method of the SGA (Species Genetic  Algorithm) and PfSGA (Parameter-free Genetic Algorithm). The SGA is a  modified GA for reducing the search space based on species concepts and PfGA  is another modified GA to reduce the learning time without determining the  learning parameters. Experimental results show that the proposed method could  be a useful tool for choosing an appropriate architecture for high  dimensions.

 

[Starner and Pentland(1995a)]

T. Starner and A. Pentland.

Real-time american sign language recognition from video using hidden  markov models.

In Proceedings of the IEEE International Conference on Computer  Vision, Coral Gables, FL, 1995.

http://citeseer.nj.nec.com/starner96realtime.html

 

[Starner and Pentland(1995b)]

T. Starner and A. Pentland.

Visual recognition of american sign language using hidden markov  models.

In International Workshop on Automatic Face and Gesture  Recognition (IWAFGR), Zurich, Switzerland, 1995.

http://vismod.www.media.mit.edu/cgi-bin/tr_pagemaker#TR306

 

[Starner et al.(1997)]

T. Starner, J. Weaver, and A. Pentland.

A wearable computing based american sign language recognizer.

Personal Technologies, 1 (4), 1997.

ftp://whitechapel.media.mit.edu/pub/tech-reports/TR-425.ps.Z

Also in: International Symposium on Wearable Computers 1997.

 

[Starner et al.(1998a)]

T. Starner, J. Weaver, and A. Pentland.

Real-time american sign language recognition using desk and wearable  computer based video.

IEEE Transactions on Pattern Analysis and Machine Intelligence,  20 (12): 1371-137, Dec. 1998.

http://computer.org/tpami/tp1998/i1371abs.htm

 

[Starner et al.(1998b)]

T. Starner, J. Weaver, and A. Pentland.

A wearable computer based american sign language recognizer.

In IEEE International Symposium on Wearable Computing, volume  1458 of Lecture Notes in Computer Science, 1998.

http://link.springer-ny.com/link/service/series/0558/bibs/1458/14580084.htm

Modern wearable computer designs package workstation level  performance in systems small enough to be worn as clothing. These machines  enable technology to be brought where it is needed the most for the  handicapped: everyday mobile environments. This paper describes a research  effort to make a wearable computer that can recognize (with the possible goal  of translating) sentence level American Sign Language (ASL) using only a  baseball cap mounted camera for input. Current accuracy exceeds 97% per word  on a 40 word lexicon.

 

[Sugiyama et al.(1997)]

H. Sugiyama, S. Tanahashi, and Y. Aoki.

Recovering three dimensional hand motions of sign language from  monocular image sequence.

In Proceedings of 1997 International Conference on Information,  Communications and Signal Processing, volume 2, pages 1098-1101, 1997.

This paper proposes a method to recover three dimensional hand  motions of Japanese sign language. This method uses skin-color information  and edge and difference images. In this method, joint points are determined  by considering a human body consisting of hierarchical structure. Also, we  attempt to spot each word from a sentence by using estimated hand motion  transitions. The experimental results show the validity of the proposed  method.

 

[Sujan and Meggiolaro(2000)]

V. A. Sujan and M. A. Meggiolaro.

Sign language recognition using competitive learning in the havnet  neural network.

In N. M. Nasrabadi and A. K. Katsaggelos, editors, Applications  of Artificial Neural Networks in Imaging V (Electronic Imaging 2000), volume  3962 of Proc. SPIE, pages 2-12, San Jose, CA, 2000.

http://spie.org/scripts/abstract.pl?bibcode=2000SPIE%2e3962%2e%2e%2e%2e%2S&db_key=INST&qs=spie&s_type=paper

 

[Sutherland(1996)]

A. Sutherland.

Real-time video-based recognition of sign language gestures using  guided template matching.

In P. A. Harling and A. D. Edwards, editors, Proceedings of  Gesture Workshop, pages 31-38, 1996.

 

[Sweeney and Downton(1996)]

G. Sweeney and A. Downton.

Towards appearance-based multi-channel gesture recognition.

In P. A. Harling and A. D. Edwards, editors, Proceedings of  Gesture Workshop, pages 7-16. Springer, 1996.

 

[Sweeney and Downton(1997)]

G. Sweeney and A. Downton.

Sign language recognition using a cheremic architecture.

In International Conference on Image Processing and Its  Applications, volume 2, pages 483-486, 1997.

http://ieeexplore.ieee.org/iel3/4857/13412/00615572.pdf

In this paper we analyse the gestures that compose British Sign  Language (BSL), where an immediate response or detection of a single gesture  is not required but instead a sequence must be translated with a small delay  to allow for contextual information to be considered. BSL is a natural  language that uses two communication channels comprising hand/body postures  (manual channel) and facial expressions (non-manual channel) to communicate  information. In this paper, analysis is restricted to the manual channel  only.

 

[Tamura and Kawasaki(1988)]

S. Tamura and S. Kawasaki.

Recognition of sign language motion images.

Pattern Recognition, 21 (4): 343-353, 1988.

 

[Tokuda and Okumura(1998a)]

M. Tokuda and M. Okumura.

Towards automatic translation from japanese into japanese sign  language.

In Applications in Robotics, User Interfaces, and Natural  Language Processing, volume 1458 of Lecture Notes in Computer Science.  Springer, 1998.

http://link.springer.de/link/service/series/0558/bibs/1458/14580097.htm

In this paper, we present a prototype translation system named SYUWAN  which translates Japanese into Japanese sign language. One of the most  important problems in this task is that there are very few entries in a sign  language dictionary compared with a Japanese one. To solve this problem, when  the original input word does not exist in a sign language dictionary SYUWAN  applies several techniques to find a similar word from a Japanese dictionary  and substitutes this word for the original word. As the result, SYUWAN can  translate up to 82% of words which are morphologically analyzed.

 

[Tokuda and Okumura(1998b)]

M. Tokuda and M. Okumura

Automatic complement of sign language dictionary in japanese-sign  language machine translation.

Transactions of the Information Processing Society of Japan,  39, Mar. 1998.

 

[Uchida et al.(1994)]

M. Uchida, K. Ishikawa, and H. Ide.

Finger character recognition system and application to sign language.

Transactions of the Institute of Electrical Engineers of Japan,  114-C (10): 995-1000, Oct. 1994.

 

[Vamplew(1996)]

P. Vamplew.

Recognition of Sign Language Using Neural Networks.

PhD thesis, Department of Computer Science, University of Tasmania,  1996.

 

[Vamplew and Adams(1992)]

P. Vamplew and A. Adams.

The slarti system: Applying artificial neural networks to sign  language recognition.

In H. Murphy, editor, Proceedings of the Seventh Anual  Conference on Technology and persons with disabilities, pages 575-579,  Northridge, CA, 1992.

 

[Vamplew and Adams(1995)]

P. Vamplew and A. Adams.

Recognition and anticipation of hand motions using a neural network.

In Proceedings of IEEE International Conference on Neural  Networks, volume 3, pages 2904-2907, 1995.

http://citeseer.nj.nec.com/vamplew95recognition.html

 

[Veale et al.(1998)]

Tony Veale, Alan Conway and Brona Collins.

The challenges of cross-modal translation: English-to-sign-language  translation in the zardoz system.

Machine translation, 13 (1): 81-106, 1998.

http://www.wkap.nl/oasis.htm/180039

 

[Vogler and Metaxas(1997)]

C. Vogler and D. Metaxas.

Adapting hidden markov models for asl recognition by using  three-dimensional computer vision methods.

In Proceedings of the IEEE International Conference on Systems,  Man and Cybernetics, pages 156-161, Orlando, FL, Oct. 1997.

http://citeseer.nj.nec.com/271989.html

 

[Vogler and Metaxas(1998)]

C. Vogler and D. Metaxas.

Asl recognition based on a coupling between hmms and 3d motion  analysis.

In Proceedings of the IEEE International Conference on Computer  Vision, pages 363-369, Mumbai, India, Jan. 1998.

http://citeseer.nj.nec.com/vogler98asl.html

Also as Technical report MS-CIS-98-21.

 

[Vogler and Metaxas(1999a)]

C. Vogler and D. Metaxas.

Parallel hidden markov models for american sign language recognition.

In Proceedings of the IEEE International Conference on Computer  Vision, Kerkyra, Greece, Sept. 1999.

ftp://ftp.cis.upenn.edu/pub/cvogler/iccv99.pdf

 

[Vogler and Metaxas(1999b)]

C. Vogler and D. Metaxas.

Toward scalability in asl recognition: Breaking down signs into  phonemes.

In Proceedings of the Gesture Workshop, volume 1739 of Lecture  Notes in Computer Science, Gif-sur-Yvette, France, Mar. 1999.  Springer.

ftp://ftp.cis.upenn.edu/pub/cvogler/gw99.pdf

 

[Vogler and Metaxas(2001)]

C. Vogler and D. Metaxas.

A framework for recognizing the simultaneous aspects of american sign  language.

Computer and Vision Image Understanding,   (3): 358-384, Mar. 2001.

http://www.idealibrary.com/links/doi/10.1006/cviu.2000.0895

 

[Vogler et al.(2000)]

C. Vogler, H. Sun, and D. Metaxas.

A framework for motion recognition with applications to american sign  language and gait recognition.

In Proceedings of the Workshop on Human Motion, Austin, TX,  2000.

http://www.computer.org/proceedings/humo/0939/09390033abs.htm

Human motion recognition has many important applications, such as  improved human-computer interaction and surveillance. A big problem that  plagues this research area is that human movements can be very complex.  Managing this complexity is difficult. We turn to American sign language  (ASL) recognition to identify general methods that reduce the complexity of  human motion recognition. We present a framework for continuous 3D ASL  recognition based on linguistic principles, especially the phonology of ASL.  This framework is based on parallel hidden Markov models (HMMs), which are  able to capture both the sequential and the simultaneous aspects of the  language. Each HMM is based on a single phoneme of ASL. Because the phonemes  are limited in number, as opposed to the virtually unlimited number of signs  that can be composed from them, we expect this framework to scale well to  larger applications. We then demonstrate the general applicability of this  framework to other human motion recognition tasks by extending it to gait  recognition.

 

[Waldron and Kim(1994)]

M. B. Waldron and S. Kim.

Increasing manual sign recognition vocabulary through relabeling.

In International Joint Conference on Neural Networks, pages  2885-2889, June 1994.

http://ieeexplore.ieee.org/iel2/3013/8560/00374689.pdf

In this paper we present the results of relabelling a self organizing  map (SOM) to increase the dynamic manual signs it can recognize. Relabelling  exploits the global ordering of self organizing map and abrogates the need  for retraining, thereby reducing the computational costs and increasing the  recognition ability of the network. This relabelling technique was applied to  a dynamic sign recognition system to increase the recognition vocabulary from  10 to 14 signs. The data was collected from a person wearing a DataGlove with  a Polhemus sensor and signing the 14 signs. The sampled hand data over the  duration of sign was fed to phonemic recognition modules and the collective  outputs of these modules were fed to the sign recognition module consisting  of a relabelled self organizing network. The results showed that the overall  recognition rate of the relabelled network was 84% as compared to 86% for the  retrained network. Further, it was found that the dynamic sampling of the  signs made the movement phoneme module unnecessary.

 

[Waldron and Kim(1995)]

M. B. Waldron and S. Kim.

Isolated asl sign recognition system for deaf persons.

IEEE Transactions on Rehabilitation Engineering, 3  (3), Sept. 1995.

http://ieeexplore.ieee.org/iel4/86/9209/00413199.pdf

The design and evaluation of a two-stage neural network which can  recognize isolated ASL signs is given. The input to this network is the hand  shape and position data obtained from a DataGlove mounted with a Polhemus  sensor. The first level consists of four backpropagation neural networks  which can recognize the sign language phonology, namely, the 36 hand shapes,  10 locations, 11 orientations, and 11 hand movements. The recognized phonemes  from the beginning, middle, and end of the sign are fed to the second stage  which recognizes the actual signs. Both backpropagation and Kohonen's  self-organizing neural work was used to compare the performance and the  expandability of the learned vocabulary. In the current work, six signers  with differing hand sizes signed 14 signs which included hand shape,  position, and motion fragile and triple robust signs. When a backpropagation  network was used for the second stage, the results show that the network was  able to recognize these signs with an overall accuracy of 86%. Further, the  recognition results were linearly dependent on the size of the finger in  relation to the metacarpophalangeal joint and the total length of the hand.  When the second stage was a Kohonen's self-organizing network, the network  could not only recognize the signs with 84% accuracy, but also expand its  learned vocabulary through relabeling.

 

[Wang et al.(2000)]

C. Wang, W. Gao, and J. Ma.

An approach to automatically extracting the basic units in chinese  sign language recognition.

In Proceedings 5th International Conference on Signal  Processing, volume 2, pages 855 --858, 2000.

http://ieeexplore.ieee.org/iel5/7173/19290/00891645.pdf

The recognition of large vocabulary continuous Chinese sign language  (CSL) is a challenging problem. It is effective to use phonemes instead of  whole signs as the basic units. In this paper, an approach to extracting the  basic units in CSL automatically is described. In order to find subwords in  each data streams in sign signals, dynamic programming (DP) is proposed to  segment the data streams, and then ANN approach combining k-means is used to  classify these segments. 71 hand postures are automatically extracted from  1063 words and 200 continuous sentences. These postures accompanied with  locations and orientations are used as basic units in large vocabulary  continuous CSL recognition.

 

[Wang et al.(2001a)]

C. L. Wang, W. Gao, and Z. G. Xuan.

A real-time large vocabulary continuous recognition system for  chinese sign language.

In IEEE Pacific Rim Conference on Multimedia 2001, volume 2195  of Lecture Notes in Computer Science, Bejing, China, 2001.

 

[Wang et al.(2001b)]

A Real-Time Large Vocabulary Recognition System for Chinese Sign Language.

C.L. Wang, W. Gao, and J. Ma.

In Proceedings of the Gesture Workshop, London, Apr. 2001.
http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/GWpaper_15.pdf

Draft.

 

[Wei et al.(2001)]

Z. Wei, Y. Kui, L. Jindong, and L. Bencheng.

A method for hand tracking and motion recognizing in chinese sign  language.

In Proceedings 2001 International Conferences on Info-tech and  Info-net, volume 3, pages 543-549, Beijing, China, 2001.

 

[Wilson and Anspach(1993a)]

E. Wilson and G. Anspach.

Neural networks for sign language translation.

In S. K. Rogers, editor, Applications of Artificial Neural  Networks IV, volume 1965 of Proceedings of the SPIE - The International  Society for Optical Engineering, pages 589-599, 1993.

http://spie.org/scripts/abstract.pl?bibcode=1993SPIE%2e1965%2e%2e589W&db_key=INST&qs=spie&s_type=paper

 

[Wilson and Anspach(1993b)]

E. Wilson and G. Anspach.

Wavelets for sign language translation.

In B. G. Haskell and H.-M. Hang, editors, Visual Communications  and Image Processing '93, volume 2094 of Proceedings of the SPIE - The  International Society for Optical Engineering, pages 1300-1308,  1993.

http://spie.org/scripts/abstract.pl?bibcode=1993SPIE%2e2094%2e1300W&db_key=INST&qs=spie&s_type=paper

 

[Wilson and Gretel(1993)]

E. J. Wilson and A. Gretel.

Applying neural network developments to sign language translation.

In C. A. e. a. Kamm, editor, Proceedings of the 1993 IEEE-SP  Workshop on Neural Networks for signal processing III, New York, NY, 1993.

http://ieeexplore.ieee.org/iel2/3312/9944/00471858.pdf

Neural networks are used to extract relevant features of sign  language from video images of a person communicating in American Sign  Language or Signed English. The key features are hand motion, hand location  with respect to the body, and handshape. A modular design is under way to  apply various techniques, including neural networks, in the development of a  translation system that will facilitate communication between deaf and  hearing people. Signal processing techniques developed for defense-related  programs have been adapted and applied to this project. Algorithm development  and transition using neural network architectures has been encouraging. The  results of the feasibility study for this project are described.

 

[Wu and Gao(2000)]

J. Wu and W. Gao.

A fast sign word recognition method for chinese sign language.

In T. Tan, Y. Shi, and W. Gao, editors, Proceedings of the Third  International Conference on Advances in Multimodal Interfaces, volume 1948  of Lecture Notes in Computer Science, pages 582-589, Oct. 2000.

http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480599.htm

Sign language is the language used by the deaf, which is a  comparatively steadier expressive system composed of signs corresponding to  postures and motions assisted by facial expression. The objective of sign  language recognition research is to ``see'' the language of deaf. The  integration of sign language recognition and sign language synthesis jointly  comprise a ``human-computer sign language interpreter'', which facilitates  the interaction between deaf and their surroundings. Considering the speed  and performance of the recognition system, Cyberglove is selected as gesture  input device in our sign language recognition system, Semi-Continuous Dynamic  Gaussian Mixture Model (SCDGMM) is used as recognition technique, and a  search scheme based on relative entropy is proposed and is applied to  SCDGMM-based sign word recognition. Comparing with SCDGMM recognizer without  searching scheme, the recognition time of SCDGMM recognizer with searching  scheme reduces almost 15 times.

 

[Wu and Gao(2001)]

J. Wu and W. Gao.

The recognition of finger-spelling for chinese sign language.

In Proceedings of the Gesture Workshop, 2001.

http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/gw2001_29.pdf

Draft.

In this paper 3-layer feedforward network is introduced to recognize  Chinese manual alphabet. As it is plagued by a large amount of significant  time required for training, a kind of fast learning algorithm - Single  Parameter Dynamic Search Algorithm (SPDS) - is used to learn net parameters.  In addition, multi-classifiers based on multi-features are constructed and a  recognition algorithm for recognizing manual alphabets based on  multi-features and multi-classifiers is proposed to promote the recognition  performance of finger-spelling. From experiment result, it is shown that  Chinese finger-spelling recognition based on multi-features and  multi-classifiers outperforms its recognition based on single-classifier.

 

[Wu et al.(1998)]

J. Wu, W. Gao, Y. Song, W. Liu, and B. Pang.

A simple sign language recognition system based on data glove.

In Proceedings Fourth International Conference on Signal  Processing, volume 2, pages 1257-1260, 1998.

http://ieeexplore.ieee.org/iel5/6237/16697/00770847.pdf

Human beings usually interact with each other either by via a natural  language channel such as speech and writing, or by body languag, e.g. hand  gestures, head gestures, facial expression, lip motion and so on. As a part  of natural language understanding, sign language recognition is very  important. On one hand, it is one of the main methods of human-computer  interaction in virtual reality; on the other hand, it is an auxiliary tool  for a deaf-mute to communicate with ordinary people through computer. In this  paper the process of building a simple word-level sign language recognition  system is presented, and the method for recognizing sign language words is  also proposed.

 

[Wu et al. (2002)]

Chung-Hsien Wu, Yu-Hsien Chiu, Kung-Wei Cheng.

Sign language translation using an error tolerant retrieval algorithm.

In Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2002.

 

[Xu et al.(2000)]

M. Xu, B. Raytchev, K. Sakaue, O. Hasegawa, A. Koizumi, M. Takeuchi, and  H. Sagawa.

A vision-based method for recognizing non-manual information in  japanese sign language.

In Third International Conference on Multimodal Interfaces,  volume 1948 of Lecture Notes in Computer Science, pages 572-581,  Beijing, China, Oct. 2000. Springer.

http://link.springer-ny.com/link/service/series/0558/bibs/1948/19480572.htm

 

[Yamaguchi et al.(1995)]

T. Yamaguchi, M. Yoshihara, M. Akiba, M. Kuga, N. Kanazawa, and K. Kamata.

Japanese sign language recognition system using information  infrastructure.

In Proceedings of 1995 IEEE International Conference on Fuzzy  Systems, 1995.

http://ieeexplore.ieee.org/iel2/3225/9179/00410043.pdf

The paper proposes a Japanese sign language recognition system that  makes associative inferences from video information. The proposed system uses  two associative memory features: association robustness and associative  memory combination. A test of the actual system was made using sixteen words  and the recognition system correctly distinguished all words.

 

[Yamamoto et al.(1997)]

Y. Yamamoto, M. Uchida, and H. Ide.

Sign language recognition by statistics method.

The Transactions of the Institute of Electrical Engineers of  Japan, 117 (3), 1997.

 

[Yang and Ahuja(1998)]

M.-H. Yang and N. Ahuja.

Extraction and classification of visual motion patterns for hand  gesture recognition.

In IEEE Conference on Computer Vision and Pattern Recognition,  pages 892-897, Santa Barbara, CA, 1998.

http://citeseer.nj.nec.com/yang98extraction.html

We present a new method for extracting and classifying motion  patterns to recognize hand gestures. First, motion segmentation of the image  sequence is generated based on a multiscale transform and attributed graph  matching of regions across frames. This produces region correspondences and  their affine transformations. Second, color information of motion regions is  used to determine skin regions. Third, human head and palm regions are  identified based on the shape and size of skin areas in motion. Finally,  affine transformations defining a region's motion between successive frames  are concatenated to construct the region's motion trajectory. Gestural motion  trajectories are then classified...

 

[Yang and Ahuja(1999)]

M.-H. Yang and N. Ahuja.

Recognizing hand gesture using motion trajectories.

In IEEE Conference on Computer Vision and Pattern Recognition,  pages 466-472, Ft. Collins, CO, June 1999.

http://ieeexplore.ieee.org/iel5/6370/17045/00786979.pdf

We present an algorithm for extracting and classifying  two-dimensional motion in an image sequence based on motion trajectories.  First, a multiscale segmentation is performed to generate homogeneous regions  in each frame. Regions between consecutive frames are then matched to obtain  2-view correspondences. Affine transformations are computed from each pair of  corresponding regions to define pixel matches. Pixels matches over  consecutive images pairs are concatenated to obtain pixel-level motion  trajectories across the image sequence. Motion patterns are learned from the  extracted trajectories using a time-delay neural network. We apply the  proposed method to recognize 40 hand gestures of American Sign Language.  Experimental results show that motion patterns in hand gestures can be  extracted and recognized with high recognition rate using motion  trajectories.

 

[Yoshino et al.(1996)]

K. Yoshino, T. Kawashima, and Y. Aoki.

Recognition of japanese sign language from image sequence using color  combination.

In Proceedings of the IEEE International Conference on Image  Processing, volume 3, pages 511-14, 1996.

This paper proposes a method of recognising Japanese sign language  indirectly using a colored glove which has multiple color patches on its  surface. Japanese sign language has finger alphabets and words which include  a hand motion. In the proposed method, the finger alphabets are estimated  from three parameters: the color combination of the visible patches, the  dispersion of the visible patches, and the hand direction. The color  combination is represented by the mean of the color histogram of the visible  patches in an input image. Words which include a hand motion are estimated  from the transition of characteristic hand structures which are detected from  an image sequence by changes in the color combination of the visible patches.  The experimental results show the validity of the proposed method.

 

[Zhang et al. (2002)]

Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel

Automatic Sign Translation

In Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2002.

 

 [Zhao et al.(2000)]

L. Zhao, K. Kipper, W. Schuler, C. Vogler, N. Badler, and M. Palmer.

A machine translation system from english to american sign language.

In Proceedings of the Association for Machine Translation in the  Americas, 2000.

http://www.cis.upenn.edu/~lwzhao/papers/amta00.pdf

 

Transmition/Compression

Abramatic, J.F., Letellier, P., Nadler, M.,

A Narrow-Band Video Communication System for the Transmission of Sign Language over Ordinary Telephone Lines,

ISPDSA83(314-346).

Letellier, P., Nadler, M., Abramatic, J.F.

The Telesign Project

PIEEE(73), 1985, pp. 813-827. BibRef 8500

Mozelle, Gerard; Preteux, Francoise; Viallet, Jean-Emmanuel;

Telesign: a videophone system for sign language distant communication

 

Proc. SPIE Vol. 3457, p. 157-170, Mathematical Modeling and Estimation Techniques in Computer Vision, Francoise Preteux; Jennifer L. Davidson; Edward R. Dougherty; Eds., 1998

Sperling, G., Landy, M.S., Cohen, Y., and Pavel, M.,

Intelligible Encoding of ASL Image Sequences at Extremely Low Information Rates,

CVGIP(31), No. 3, September 1985, pp. 335-391. BibRef 8509 And: Correction: CVGIP(35), No. 2, August 1986, pp. 274. Application, Sign Language. NYU, The various methods get to 80% intelligibility at 9600Baud.

 

Woelders W., Frowein H.W., Nielsen J., Questa P. and Sandini G.

New developments in low-bit rate videotelephony for people who are deaf

Journal of Speech, Language, and Hearing Research, Vol: 40 Iss: 6 p. 1425-33, American Speech-Language-Hearing Assoc., Dec. 1997

 

Higashino S. and Kasahara H.

A hierarchical browsing system for sign language video

Design of Computing Systems: Cognitive Considerations. Proceedings of the Seventh International Conference on Human-Computer Interaction (HCI International '97), Elsevier Amsterdam, Netherlands, 1997.

 

Zaharia, Titus; Preda, Marius; Preteux, Francoise

Sign language indexation within the MPEG-7 framework

Proc. SPIE Vol. 3816, p. 214-228, Mathematical Modeling, Bayesian Estimation, and Inverse Problems, Francoise Preteux; Ali Mohammad-Djafari; Edward R. Dougherty; Eds., 1999

 

Pook, Polly K.; Ballard, Dana H.

Sign language for telemanipulation

Proc. SPIE Vol. 2351, p. 13-24, Telemanipulator and Telepresence Technologies, Hari Das; Ed., 1995

 

Huang, Chung-Lin; Wu, C. H.;

Encoding of sign language image sequences at very low rate

Proc. SPIE Vol. 1360, p. 1140-1150, Visual Communications and Image Processing '90: Fifth in a Series, Murat Kunt; Ed., 1990

 

Tomohiro Kuroda, Kosuke Sato, Kunihiro Chihara

S-TEL: An Avatar Based Sign Language Telecommunication System

International Journal of Virtual Reality, Vol.3, No.4, pp.21-27, (1998)

 

S.Malassiotis and M.G.Strintzis

Motion Estimation Based on Spatio-Temporal Warping for Very Low Bit-Rate Coding

IEEE Trans. on Communications, Vol.45, No.10, pp.1172-1176, October 1997.

 

Richard P. Schumeyer, Kenneth E. Barner

Gesture Color-Based Content Coding with Applications to Sign Language Video Communications (1997)

Synthesis

J. Loomis and H. Poizner and U. Bellugi and A. Blakemore and J. Hollerbach

Computer graphic modeling of American sign language

Computer Graphics, 17(3), pp. 105-114, July 1983.          

 

 

Rosalee Wolfe

An interface for transcribing American sign language

SIGGRAPH 99. Proceedings of the 1999 SIGGRAPH annual conference: Conference abstracts and applications, Computer Graphics, pp. 229-229, ACM Press, 1999.     

Eric Sedgwick and Karen Alkoby and Mary Jo Davidson and Roymieco Carter and Juliet Christopher and Brock Craft and Jacob Furst and Damien Hinkle and Brian Konie andGlenn Lancaster and Steve Luecking and Ashley Morris and John McDonald and Noriko Tomuro and Jorge Toro and Rosalee Wolfe

Toward the Effective Animation of American Sign Language

WSCG 2001 Conference Proceedings, 2001.

Jorge Toro, Jacob Furst, Karen Alkoby, Roymieco Carter, Juliet Christopher, Brock Craft, Mary Jo Davidson, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.

A Graphical Environment for Transcription of American Sign Language.

Proceedings of the ISCA 16th International Conference on Computers and Their Applications (CATA-2001). To appear.

Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.
Improved Hand Animation for American Sign Language.

“Technology and Persons with Disabilities” Conference 2001. California State University at Northridge, Los Angeles, CA. March 19-24, 2001.

Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

Toward the effective animation of American Sign Language.

Proceedings of the 9th International Conference in Central Europe on Computer Graphics, Visualization and Interactive Digital Media. To appear.

Brock Craft, Damien Hinkle, Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Jacob Furst, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

An Approach To Modeling Facial Expressions Used In American Sign Language.

Presented At The 2000 DePaul CTI Research Conference, Chicago, IL November 4, 2000.

Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Andre Berthiaume, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

Usability Testing of Computer Animation of Fingerspelling for American Sign Language.

Presented at the 2000 DePaul CTI Research Conference, Chicago, IL, November 4, 2000.

John McDonald, Jorge Toro, Karen Alkoby, Andre Berthiaume, Pattaraporn Chomwong, Juliet Christopher, Mary Jo Davidson, Jacob Furst, Brian Konie, Glenn Lancaster, Steven Lytinen, Lopa Roychoudhuri, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.

An Improved Articulated Model of the Human Hand.

Proceedings of the 8th International Conference in Central Europe on Computer Graphics, Visualization and Interactive Digital Media. 2000. 306 - 313.

Karen Alkoby and Eric Sedgwick,

Using a Computer to Fingerspell.

DeafExpo 99, San Diego, CA, November 19-22, 1999.

Mary Jo Davidson, Karen Alkoby, Eric Sedgwick, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

Improved Hand Animation for American Sign Language.

“Technology and Persons with Disabilities” Conference 2001. California State University at Northridge, Los Angeles, CA. March 19-24, 2001.

Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Brock Craft, Jacob Furst, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

Toward the effective animation of American Sign Language.

Proceedings of the 9th International Conference in Central Europe on Computer Graphics, Visualization and Interactive Digital Media. To appear.

Brock Craft, Damien Hinkle, Eric Sedgwick, Karen Alkoby, Mary Jo Davidson, Roymieco Carter, Juliet Christopher, Jacob Furst, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Noriko Tomuro, Jorge Toro, Rosalee Wolfe.

An Approach To Modeling Facial Expressions Used In American Sign Language.

Presented At The 2000 DePaul CTI Research Conference, Chicago, IL November 4, 2000.

Jorge Toro, Jacob Furst, Karen Alkoby, Roymieco Carter, Juliet Christopher, Brock Craft, Mary Jo Davidson, Damien Hinkle, Brian Konie, Glenn Lancaster, Steve Luecking, Ashley Morris, John McDonald, Eric Sedgwick, Noriko Tomuro, Rosalee Wolfe.

A Graphical Environment for Transcription of American Sign Language.

Proceedings of the ISCA 16th International Conference on Computers and Their Applications (CATA-2001). To appear

Angus B. Grieve-Smith

SignSynth: A Sign Language Synthesis Application Using Web3D and Perl

Gesture Workshop 2001

Sign synthesis (also known as text-to-sign) has recently seen a large increase in the number of projects under development. Many of these focus on translation from spoken languages, but other applications include dictionaries and language learning. I will discuss the architecture of typical sign synthesis applications and mention some of the applications and prototypes currently available. I will focus on SignSynth, a CGI-based articulatory sign synthesis prototype I am developing at the University of New Mexico. SignSynth takes as its input a sign language text in ASCII-Stokoe notation (chosen as a simple starting point) and converts it to an internal feature tree. This underlying linguistic representation is then converted into a three-dimensional animation sequence in Virtual Reality Modeling Language (VRML or Web3D), which is automatically rendered by a Web3D browser.

Richard Kennaway

Synthetic Animation of Deaf Signing Gestures

Gesture Workshop 2001

We describe a method for automatically synthesizing deaf signing animations from a high-level description of signs in terms of the HamNoSys transcription system. Lifelike movement is achieved by combining a simple control model of hand movement with inverse kinematic calculations for placement of the arms. The realism can be further enhanced by mixing the synthesized animation with motion capture data for the spine and neck, to add natural "ambient motion".

 

Conway, Alan and Veale, Tony

A linguistic approach to sign language synthesis

People and Computers (1994), 211-222 p.

 

Franc Solina, Slavko Krapez, Vito Komac, Ales Jaklic

Multimedia Dictionary and Synthesis of Sign Language

Krapez, S. and Solina, F. (1999).

Synthesis of the sign language of the deaf from the sign video clips

Electrotechnical Review, 66(4--5):260--265

 

Kawano, S. and Kurokawa, Takao

The effects of facial expression on understanding Japanese Sign Language animation.

Bullinger, H.-J. / Ziegler, Jürgen (eds): Human-Computer Interaction: Ergonomics and User Interfaces. Proceedings of HCI International '99 (8th International Conference on Human-Computer Interaction) 2 Vols. Mahwah, NJ : Erlbaum (1999) - pp. 783-787

Kurokawa, Takao

Gesture coding and a gesture dictionary for a nonverbal interface.

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E75-A:2 E75-A: 2 (1992) - pp. 112-121

Kurokawa, Takao and Ono, Makoto and Kamiya, Hiroyuki

Coding of hand shapes and reduction of their semantic redundancy fundamental studies on noonverbal communication between man and machinery.

Proceedings of the 4th Symposion on Human Interface. (Hyumanu-Intafesu-Shinpojiumu-Ronbunshu; 4). (1988) - pp. 167-172

Ohashi, Kazuaki and Shibata, Akihiro and Kurokawa, Takao

3-D reconstruction of human body pose from a single view for the purpose of gesture interpretation.

Proceedings of the 2nd Symposion on Human Interface. (Hyumanu-Intafesu-Shinpojiumu-Ronbunshu; 2). (1986) - pp. 345-350

Kato, Yushi et al

A present overview and future problems on sign language engineering.

Human interface news and report 12 (1997) - pp. 37-44

Kawano, S. and Senba, K. and Kurokawa, Takao

Introduction of facial movement into sign language animation and its effects.

Human interface news and report 12 (1997) - pp. 357-362

Kurokawa, Takao and Senba, K. and Kawano, S.

Synthesis of facial movement images for sign language animation and their effects.

Advances in human factors / ergonomics 21 (1997) - pp. 5-8

Kurokawa, Takao and Takahashi, Y.

An analysis of blinking, gazing, nodding and head movement of an interpreter in signed TV news and rules of their occurrence.

Research Report of Cooperative Research Center, Kyoto Institute of Technology 6 (1997) - pp. 9-16

Kurokawa, Takao and Senba, K. and Kawano, S.

Synthesis of facial movement images for sign language animation and their effects.

Research Report of Cooperative Research Center, Kyoto Institute of Technology 6 (1997) - pp. 17-22

 

Hong, Mun-Ho; Choi, Chang-Seok; Kim, Chang-Seok; Jeon, Joon-Hyeon;

Synthesis of image sequences for Korean sign language using 3D shape model

Proc. SPIE Vol. 2513, p. 578-589, High-Speed Photography and Photonics: 21st International Congress, Ung Kim; Ed., 1995

 

H. Sakato, S. Lu, S. Igi

Development and Experiment of Generating JSL Animation System Based on Motion-Primitives

IPSJ SIG Notes, 98-HI-77, pp.81-86, 1998.

H. Sakato, S. Lu, S. Igi

Japanese Sign-Language Animation based on Motion-Primitives

Proc. of the Thirteenth Symposium on Human Interface, pp.243-248, Oct., 1997.

H. Sakato, S. Lu, and S. Igi

A Study of Generating JSL Annimation Based on Motion-Primitives

Proc. of 1997 EICE General Conference, No.1, pp.3271997

S Lu, K. Imagawa, S. Igi

A study to Motion Control of Sign Language Animation Character

Proc. of the 12th Symposium on Human Interface, pp.79-86, Oct.22-25, 1996.

 

Notation/transcription

Grobel, Kirsti and Hienz, Hermann

The video input system

Soede, M. (ed): SignPS - a System for Sign Writing, Final Report of the EU TIDE 1202 Project. Hoensbroek (The Netherlands) (1997), pp. 7/61- 7/67

Hienz, Hermann (editor)

Sign writing using a video-based input system

Kraiss, Karl-Friedrich (ed): Bi-Annual Report - Department of Technical Computer Science 1997/1998. Aachen : Shaker (1999), pp. 30-33

 

Hutton, George

The practicability and avantages of writing and printing natural signs

American Annals of the Deaf and Dumb, Volume 14 (1869), 157-182 p.

 

Mandel, Mark A.

Ascii-Stokoe notation: A computer-writeable transliteration system for Stokoe notation of American Sign Language

Unpubl. Manuscript, 993 - 12 p.

 

Dubuisson, Colette and Leclerc, Sylvie and DeMaisonneuve, Serge

Les graphes conceptuels : un outil de représentation des langues signées, ACL 95

Unpubl. Manuscript,Montréal 1995

 

Prillwitz, Siegmund and Leven, Regina  and Zienert, Heiko and Hanke, Thomas and Henning, Jan

HamNoSys. Version 2.0; Hamburg Notation System for Sign Languages. An introductory guide

International Studies on Sign Language and Communication of the Deaf; 5, Hamburg : Signum 1989 - 46 p. ISBN: 3-927731-01-3

Hanke, Thomas and Prillwitz, Siegmund

syncWRITER: Integrating video into the transcription and analysis of sign language

Bos, Heleen F. / Schermer, Gertrude M. (eds): Sign Language Research 1994: Proceedings of the Fourth European Congress on Sign Language Research, Munich, September 1-3, 1994. (International Studies on Sign Language and Communication of the Deaf; 29) Hamburg : Signum (1995), pp. 303-312

 

Ikehara, Wako and Kamikubo, Emiko and Hiki, Shizuo

A new descriptive system for hand shapes used in signing: A proposal based on anatomical structure modeling of finger actions

Unpubl. Manuscript,1995 - 12 p.

 

Miller, Christopher and Radutzky, Elena

Toward a common notation standard for sign language phonology. Paper presented at the 2nd Intersign Workshop Leiden, December

Manuscript, 1998

 

Lee, Jintae

Notational representation of sign language: A structural description of hand configurations

Zagler, Wolfgang L. / Busby, Geoffrey / Wagner, Roland R. (eds): Computers for handicapped persons. 4th International Conference, ICCHP '94 Vienna, Austria, September 14-16, 1994 Proceedings. Berlin, New York : Springer (1994), pp. 38-45

Lecture Notes in Computer Science, Vol. 860, p. 38-??, 1994

 

Brown, Maxine D. and Smoliar, Stephen W.

A graphics editor for Labanotation

Computer Graphics 10:2 (1976), 60-65 p.

 

McEntee, Lisa

Coding and transcription for BSL acquisition

Manuscript, Bristol : Centre for Deaf Studies 1996 - 43 p.

 

Massoud, LindaLee

SignGlyphics: How to sketch sign pictures

Flint, MI : SignQuest Publishers 1993 - 16 p., ISBN: 1-878819-31-3

 

Antônio Carlos da Rocha Costa, Graçaliz Pereira Dimuro

SignWriting-based Sign Language Processing

Gesture Workshop 2001

This paper proposes an approach to the computer processing of deaf sign languages that uses SignWriting as the writing system for deaf sign languages, and SWML (SignWriting Markup Language) as its computer encoding. Every kind of language and document processing (storage and retrieval, analysis and generation, translation, spell-checking, search, animation, dictionary automation, etc.) can be applied to sign language texts and phrases when they are written in SignWriting and encoded in SWML. This opens the whole area of deaf sign languages to the methods and techniques of text-oriented computational linguistics.

http://www.techfak.uni-bielefeld.de/ags/wbski/gw2001book/draftpapers/gw38.pdf (draft)

Education

Kathleen F. McCoy Lisa N. Masterman

A Tutor for Teaching English as a Second Language for Deaf Users of American Sign Language (1997)

 

Lisa N. Michaud and Kathleen F. McCoy and Christopher A. Pennington

An Intelligent Tutoring System for Deaf Learners of Written English

Fourth Annual ACM Conference on Assistive Technologies, pp. 92-100, ACM, 2000.

 

Fernando Alonso, Angélica de Antonio, José L. Fuertes, César Montes

Teaching Communication Skills to Hearing-Impaired Children

 

Sutherland, A. and Padden, Tessa

Videoconferencing for deaf people: a case study of on-line education for deaf people

Deafness and education international 1:2 (1999), 114-120 p.

 

Linguistics

Stokoe, William C.

Sign language structure: An outline of the visual communication systems of the American deaf

Studies in Linguistics. Occasional Paper; 8, Buffalo, NY : Univ. of Buffalo 1960 - 78 p.

 

Stokoe, William C.

Sign language structure: An outline of the visual communication systems of the American deaf

Edition: 2. ed, Silver Spring, Md. : Linstok Pr. 1993 - 94 p., ISBN: 0-932130-03-8

 

Devices

Ascension Technology Corporation - Motion Sensors

 

Corpora

Waleed’s data

 

Consisting of 95 different AUSLAN signs, with 27 samples per sign, for a total of 2565 signs, signed by a native signer volunteer.

Collected using :

  • Two Fifth Dimension Technologies (5DT) gloves, one right and one left.
  • Two Ascension Flock-of-Birds magnetic position trackers, one attached to each hand.

 

 

Purdue ASL database

 

This database consists of several videos of a set of ASL (American Sign Language) signs. The database is divided in three main parts: i) a set of videos of signs with distinct motion patterns, ii) a set of words with distinct handshapes, and iii) several ASL sentences to study prosody and sentence structure.

Fourteen native signers participated in the acquisition of the data. This data, for ten of the participants, will be made public available in Spring 2002. Due to its large size, the database will only be available on a set of 30 DVD. The DVDs will be made available at production cost