Gesture Keyboard - User centered design of a unique input device for Indic Scripts???
Ashish Krishna1, Rahul Ajmera1, Sandesh Halarnkar, Prashant Pandit
HP Laboratories India
March 18, 2005*
* Internal Accession Date Only
???HCI International 2005,
1Human Factors International, 4th Floor, Chemtex House, Hiranandani Gardens, Mumbai ??? 400 076, India
Approved for External Publication
??? Copyright 2005
Gesture Keyboard - User centered design of a unique input device for Indic Scripts
Abstract
Indic text input presents a unique challenge in the field of keyboard design because of the number of standalone characters, conjuncts, matras, and symbols. It poses a huge challenge to the HCI practitioners to accommodate the humongous array of Devnagari characters in a keyboard. Solutions for input in Devnagari scripts using a physical keyboard have been around for a while but none of them have emerged as a standard mechanism due to several usability related concerns. In this paper we have described the design of a new gesture based keyboard, which has a dual input mode 1. Gesturing 2. Tapping. The ???gesture keyboard??? concept is based on partial hand writing recognition as well as
1 Keywords
Devnagari text entry, Gesture Keyboard (GKB), Matras, Modifier, Syllable, IMEs
2 Introduction
Given the
This new keyboard introduces a paradigm shift in text inputting methods. We have developed a prototype for Devnagari, which takes best of both worlds; it takes cues from handwriting recognition (matras formation) and
The Gesture Keyboard is not presented as an alternative to the conventional English keyboard but has been proposed as a peripheral input device. The user would have an additional device (GKB) connected to his workstation along with the regular ASCII English keyboard and a mouse as shown in Figure below.
Figure 1: typical computer setup with a gesture keyboard
2.1Need for a Devnagari input device
Computer usage in India is almost entirely in English and is restricted to the
2.2Difference with roman script
Using a standard keyboard for Devnagari scripts creates usability issues, as the Devnagari Script possesses some conceptual differences from the Roman script.
Some of these are summarized below:
???The concept of matras (character modifiers) has no parallels in the Roman script.
???Each character has a shiro rekha (the top bounding line on Devnagari characters) on top of it.
???Character modifiers, basically matras and other special characters, can occur before, on top, below or after the main character that they modify. (Deb and Deshwal, 2003)
???Since in the written form there is no clear specification of the sequence in which modifiers should be added to the consonant, sequential entry modes like a keyboard would not fit the users mental models
???More than one modifier can be attached to one consonant
2.3Issues with using an Inscript Keyboard
Current Indian language typing solutions have a steep learning curve. According to Arjun Mahanto, the Hindi Officer in IIT Bombay, learning to type requires approximately fifty hours of training and practice for a person to reach speeds of 25 words per minute. This is too much of a barrier for ordinary people and only professional typists are willing to make this investment. (Chand et al., 2004)
Figure 2: Inscript keyboard (with multiple characters on a single key)
Commercially the demand for Devnagari input has been patched by the Inscript keyboard. This is essentially a QWERTY keyboard with Devnagari characters mapped onto the keys. The minimum number of characters that are required even representing Devnagari is 54, base consonants (34), vowels (19), and a halant key. This excludes Devnagari numerals and various Vedic symbols and modifiers i.e. Ref, Ru, Half Ra. This issue manifests itself on this keyboard, which is essentially meant for entering just 26 alphabets. The aftereffect is that each key has multiple characters (two to three characters per key) mapped onto them. These additional symbols on the keys are inputted using combinations of toggle keys like shift, alt and control. This keyboard has a very high learning curve due to the multiple shift modes as is suited for professional typewriters who undergo rigorous training to learn this keyboard.
3 Introduction to the GKB
Computing devices demand text input schemes that can be quickly learnt and retained to achieve a fair speed and easy to use rather than ???hunt and peck???. Roman keyboards are not particularly amenable to accommodate the phonetic
Keyboards support a ten fingered typing and also have two entry modes namely expert and novice. While the novice uses ???hunting and pecking??? to type, experts do a head up
There is an analogy between keyboards and styli. Keyboards can be used with no training: the letters can be tapped out
Working on this analogy we tried to develop a
3.1Gestures
Few major criterions for designing gesture (matras) were, 1.easy to learn, 2.fast to write and 3.easy to retain matras. Instead of developing new gestures for each Devnagari matra (as in the case of graffiti and unistrokes), we implemented these matras as they appear in the Varnmala. The only difference being the breakage of multi stroke matras into a combination of different single stroke matras. Also for a half character we designed a horizontal stroke, which was derived from the mental model of cutting a character in to half.
All these aforementioned criterions for matras were satisfied as there was no learning involved; users had to write them they way they normally write on paper. Also, the retention was easy as the users had to learn few matras namely horizontal strokes and oblique scoring for getting secondary glyphs from the keypad.
3.2Spatial mapping of gestures
In order to limit the technological complexity of recognition and thereby trying to reduce instances of errors, we have mapped the gestures on the coordinates of the writing space. This means that the number of elements that needs to be recognized has been reduced. The same gestures made in different areas of the pad produce different combinations wherein the consonant is selected using the spatial mapping and the modifiers are assigned after recognition of the gestures.
3.3Layout
The Varnmala structure was useful for people to locate keys and to reduce the cognitive load. Even users who had ???forgotten??? the alphabetical order could remember the local sequence of the letters they were searching for. For example, if they were looking for the (n) key, they would mutter an entire line of consonants ???(t) (th) (d) (dh) (n)??? before locating the key. (Chand et al. 2004)
Figure 3: Varnmala layout
We tried to design the gesture keypad as cognitively easy for a novice user as possible, so that with minute training and instructions they are able to achieve a decent writing speed. We kept the layout very basic, as it appears in the Devnagari varnmala books in the block.
3.4Feedback
Performance data with keyboards, where the auditory feedback can be switched on and off suggest that typing is significantly faster and more accurate with auditory feedback on than with off. (Birdwell, Monty & Snyder, 1983; Blake, Muto & Roe, 1984) Based on these prior studies we added a click sound on every pen down. This can be again toggled on and off based on user???s preference. The timings of the feedback and the frequency of sound are still to be investigated.
In addition visual feedback has been provided so that the user can see his handwritten input on the tablet
3.5Deleting using the GKB
The Devnagari script is very differently from roman scripts because of the conjuncts it forms when Consonant-
Figure 4: Editing problems eradicated using a horizontal score on the backspace
To overcome this problem we added one more gesture along with a normal ???backspace??? which was horizontal scoring on the backspace key. It deleted the whole syllable in a go, so that user can start afresh.
3.6Technology used
The technology behind GKB can be broadly categorized into three major components:
???Graphics tablet with electronic pen (hardware): This device is used to capture handwritten gestures as digital ink
???Handwritten gesture recognition (software): This algorithm recognizes the gestures and maps them to corresponding matras
???GKB controller (software): This software component maps the pen position to corresponding base character (consonant or vowel)
The tablet is used to provide a keypad layout on which user makes gestures using the pen. The digital ink is then processed & recognized. Position of the pen and recognizer output is used to form the final syllable, which is then transmitted to the application in focus.
3.7A scalable solution for multi lingual environment
One of the biggest advantages of the solution is its ability to accommodate inputs in other scripts. The tablet does not have labeled keys instead it had a layout print stuck on the pad surface, this layout can be changed easily thus enabling flexibility of multi lingual input.
4 Finalizing keys and layout size
Prior research (e.g. Clare, 1976) tells us about the design and size of keys which are used for
We had to come up with a key size and in turn layout size, which has the following characteristics:
???The users should be able to see the key label
???The key top size should be optimized for gesturing as well as tapping using a pen
???They keys should prevent the erroneous mapping of the alphabets
???As the key size and the layout size are directly proportionate, it should minimize the hand movement on the keypad
4.1User study
4.1.1Objective
The objective of this exploration was to find the average size users take to write consonants and modifiers (matras) and to find relationship between handwriting styles with different pen grip diameter and tip thickness.
4.1.2Method
Each user was given a Hindi paragraph (76 words) and they had to copy this Hindi corpus in their own handwriting on the provided paper. They were given basic instructions explaining the task and the goal of the experiment.
Participants were provided with two different size blank papers (A4 = 21 cm x 29.8 cm and A5 = 14.8 cm x 21 cm). They were first asked to write on the A4 sized paper and then on A5 paper with 0.5mm pen and 9 mm finger grip diameter pen. Same activity was then repeated with 0.1 mm tip and 12mm finger grip diameter pen.
4.1.3Subjects
21 voluntary subjects took part in this study. All of them, 1
???7 HP labs Employee (who know Hindi and occasionally write in Hindi)
???11 Students from 10th standard, from a local high school in Bangalore (who have Hindi as second language and write Hindi regularly)
???3 Teachers from the same high
Figure 5: Participants for user study
4.1.4Apparatus
4.1.4.1 Pen sizes:
???0.5 tip and 9mm diameter grip pen
???0.1 tip and 12mm diameter pen
4.1.4.2 Paper sizes:
???A4 = 21 cms x 29.8 cms
???A5 = 11.5 cms x 29.8 cms
Figure 6: Pens used for study (middle one is the stylus)
4.1.4.3 Hindi corpus used:
The Hindi paragraph used for this user study was taken from the BBC???s Hindi news portal (http://www.bbc.co.uk/hindi/news/020419_vajpayee_ac.shtml) and was modified a bit so that we could get all the nuances of Hindi matras in the corpus.
Figure 7: Hindi Corpus
4.1.5Evaluation method
Subjects??? hands written samples were collected and analyzed by measuring the base consonants and the modifiers with help of digital calipers. Each subject???s consonants and Modifiers size were noted down and divided according to the core strip, bottom strip and top strip.
Figure 8: Character study
4.1.6Analysis
Out of the four data sets that were collected, the most appropriate pen size and paper size that resembled the Gesture pad was 0.1 tip and 12mm diameter pen and A5 sized paper. So we analyzed this dataset and came out with the recommendation for the key size. We took the final data set and divided it in three groups, namely Top, Core and
Bottom strip. Since, the dataset was very distributed, we adhered to the 90th percentile data so that almost all of the population could be accommodated.
size in mms
size in mms
Participants
6
5 4.63
4
3
2
1
0
Participants
size in mms
4
3
3
2
1
0
Participants
Figure 9: Graphs showing cluster data for handwriting samples (X axis shows the size in mm and Y axis denotes the no of participants)
4.1.7Results
The final gesturable keypad sizes, which is analogous to handwriting was 11.82 mm. The core strip was 4.63 mm, space required for top strip was 4.13 mm, and the space required for the bottom strip was 3 mm. We didn???t take the left and right side into consideration as there are only few matras i.e. (aa, ee) which are added to the left and right of the core strip in the form a straight line and could easily be accommodated in the square top keys.
Figure 10: Final layout
5 Conclusions
This keyboard has been accepted quite well with the novice users who have never used Devnagari keyboard before. In our small sessions of user studies they all seem to be satisfied and were able to achieve a speed of
The keyboard has a high satisfaction factor associated with it because writing with pen on paper (in our case the keyboard) is very intuitive, powerful and efficient. It aptly exhibits the same characteristics of simplicity, naturalness and straightforwardness like handwriting.
6 Future Directions and discussions
We are currently planning to do a constrained usability study to measure the efficiency of the gesture keyboard against the other ???standard??? Devnagari input mechanism namely Inscript keyboard with novice as well as expert users. The aim of this proposed study is to see whether the satisfaction ratings change with these different user groups and also to compare the words per minute that we are able to achieve with this keyboard.
We are also studying various layouts (Varnmala, frequency and common
7 Acknowledgements
The authors would like to thank Dr. Shekhar Borgaonkar, Dr. Sriganesh Madhvanath, and Dr. Girish Prabhu for their valuable inputs and support during the project.
8 References
Chand Aditya, Ganu Ashish, Joshi Anirudha, Mathur Gaurav & Parmar Vikram (2004). ???Keylekh: A Keyboard for Text Entry in Indic Scripts???, CHI 2004, April
Clare C.R. (1976). ???Human Factors: a most important ingredient in keyboard designs???, Handbook of Human computer Interaction 2nd edition, page no pp
Goldberg David & Richardson Cate (1993).
Deb Dr. Kalyanmoy & Deshwal Priyendra Singh (2003). ???An Optimal
Joshi Anirudha & Rathod Amit (2002). ???A Dynamic Text Input scheme for phonetic scripts like Devanagari??? Development by Design, Bangalore 2002
Monty, Snyder and Birdwell, (1983) & Roe, Muto and Blake, (1984). Handbook of Human computer Interaction 2nd edition, page no pp