Dept. of IT | Hindi SiteContact us | Sitemap
 

TDIL

 
 
 
 
 
 
 
 
 
 
 
 

Human Machine Interface Systems(HUMIS)
Human Machine Interface Systems
Speech Databases
Speech database for Hindi Sounds
Optical Character Recognition Systems(OCR)
Text -to- Speech Synthesis System
OCR for Printed Devanagari Text
Technologies Developed by the respective Institutes/Organizations

Human Machine Interface Systems

       These systems aim at a better Human-machine interaction which envisages the use of other modes of communication with the computer namely: speech and vision. Speech input/output and Optical Character Recognition (OCR) Systems are still at nascent stages even in developed countries. They are successful only in specialized domains and restricted environments.

Speech Databases

       These can serve as test-bed for all application developments in the area of Speech syntComprehensive databases of the different units of speech such as phonemes, words etc. for various Indian Languages need to be developed which hesis and Speech Recognition.

Speech database for Hindi Sounds

       Speech signals for 1000 selected words of Hindi have been recorded for two utterances of about 50 speakers. PC based algorithms have been designed for speech signal processing, acoustic analysis of speech sounds and phoneme labelling. An acoustic phonetic database for these words has been created. This is useful for speech synthesis and recognition. This has been developed at CEERI, Delhi.

Optical Character Recognition Systems(OCR)

       Theses systems are useful for direct transfer of  large data into the computer with the help of scanner. The hard copy is fed into a scanner, the image file is then analysed and image processing are applied for transforming into ASCII file. Efforts are underway to develop OCR for Hindi.

Text -to- Speech Synthesis System

       Alpha version of Speech synthesis system for Hindi language has been developed. The prosodic rules are under development for further improving the quality of the speech. It is being developed at CEERI, Delhi.

OCR for Printed Devanagari Text

       Various image processing algorithms have been developed for obtaining the image matrices of the characters and identifying the Devanagari characters and words for laser printed text. This development is being carried out at C-DAC, Pune.

Technologies Developed by the respective Institutes/Organizations

HindiVani – Central Electronics Engineering Research Institute, N.Delhi

       It is a PC based Unlimited Vocabulary Text-to-Speech Conversion Software for Hindi (Alpha Version) for DOS platform. The text document is generated using a Hindi Editor which supports ISCII standard. The input words are split into syllables, using a parser. An acoustic-phonetic database of all these syllables is available in the database, which is subsequently used to create words. The concatenation of syllables into words and the superimposition of quality features is done by developing rules. The quality features of speech such as intonation, stress and timing patterns are then improved. A cascade-parallel formant synthesizer is used to synthesize the speech. The major components of the system are(Block Schematic Diagram):

Word Parser
Concatenator
Parametric Database
Knowledge Base (Frequency & Durational Rules)
Voice Quality Manager
Klatt Synthesizer
D/A Converter

       The s/w can be used as an aid for the visually handicapped and for information retrieval. The system is being ported to Windows environment and prosody rules are being incorporated for better quality speech output.

Devanagari Optical Character Recognition - C-DAC Pune
 
       OCR s/w has been developed with following input design parameters:

Laser printed raster scan image file using 300 and 600 dpi resolution
DV - Natraj and DV – Surekh Fonts
Font Sizes 12, 14, 16, 18, 20, 24, 28, 32, 36, 40 & 48
Scanning at 300 & 600 dpi using HP Desk Scan - 4C Scanner, black & white, 8 bit, .bmp file format
      
       Shiro-rekha based approach has been adopted. Based on which, Algorithms have been devised for segmentation, tilt correction and Feature Extraction to generate three sets of databases namely Top, Middle and Bottom modifiers. Approximately 95% accuracy has been obtained.

Pocket translator C-DAC Pune

       Pocket Translator is a tool for the foreign tourists to communicate with the locals. It offers instant translation in both script and voice from one language to another selected language. It stores standard phrases that are commonly used by travellers at various public utility places. It looks like calculator/casio diary having a miniature keyboard and LCD display unit that will runwith pencil cell.Language phrases have been selected in categories of Automobile, Restaurant, help/emergency, shopping, home/housekeeper, greetings, telephone/postal, directions/signs, socialising, tours/leisure, travel/hotel, medical, legal, business, technical terms. It will cover major Indian Languages viz. Hindi, Tamil, Malayalam, Kannada, Telugu, Bengali, Oriya with English as the interface.The features of the Pocket Translator are:


Language Translation
Language Phrases
Personal data with security lock
Address and Phone diary
Currency convertor
World Time Clock
Consumer Information
Games

       A prototype pocket translator for Hindi with 30,000 bi-lingual dictionary and 300 phrases has been developed to establish the feasibility of technology.