Research Article

TRANSLINGUATOR: Web based application for Speech Translation of Human Voices based on Voice Forensics without Changing the Source Voice

Published on May 2012 by B. Naveen Raj, T. Pearson
National Conference on Advances in Computer Science and Applications (NCACSA 2012)
Foundation of Computer Science USA
NCACSA - Number 4

B. Naveen Raj, T. Pearson. TRANSLINGUATOR: Web based application for Speech Translation of Human Voices based on Voice Forensics without Changing the Source Voice. National Conference on Advances in Computer Science and Applications (NCACSA 2012). NCACSA, 4 (May 2012), 5-8.

@article{
author = { B. Naveen Raj, T. Pearson },
title = { TRANSLINGUATOR: Web based application for Speech Translation of Human Voices based on Voice Forensics without Changing the Source Voice },
journal = { National Conference on Advances in Computer Science and Applications (NCACSA 2012) },
issue_date = { May 2012 },
volume = { NCACSA },
number = { 4 },
month = { May },
year = { 2012 },
issn = {0975-8887},
pages = { 5-8 },
numpages = 4,
url = { /proceedings/ncacsa/number4/6498-1023/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Conference on Advances in Computer Science and Applications (NCACSA 2012)
%A B. Naveen Raj
%A T. Pearson
%T TRANSLINGUATOR: Web based application for Speech Translation of Human Voices based on Voice Forensics without Changing the Source Voice
%J National Conference on Advances in Computer Science and Applications (NCACSA 2012)
%@ 0975-8887
%V NCACSA
%N 4
%P 5-8
%D 2012
%I International Journal of Computer Applications
Abstract

Translinguator is a web-based application that can be deployed using cloud computing technology. It can be integrated with devices such as mobile phones and tablets, or built with dedicated hardware as an independent device. It works by integrating several existing techniques in a specific sequence to obtain the desired output. The application receives the voice input of a speaker in a foreign language and translates it into a language known to the listener. Its distinguishing feature is that it produces the translated output in the language chosen by the listener (the person using the device) and in the same voice as the speaker, using voice forensic analysis of the input speech. It employs common algorithms used in speech recognition engines (SRE), text-to-speech (TTS) converters, and voice comparators and morphers. The device or application can benefit a wide range of users, such as students, travellers, and PROs. It is economical to deploy either as a dedicated device or as an embedded web-based application.
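
The "specific sequence" of stages described in the abstract can be pictured as a simple pipeline. The Python sketch below is illustrative only and is not the authors' implementation: the stage order, the `Utterance` fields, and every function name are assumptions, and each stage is a placeholder that manipulates bytes and strings instead of calling a real speech recognition, translation, synthesis, or voice morphing engine.

```python
from dataclasses import dataclass, replace
from typing import Callable, Dict, Iterable


@dataclass(frozen=True)
class Utterance:
    """Data passed between stages; all field names are hypothetical."""
    audio: bytes             # input speech, later the synthesized output
    text: str = ""           # recognized or translated text
    language: str = ""       # language code of `text`
    voice_profile: str = ""  # stand-in for the speaker's voice characteristics


Stage = Callable[[Utterance], Utterance]


def analyze_voice(u: Utterance) -> Utterance:
    # Placeholder for voice forensic analysis: derive a fake profile label.
    return replace(u, voice_profile=f"profile-{len(u.audio)}")


def recognize_speech(u: Utterance) -> Utterance:
    # Placeholder for the SRE: pretend the audio bytes are UTF-8 French text.
    return replace(u, text=u.audio.decode("utf-8"), language="fr")


def make_translator(target_language: str, glossary: Dict[str, str]) -> Stage:
    # Placeholder for translation: a dictionary lookup on the whole phrase.
    def translate(u: Utterance) -> Utterance:
        return replace(u, text=glossary.get(u.text, u.text),
                       language=target_language)
    return translate


def synthesize_speech(u: Utterance) -> Utterance:
    # Placeholder for TTS: encode the translated text back into bytes.
    return replace(u, audio=u.text.encode("utf-8"))


def morph_voice(u: Utterance) -> Utterance:
    # Placeholder for the voice morpher: tag the output with the profile.
    return replace(u, audio=u.voice_profile.encode("utf-8") + b"|" + u.audio)


def run_pipeline(u: Utterance, stages: Iterable[Stage]) -> Utterance:
    # Apply each stage in order, feeding its result to the next one.
    for stage in stages:
        u = stage(u)
    return u


def translinguate(audio: bytes, target_language: str,
                  glossary: Dict[str, str]) -> Utterance:
    stages = [
        analyze_voice,
        recognize_speech,
        make_translator(target_language, glossary),
        synthesize_speech,
        morph_voice,
    ]
    return run_pipeline(Utterance(audio=audio), stages)
```

Calling `translinguate(b"bonjour", "en", {"bonjour": "hello"})` runs all five placeholder stages in order. Keeping each stage as a plain function from `Utterance` to `Utterance` means a real engine could replace any placeholder without touching the others.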

Index Terms

Computer Science
Information Sciences

Keywords

Cloud Computing, Speech Synthesis, Voice Forensics, TTS (Text-to-Speech), Web Technology, SRE (Speech Recognition Engine)