eISSN:2278-5299

International Journal of Latest Research in Science and Technology

DOI:10.29111/ijlrst   ISRA Impact Factor:3.35

COMPARATIVE STUDIES FOR SPEECH ANALYSIS BASED ON MULTIRESOLUTION SPECTROGRAMS

Research Paper Open Access

International Journal of Latest Research in Science and Technology, Vol. 3, Issue 2, pp. 50-54, Year 2014

Nefissa Annabi-Elkadri

Received: 22 April 2014; Accepted: 25 April 2014; Published: 30 April 2014

Article No. 10276
Abstract

This paper presents an evaluation study of the Multiresolution Spectral Analysis (MRS) method, which provides higher temporal accuracy in the upper spectral region and better frequency resolution in the lower spectral range. We demonstrate the usefulness of this tool on two tasks: automatic detection of transition zones and automatic silence/sonorant/non-sonorant classification. Using our Visual Assistance of Speech Processing (VASP) system and two corpora, we compare our approach with existing methods based on MRS and on classical spectral analysis. On both tasks, our approach appears to yield better results than the other methods.
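
The time/frequency trade-off described in the abstract can be illustrated with a minimal sketch. This is not the MRS method evaluated in the paper; it is an assumed two-window simplification in which a long analysis window is used below a split frequency (finer frequency bins) and a short window above it (more frames per second). The function names, the 2000 Hz split, and the window lengths are all hypothetical choices made for illustration.

```python
import numpy as np


def stft_mag(x, win_len, hop):
    """Magnitude STFT with a Hann window, shape (n_frames, win_len // 2 + 1)."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))


def multires_spectrogram(x, fs, split_hz=2000.0, long_win=512, short_win=128):
    """Illustrative two-band analysis (assumed, not the paper's algorithm).

    Below split_hz: long window, so frequency bins are narrow.
    At or above split_hz: short window, so frames are closely spaced in time.
    Returns (low_band, low_freqs, high_band, high_freqs).
    """
    low = stft_mag(x, long_win, long_win // 2)      # 50% overlap
    high = stft_mag(x, short_win, short_win // 2)   # 50% overlap

    # Bin centre frequencies in Hz for each window length.
    low_f = np.arange(long_win // 2 + 1) * fs / long_win
    high_f = np.arange(short_win // 2 + 1) * fs / short_win

    low_mask = low_f < split_hz
    high_mask = high_f >= split_hz
    return low[:, low_mask], low_f[low_mask], high[:, high_mask], high_f[high_mask]


if __name__ == "__main__":
    fs = 16000
    t = np.arange(fs) / fs
    # Synthetic test signal: one low tone and one high tone.
    x = np.sin(2 * np.pi * 440 * t) + np.sin(2 * np.pi * 5000 * t)
    low_band, low_f, high_band, high_f = multires_spectrogram(x, fs)
    print("low band (frames, bins):", low_band.shape)
    print("high band (frames, bins):", high_band.shape)
```

In this sketch the low band has fewer frames but narrower bins, while the high band has more frames but wider bins, which is the qualitative behaviour the abstract attributes to multiresolution analysis.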

Key Words   
Multiresolution Spectrogram, transition zones detection, Silence/Sonorant/Non-Sonorant detection, Com
To cite this article

Nefissa Annabi-Elkadri, "Comparative Studies For Speech Analysis Based On Multiresolution Spectrograms", International Journal of Latest Research in Science and Technology, Vol. 3, Issue 2, pp. 50-54, 2014

