Hossein Sameti.jpg

 

Hossein Sameti
Associate Professor

Department of Computer Engineering
Sharif University of Technology

  Tel

+98 21 66166637

  Fax

+98 21 66019246

  Mailing address

Room 706, Department of Computer Engineering, Sharif University of Technology, Azadi Avenue, Tehran 14588, Iran

  E-Mail

sameti [at] sharif [dot] edu

 

About Our Group

Education

Research Interests

 Publications

Courses

Students

Speech Processing Lab

AI GROUP ANNOUNCEMENTS

 

 :: About Our Group

 

 

Director of the speech processing research group, Sharif University of Technology

This research group started to work in 1999 and it has about 32 researchers presently. Our major projects are as follows:

1)    A Persian (Farsi) large vocabulary continuous speech recognition engine which can be utilized in different speech recognition products. The project includes research in different areas of automatic speech recognition and incorporation of different known methods for the Persian language. The system is a large vocabulary speaker-independent continuous speech recognition (LVCSR) system based on hidden Markov models (HMM). The vocabulary of the ASR system consists of more than 65,000 most common words of the Persian language.

2)    Spoken dialogue systems for the Persian language. We are developing the first speech enabled IVR system and dialogue systems in Persian. The system is developed for banking and ticketing applications and is launched for evaluation.

3)    Persian text to speech engine. This engine provides the most natural speech in Persian and includes very sophisticated text to phoneme transformer. It can handle complexities of Persian text such as kasre-ezafe and numerous homographs.

4)    Speaker identification over phone line.

5)    Keyword spotting

For more information about the group visit our lab website spl.ce.sharif.edu. Also the company web site, www.asr-gooyesh.com contains more detailed information about our speech products.

 

 

 

Up

 :: Education

 

 

 

 

 

Up

 :: Research Interests

 

 

  • Speech Processing
  • Spoken Dialogue Systems
  • Natural Language Understanding
  • Automatic Speech Recognition
  • Natural Language Modeling
  • Hidden Markov Modeling of Speech
  • Speech Enhancement

 :: Publications

 

 

Journal Papers

 

[1] H. Veisi and H. Sameti, "The integration of principal component analysis and cepstral mean subtraction in parallel model combination for robust speech recognition," Digital Signal Processing, Elsevier, vol. In Press, 2010.

[2] N. Najkar, F. Razzazi, and H. Sameti, "A novel approach to HMM-based speech recognition systems using particle swarm optimization," Mathematical and Computer Modelling, Elsevier, vol. 52, pp. 1910 -1920, 2010.

[3] M. Habibi, H. Sameti, and H. Setareh, "On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data," Journal of Advances in Computer Research, vol. 1, pp. 31 - 39, Autumn 2009 2010.

[4] M. Bahrani and H. Sameti, "A New Bigram-PLSA Language Model for Speech Recognition," EURASIP Journal on Advances in Signal Processing, Hindawi, vol. 2010, pp. 1-8, 2010.

[5] B. BabaAli, H. Sameti, and T. H. Falk, "A model distance maximizing framework for speech recognizer-based speech enhancement," AEU - International Journal of Electronics and Communications, Elsevier, vol. In Press, 2010.

[6] B. Babaali, H. Sameti, and M. Safayani, "Likelihood Maximizing Based Multi-Band Spectral Subtraction for Robust Speech Recognition," EURASIP Journal on Advances in Signal Processing, Hindawi, vol. 2009, pp. 1-15, 2009.

[7] S. Shirali-Shahreza, H. Sameti, and M. Shirali-Shahreza, "Parental control based on speaker class verification," IEEE Transactions on Consumer Electronics, vol. 54, pp. 1244 -1251, 2008.

[8] M. Bahrani and H. Sameti, "Extraction and Modeling Context-Dependent Phone Units for Improvement of Continuous Speech Recognition Accuracy by Phoneme Clustering," Iranian Journal of Electrical and Computer Engineering, vol. 3, pp. 45-51 (In Persian), June 2004.

[9] H. Sameti and L. Deng, "Nonstationary-state hidden Markov model representation of speech signals for speech enhancement," Signal Processing, Elsevier, vol. 82, pp. 205-227, 2002.

[10] H. Sameti, H. Sheikhzadeh, L. Deng, and R. L. Brennan, "HMM-Based Strategies for Enhancement of Speech Signals Embedded in Nonstationary Noise," IEEE Transactions on Speech and Audio Processing vol. 6, pp. 445-455, 1998.

[11] L. Deng and H. Sameti, "Transitional speech units and their representation by regressive Markov states: applications to speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 4, pp. 301-306, 1996.

[12] L. Deng and H. Sameti, "Articulatory phonology and speech recognition: A case study involving use of dynamically defined speech primitives," The Journal of the Acoustical Society of America, vol. 95, p. 2870, June 1994 1994.

 

Conference Papers:

 

[13] M. Habibi, S. Rahbar, and H. Sameti, "Divided POMDP Method for Complex Menu Problems in Spoken Dialogue Systems," in IEEE Workshop on Spoken Language Technology, SLT 2010, Berkley, California, USA, 2010.

[14] S. Maryooriad, H. Sameti, and H. Veisi, "An Algebraic Gain Estimation Method to Improve the Performance of HMM-Based Speech Enhancement Systems," in 18th Iranian Conference on Electrical Engineering, Isfahan, 2010.

[15] M. Habibi, H. Sameti, and H. Setareh, "On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data," in ISSPA2010 International Conference on Information Sciences, Signal Processing and their Applications, Kuala Lumpur, Malaysia, 2010, pp. 133-136.

[16] M. H. Bokaei, H. Sameti, M. Bahrani, and B. Babaali, "Segmental HMM-Based part-of-speech tagger," in 2010 International Conference on Audio, Language and Image Processing, Shanghai, China, 2010.

[17] H. Veisi and H. sameti, "An Improved Parallel Model Combination Method for Noisy Speech Recognition," in The eleventh biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, 2009, pp. 237-242.

[18] S. Shirali-Shahreza, H. abolhassani, H. Sameti, and M. Shirali-Shahreza, "Spoken CAPTCHA: A CAPTCHA System for Blind Users," in 2009 ISECS International Colloquium on Computing, Communication, Control, and Management, CCCM2009, 8-9 August, Sanya, China, 2009, pp. 1-4.

[19] N. Najkar and H. Sameti, "A Novel Approach to HMM-Based Speech Recognition System Using Particle Swarm Optimization " in The Fourth International Conference on Bio-Inspired Computing: Theories and Applications, BIC-TA 2009, Beijing, China, 2009, pp. 296-301.

[20] S. Momtazi and H. Sameti, "A Possibilistic Approach for Building Statistical Language Models," in Ninth International Conference on Intelligent Systems Design and Applications (ISDA09), Pisa, Italy, 2009, pp. 1014-1018.

[21] F. Khalili, M. Ardebilipour, and H. Sameti, "Design and implementation of vector quantizer for a 600 bps vocoder based on MELP," in Proceedings of the 11th international conference on Advanced Communication Technology (ICACT 2009) - Volume 2, Gangwon-Do, South Korea, 2009, pp. 1487-1490.

[22] F. Kaveh, H. Sameti, and M. Bahrani, "Combining Language Models for Persian Speech Understanding in a Dialogue System," in 14th Annual Computer Society of Iran Computer Conference (CSICC09), Tehran, Iran, 2009.

[23] P. Habashi and H. Sameti, "Unit Selection Method for Persian Speech Synthesis Using Fastvox," in 14th Annual Computer Society of Iran Computer Conference (CSICC09), Tehran, Iran, 2009.

[24] M. Ghaemmaghmi, H. Sameti, F. Razzazi, S. Dabbaghchian, and B. Babaali, "Robust Speech Recognition Using MLP Neural Network in Log-Spectral Domain," in IEEE International Symposium on Signal Processing and Information Technology (ISSPIT2009), Ajman, UAE, 2009.

[25] M. Ghaemmaghmi, H. Sameti, F. Razzazi, S. Dabbaghchian, and B. Babaali, "Noise Reduction Algorithm for Robust Speech Recognition Using MLP Neural Network," in 2009 Asia-Pacific Conference on Computational Intelligence and Industrial Applications (PACIIA 2009), Wuhan, China, 2009.

[26] H. Veisi and H. Sameti, "The Combination of CMS with PMC for Improving the Robustness of Speech Recognition Systems," in Advances in Computer Science and Engineering, CCIS 6, Springer Berlin Heidelberg, Proceedings of 13th International CSI Computer Conference, CSICC 2008, Kish Island, Iran, March 9-11, Revised Selected Papers, 2008, pp. 825-829.

[27] H. Veisi and H. Sameti, "Improving the performance of speech recognition systems using fault-tolerant techniques," in 9th International Conference on Signal Processing, ICSP'08, 26-29 October, Beijing, China, 2008, pp. 579-582.

[28] M. Soufifar, H. Sameti, B. Babaali, and K. Hosseinzadeh, "Introducing a method for using monogram and trigram combinational predictive language model in large vocabulary continuous speech recognition," in 13th International CSI Computer Conference, CSICC 2008, March 9-11, Kish Island, Iran, 2008, p. (In Persian).

[29] H. Sameti, H. Veisi, M. Bahrani, B. Babaali, and K. Hosseinzadeh, "Nevisa, a Persian Continuous Speech Recognition System," in Advances in Computer Science and Engineering,  CCIS 6, Springer Berlin Heidelberg, Proceedings of 13th International CSI Computer Conference, CSICC 2008 Kish Island, Iran, March 9-11, 2008 Revised Selected Papers 2008, pp. 485-492.

[30] H. Sajedi, H. Sameti, and H. Beigi, "MPSO: An algorithm for finding global optimum in complex problems," in 13th International CSI Computer Conference, CSICC 2008, March 9-11, Kish Island, Iran, 2008, p. (In Persian).

[31] Y. Poorebrahim, H. Sameti, and H. Zakipoor, "Implementation of a 1200 bps vocoder based on MELP " in 16th Iranian Conference on Electrical Engineering, Tehran, Iran, 2008, pp. 396-401 (In Persian).

[32] N. Nasiri, H. Sameti, M. Bahrani, and B. Babaali, "Triphone modeling in HMM-based Persian continuous speech recognition systems," in 13th International CSI Computer Conference, CSICC 2008,  March 9-11, Kish Island, Iran, 2008, p. (In Persian).

[33] S. Khorram, H. Sameti, H. Veisi, and H. R. Abutalebi, "A New Lattice LP-Based Post-Filter for Adaptive Noise Cancellers in Mobile and Vehicular Applications," in The 8th IEEE Symposium on Signal Processing and Information Technology, ISSPIT 2008, 16-19 December, Sarajevo, Bosnia & Herzegovina, 2008, pp. 407-412.

[34] S. Khorram, H. Sameti, and H. Veisi, "LP-based over-sampled subband Adaptive Noise Canceller for speech enhancement in diffuse noise fields," in 9th International Conference on Signal Processing, ICSP'08, Beijing, China, 2008, pp. 157-161.

[35] M. Bahrani, H. Sameti, N. Hafezi, and S. Momtazi, "A New Word Clustering Method for Building N-Gram Language Models in Continuous Speech Recognition Systems," in Lecture Notes in Computer Science, Proceedings of IEA/AIE 2008, 18-20 June, Wroclaw, Poland, 2008, pp. 286-293.

[36] M. Bahrani, H. Sameti, N. Hafezi, and S. Momtazi, "Automatic word clustering based on grammar for Persian continuous speech recognition systems," in 13th International CSI Computer Conference, CSICC 2008, March 9-11, Kish Island, Iran, 2008, p. (In Persian).

[37] B. Babaali, H. Sameti, and H. Veisi, "Using vocal tract length normalization in a HMM-based Persian continuous speech recognition system," in 13th International CSI Computer Conference, CSICC 2008, March 9-11, Kish Island, Iran, 2008, p. (In Persian).

[38] B. BabaAli, H. Sameti, and M. Safayani, "Spectral Subtraction in Likelihood-Maximizing Framework for Robust Speech Recognition," in Interspeech 2008, 22-26 September, Brisbane, Australia, 2008, pp. 980-983.

[39] B. BabaAli, H. Sameti, and M. Safayani, "Spectral subtraction in Model Distance Maximizing framework for robust speech recognition," in 9th International Conference on Signal Processing,  ICSP'08, 26-29 October, Beijing, China, 2008, pp. 627-630.

[40] A. Asaei, M. J. Taghizadeh, and H. Sameti, "Far-field continuous speech recognition system based on speaker Localization and sub-band Beamforming," in Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on, Doha, Qatar, 2008, pp. 495-500.

[41] B. Abbassi, H. Sameti, B. Babaali, and M. Bahrani, "Pronunciation variation modeling in a Persian continuous speech recognition system," in 13th International CSI Computer Conference, CSICC 2008, Kish Island, Iran, March 9-11, , Kish Island, Iran, 2008, p. (In Persian).

[42] H. Veisi and H. Sameti, "Noise and speaker robustness in a Persian continuous speech recognition system," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, Sharjah, UAE, 2007, pp. 1-4.

[43] S. Vaisipour, B. Babaali, and H. Sameti, "Using and evaluating new confidence measures in word-based isolated word recognizers," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, Sharjah, UAE, 2007, pp. 1-4.

[44] H. Sajedi, H. Sameti, H. Beigi, and B. Babaali, "Discriminative training of hidden Markov models using PSO algorithm," in 12th Annual Computer Society of Iran Computer Conference (CSICC07), February 20-22, Tehran, Iran, 2007, pp. 295-302 (In Persian).

[45] H. Sajedi, M. Jamzad, H. Sameti, and B. Babaali, "A clustering method for recognition of online Farsi letters using hidden Markov models," in 12th Annual Computer Society of Iran Computer Conference (CSICC07), February 20-22, Tehran, Iran, 2007, pp. 419-426 (In Persian).

[46] M. Safayani, H. Sameti, B. Babaali, and M. T. Manzuri Shalmani, "An efficient multi-band spectral subtraction method for robust speech recognition," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, Sharjah, UAE, 2007, pp. 1-4.

[47] M. Safayani, H. Sameti, B. Babaali, and M. T. Manzuri, "Adaptation of spectral subtraction for improving the performance of speech recognition systems," in 12th Annual Computer Society of Iran Computer Conference (CSICC07), February 20-22, Tehran, Iran, 2007, pp. 338-343 (In Persian).

[48] M. Safayani, B. Babaali, M. T. M. Shalmani, H. Sameti, and S. Khaleghi, "Compensation of Channel and Noise Distortions Combining Maximum Likelihood based Spectral Subtraction and Normalization," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, February 12-15, Sharjah, UAE, 2007, pp. 508-511.

[49] S. Momtazi, H. Sameti, S. Vaisipour, and M. Tefagh, "Introducing a Framework to Create Telephony Speech Databases from Direct Ones," in 14th International Conference on Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services. , Maribor, Slovenia, 2007, pp. 327-330.

[50] S. Momtazi, H. Sameti, M. Fazel-Zarandi, and M. Bahrani, "Robust parsing for word lattices in Continuous Speech Recognition systems," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, Sharjah, UAE, 2007, pp. 1-4.

[51] S. Momtazi, H. Sameti, M. Bahrani, and N. Hafezi, "A POS-based fuzzy word clustering algorithm for continuous speech recognition systems," in IEEE 9th International Symposium on Signal Processing and Its Applications,  ISSPA 2007, Sharjah, UAE, 2007, pp. 1-4.

[52] S. Momtazi, H. Sameti, M. Bahrani, and N. Hafezi, "A fuzzy word clustering approach for for building statistical language models," in 12th Annual Computer Society of Iran Computer Conference (CSICC07), February 20-22, Tehran, Iran, 2007, pp. 508-512 (In Persian).

[53] A. Karimi, H. Sameti, and F. Amin, "A Proposed Criterion for the Quality Assessment of Watermarked Images Based on the Human Visual System," in 15th Iranian Conference on Electrical Engineering, Tehran, Iran, 2007, p. (In Persian).

[54] A. Jalali, S. M. Moosavi, H. Sameti, and H. Rabiee, "Improving the Performance of Generalized Side-lobe Canceller Method in Speech Enhancement," in 15th Iranian Conference on Electrical Engineering, Tehran, Iran, 2007.

[55] A. Asaei, H. Sameti, H. Rabiee, and M. Taghizadeh, "Robust Recognition of Persian Speech based on Speaker Localization and Beam Forming in Subband using Microphone Array," in 15th Iranian Conference on Electrical Engineering, Tehran, Iran, 2007.

[56] A. Asaei, H. Sameti, and M. S. Moeen, "Robust speaker localization using beamforming and a new harmonic-based filter," in 12th Annual Computer Society of Iran Computer Conference (CSICC07), February 20-22, Tehran, Iran, 2007, pp. 359-364 (In Persian).

[57] H. Veisi, H. Sameti, B. Babaali, K. Hosseinzadeh, and M. T. Manzuri, "Improving the Robustness of Persian Large Vocabulary Continuous Speech Recognition System for Real Applications," in IEEE 2nd Conference on Information and Communication Technologies: from Theory to Applications, ICTTA '06, Damascus, Syria, 2006, pp. 1293-1297.

[58] S. Sharifi, M. Habibi, H. Rahimzadeh, M. Jamzad, and H. Sameti, "Face Animation Learning for Persian Speech Based on 3D Face Tracking," in 11th Annual Computer Society of Iran Computer Conference (CSICC06), January 24-26, Tehran, Iran, 2006, pp. 874-878 (In Persian).

[59] H. Sajedi, B. Babaali, and H. Sameti, "Incorporating the Genetic algorithm techniques in the training process of a speech recognition system," in The 14th Iranian Conference in Electrical Engineering (ICEE), 2006.

[60] M. Safayani, B. Babaali, H. Sameti, and H. R. Abutalebi, "Experiments of Distant Talking Command Speech Recognition Based on a Microphone Array and Robustness Methods," in IEEE 2nd Conference on Information and Communication Technologies: from Theory to Applications, ICTTA '06, Damascus, Syria, 2006, pp. 1236-1241.

[61] N. Montazeri, G. Ghassem-Sani, and H. Sameti, "A fast and robust parser based on the Viterbi algorithm," in 11th Annual Computer Society of Iran Computer Conference (CSICC06), January 24-26, Tehran, Iran, 2006, pp. 473-478.

[62] S. Momtazi, M. Fazel Zarandi, H. Sameti, and M. Bahrani, "Using a Partial Robust Parser in Continuous Speech Recognition Systems," in 11th Annual Computer Society of Iran Computer Conference (CSICC06), January 24-26, Tehran, Iran, 2006, pp. 810-814 (In Persian).

[63] M. Bahrani, H. Sameti, N. Hafezi, and H. Movasagh, "Building and Incorporating Language Models for Persian continuous speech Recognition system " in The fifth international conference on Language Resources and Evaluation (LREC), Genoa, Italy, 2006.

[64] H. Veisi, H. Sameti, and H. R. Abutalebi, "On the using of Principal Component Analysis on the Farsi Continuous Speech Recognition for Feature Robustness and Feature Reductions," in 10th Annual Computer Society of Iran Computer Conference (CSICC05), Tehran-Iran, 2005, pp. 242-251.

[65] H. Veisi, A. Fazel Dehkordi, and H. Sameti, "Increasing the digit recognition rate in noise using noise robust features," in 13th Iranian Conference on Electrical Engineering (ICEE 2005), Zanjan, Iran, 2005.

[66] M. Sabetian and H. Sameti, "Data Hiding in Speech Using FFT Coefficients," in 10th Annual Computer Society of Iran Conference (CSICC05), February 15-17, Tehran, Iran, 2005, pp. 270-279 (In Persian).

[67] H. Movasagh and H. Sameti, "Two-step synchronous phoneme based search for continuous speech recognition," in First International Conference on Modeling, Simulation, and Applied Optimization (ICMSA0/05), Sharjah, U.A.E, 2005.

[68] R. Halavati, S. Bagheri Shouraki, H. Sameti, S. Haratizadeh, and B. Babaali, "A Novel Noise Immune Speech Recognition Approach," in 11th International Fuzzy Sets Association (IFSA) World Congress, Beijing, China, 2005, pp. 972-976.

[69] N. Hafezi, M. Bahrani, H. Movasagh, and H. Sameti, "Extracting Statistical Language Models for Continuous Speech Recognition Systems Using Persian Text Corpus," in 7th Conference of Intelligent Systems, Tehran, Iran, 2005.

[70] A. Fazel Dehkordi, H. Sameti, and M. T. Manzoori, "Introducing a new robust feature of root mean cepstral coefficients and comparison of robust feature based methods in the presence of telephone line noise," in 10 Annual Computer Society of Iran Conference (CSICC05), February 15-17, Tehran, Iran, 2005, pp. 260-269 (In Persian).

[71] A. Fazel Dehkordi, H. Sameti, and S. K. A. Ghiathi, "A New Feature Extraction Motivated by Human Ear," in First International Conference on Modeling, Simulation and Applied Optimization, ICMSA0/05, Sharjah, UAE, 2005, pp. 1-4.

[72] A. Fazel Dehkordi, H. Sameti, and M. Bahrani, "Designing and Preparing of FARSI Connected Digit Database and Using it in a Word Based Automatic Speech Recognition System " in 13th Iranian Conference on Electrical Engineering, Zanjan, Iran, 2005.

[73] M. Bahrani, B. Babaali, and H. Sameti, "Using language models in continuous Persian speech recognition," in 7th Conference of Intelligent Systems, Tehran, Iran, 2005.

[74] B. Babaali, M. Bahrani, H. beigi, and H. Sameti, "Telephony Isolated Digit Recognition using Support Vector Machine," in 7th Conference of Intelligent Systems, Tehran, Iran, 2005.

[75] B. Babaali, M. R. Bagheri, K. Hosseinzadeh, M. Bahrani, and H. Sameti, "A phoneme to word decoder based on lexicon tree for Persian speech recognition " in 10th Annual Computer Society of Iran Computer Conference (CSICC05), February 15-17, Tehran, Iran, 2005, pp. 321-329 (In Persian).

[76] H. Sameti, H. Movasagh, B. Babaali, M. Bahrani, K. Hosseinzadeh, A. Fazel Dehkordi, H. R. Abutalebi, H. Veisi, Y. mokri, N. Montazeri, and M. Nezami Ranjbar, " Large Vocabulary Persian Speech Recognition System " in The 1st workshop on Persian language and computer, Tehran, Iran, 2004.

[77] K. Hosseinzadeh, H. Sameti, H. R. Abutalebi, and A. Fazel Dehkordi, "MLLR method for environmental adaptation in the continuous FARSI speech recognition," in The 6th Conference on Intelligent Systems, November 26-27, Kerman, Iran, 2004.

[78] B. Babaali and H. Sameti, "The Sharif Speaker-Independent Large Vocabulary Speech Recognition System," in The 2nd Workshop on Information Technology & Its Disciplines (WITID 2004), Kish, Iran, 2004, pp. 24-26.

[79] H. Jahani, H. Sameti, and N. H. Gharavi, "Introducing a new attach on the stop/go waterfall system with more than 10 steps," in 2nd Iranian Society of Cryptology Conference, Tehran, Iran, 2003.

[80] N. Gharavi, H. Sameti, and A. Ghaemi Bafghi, "New method of dual transfers for optimum implementation of LFSRs in software applications," in 8th International CSI Computer Conference, CSICC03, February 25-27, Ferdowsi University, Mashad, Iran, 2003, pp. 298-304 (In Persian).

[81] H. Sameti, M. A. Ghafoorian, M. Bijankhan, S. A. Seyyedsalehi, and J. Sheikhzadegan, "Incorporating Training Capability in SHENAVA Farsi Speech Recognition System to Increase System Performance in Speaker-Dependent Applications," in First EurAsian Conference on Advances in Information and Communication Technology, October 29-31, Shiraz, Iran, 2002.

[82] H. Sameti and M. A. Ghafoorian, "Automatic Phoneme Endpoint Detection Using HMM and Viterbi Segmentation " in First EurAsian Conference on Advances in Information and Communication Technology, October 29-31, Shiraz, Iran, 2002.

[83] F. Almasganj, S. A. Seyedsalehi, M. Bijankhan, H. Sameti, and J. Sheikhzadegan, "SHENAVA-1 a Farsi Spontaneous Speech Recognition System," in Proceedings of ICEE 2001, 2001.

[84] H. Sheikhzadeh, R. L. Brennan, and H. Sameti, "Real-time implementation of HMM-based MMSE algorithm for speech enhancement in hearing aid applications," in Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., International Conference on, Detroit, USA, 1995, pp. 808-811 vol.1.

[85] L. Deng, G. Ramsay, and H. Sameti, "From modeling surface phenomena to modeling mechanisms: Towards a faithful model of the speech process aiming at speech recognition," in 1995 IEEE Workshop on Automatic Speech Recognition, Snowbird, Utah, 1995, pp. 183-184.

[86] L. Deng, J. Nu, and H. Sameti, "Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units," in Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, Detroit, USA, 1995, pp. 385-388.

[87] H. Sheikhzadeh, H. Sameti, L. Deng, and R. L. Brennan, "Comparative performance of spectral subtraction and HMM-based speech enhancement strategies with application to hearing and design," in Acoustics, Speech, and Signal Processing, ICASSP-94., 1994 IEEE International Conference on, Adelaide, Australia, 1994, pp. I/13-I/16 vol.1.

[88] L. Deng and H. Sameti, "Automatic Speech Recognition Using Dynamically Defined Speech Units," in Third International Conference on Spoken Language Processing, ICSLP94, Yokohama, Japan, 1994.

 

Books:

 

1. H. Sameti and M. Bijankhan, Persian Language and Computer, Volume 1, SAMT, 2010, 623 pages (In Persian).

2. H. Sameti and M. Bijankhan, Persian Language and Computer, Volume 2, SAMT, 2010, 671 pages (In Persian).

 

 

 

 

 

 :: Courses

 

 

  • Speech Recognition, graduate
  • Speech Processing,  graduate
  • Digital Signal Processing, graduate
  • Speech Enhancement,  graduate
  • Probability and Statistics, undergraduate
  • Signals and Systems, undergraduate

 

 

 

 :: Students

 

 

Ph.D. Students:

 

 

 

  • Bagher Babaali
  • Mohammad Bahrani
  • Hadi Veisi
  • Elahe Hosseini
  • Soheil Khorram 

 

 

 

 

 

 

 :: Speech Processing Lab (SPL)

 

 

SPL is a research laboratory of the Department of Computer Engineering at Sharif University Of Technology. The main area of the research in this lab is speech and language processing. Automatic speech recognition is a major field of research of this group and a speech recognition engine including the latest algorithms is developed. This engine is customized for the Persian language including Persian language models (statistical and grammatical) and the first version is released as NEVISA, the most robust and accurate Persian dictation system. SPL also developed NEWSHA, the high performance Persian telephony IVR and is expanding it towards a complete spoken dialogue system. The research fields of this group are:

  • Digital Speech Processing
  • Robust Speech Recognition
  • Spoken Dialogue Systems
  • Speech Synthesis (Text-To-Speech)
  • Speech Enhancement
  • Natural Language Modeling
  • Speech Database Design
  • Pattern Recognition

More information about this group is available in spl.ce.sharif.edu.