:: About me
Director of the Speech Processing Lab (SPL), Sharif University of Technology
This research group started work in 1999. It is developing a Persian
(Farsi) speech recognition engine that can be used in different speech
recognition products. The project includes research in different areas of
automatic speech recognition and the incorporation of established methods
for the Persian (Farsi) language. The system is a Large Vocabulary
Speaker-Independent Continuous Speech Recognition (LV-SICSR) system based
on Hidden Markov Models (HMM) with a selectable lexicon of at least 1000
words. The first versions of this system, called NEVISA, are complete and
include many robustness methods and NLP approaches. Work on extending the
system to 10,000 words is in progress. Presently, 25 researchers are
involved in the project. For more information about the group, see
spl.ce.sharif.edu. Our official web site, www.asr-gooyesh.com, contains
more detailed information about our speech products.

:: Education

:: Research Interests
- Speech Processing
- Automatic Speech Recognition
- Hidden Markov Modeling of Speech
- Speech Enhancement

:: Publications
- Momtazi S., Fazel M., Sameti H., Bahrani M.,
Robust Parsing for Word Lattices in Continuous Speech Recognition
Systems, 11th International CSI Computer Conference, Tehran, Iran,
Jan 2006. (in Persian)
- Bahrani M., Sameti H., Extraction and
Modeling Context Dependent Phone Units for Improvement of Continuous
Speech Recognition Accuracy by Phonemes Clustering, submitted to
Iranian Journal of Electrical and Computer Engineering, Vol 2, 2005.
(in Persian)
- Hafezi N., Bahrani M., Movasagh H., Sameti
H., Extracting Statistical Language Models for Continuous Speech
Recognition Systems Using Persian Text Corpus, 7th Conference of
Intelligent Systems, Dec 2005. (in Persian)
- Sajedi H., Babaali B., Sameti H., Incorporating the
Genetic algorithm techniques in the training process of a Speech
Recognition System, the 14th Iranian Conference in Electrical
Engineering (ICEE), Tehran-Iran, 2005.
- Halavati R., Bagheri Shouraki S., Sameti H., Harati
Zadeh S., Babaali B., A Novel Noise Immune Speech Recognition
Approach, 11th IFSA World Congress, Beijing, China, 2005. - BEST
STUDENT PAPER
- Babaali B., Bagheri M.R, Hosseinzadeh Kh., Bahrani
M., Sameti H., A phoneme to word decoder based on lexicon tree for
Persian speech recognition, 10th Annual Computer Society of Iran Computer
Conference (CSICC), Tehran-Iran, 2005.
- Bahrani M., Sameti H., Robust Parsing for Word
Lattices in Continuous Speech Recognition Systems, ISSPIT 2005.
- Movasagh H., Bahrani M., Sameti H., Building
and Incorporating Language Models for Persian continuous speech
Recognition system, ISSPIT 2005.
- Movasagh H., Sameti H., Two-step synchronous
phoneme based search for continuous speech Recognition, ICMSAO, 2005.
- Veisi H., Fazel A., Sameti H., Increasing the
digit recognition rate in noise using noise robust features, Iranian
Conference in Electrical Engineering (ICEE), Zanjan, Iran, 2005.
- Veisi H., Sameti H., Abutalebi H.R., On the
using of Principal Component Analysis on the Farsi Continuous Speech
Recognition for Feature Robustness and Feature Reductions, 10th
Annual Computer Society of Iran Computer Conference (CSICC),
Tehran-Iran, 2005. (in Persian)
- Fazel Dehkordi A., Sameti H., Bahrani M.,
Designing and Preparing of FARSI Connected Digit Database and Using it
in a Word Based Automatic Speech Recognition System, 13th Iranian
Conference on Electrical Engineering (ICEE), Zanjan, Iran, 2005. (in
Persian)
- Fazel Dehkordi A., Sameti H., Manzuri M. T.,
RCC-Mean Subtraction Robust Feature and Compare Various Feature based
Methods for Robust Speech Recognition in presence of Telephone Noise,
10th Annual Conference of Computer Society of Iran, Tehran, Iran,
2005. (in Persian)
- Fazel Dehkordi A., Sameti H., Ghiathi S. K., A
new feature extraction motivated by human, First International
Conference On Modeling, Simulation and Applied Optimization,
(ICMSAO’05), Sharjah, UAE, 2005.
- Hosseinzadeh K., Sameti H., Abutalebi H.R., Fazel
Dehkordi A., MLLR method for environmental adaptation in the
continuous FARSI speech recognition, The 6th Conference on
Intelligent Systems, Kerman, Iran, 2004.
- Babaali B., Sameti H., The Sharif
Speaker-Independent Large Vocabulary Speech Recognition System, The
2nd Workshop on Information Technology & Its Disciplines (WITID
2004), Feb. 24-26, 2004, Kish Island, Iran.
- Sameti H., Movasagh H., Babaali B., Bahrani M.,
Hosseinzadeh K., Fazel Dehkordi A., Abutalebi H. R., Veisi H., Mokri
Y., Motazeri N., Nezami Ranjbar M., Large Vocabulary Persian Speech
Recognition System, The 1st workshop on Persian language and
computer, Tehran, Iran, May 25-26 2004. (in Persian)
- H. Sameti and Li Deng, Nonstationary-State Hidden
Markov Model Representation of Speech Signals for Speech Enhancement,
Elsevier Signal Processing Journal, Volume 82, Number 2, pp. 205-227,
Feb. 2002.
- H. Sameti, H. Sheikhzadeh, L. Deng, and R. L.
Brennan, HMM-Based Strategies for Enhancement of Speech Embedded in
Non-Stationary Noise, IEEE Trans. on Speech and Audio Processing,
Vol. 6, No. 5, pp. 445-455, Sept. 1998.
- H. Sameti, Nonstationary-State Hidden Markov
Models for Speech Enhancement, The Second Iranian Computer Society
Conference, Amirkabir University of Technology, Tehran, Iran,
December 1996.
- L. Deng and H. Sameti, Transitional Speech Units
and Their Representation by Regressive Markov States:
Applications to Speech Recognition, IEEE Trans. on Speech and Audio
Processing, Vol. 4, No. 4, pp. 301-306, July 1996.
- L. Deng, J. Wu, and H. Sameti, Improved Speech
Modeling and Recognition Using Multi-dimensional Articulatory States
as Primitive Speech Units, Proceedings of IEEE Int. Conf. Acoustics,
Speech, and Signal Processing (ICASSP), pp. 385-388, Detroit, USA,
April 1995.
- H. Sheikhzadeh, R. L. Brennan, and H. Sameti,
Real-Time Implementation of HMM-Based MMSE Algorithm for Speech
Enhancement in Hearing Aid Applications, Proceedings of IEEE Int.
Conf. Acoustics, Speech, and Signal Processing (ICASSP), pp. 808-811,
Detroit, USA, April 1995.
- L. Deng and H. Sameti, Automatic Speech Recognition
Using Dynamically Defined Speech Units, Proceedings of the 1994
International Conference on Spoken Language Processing, Vol. 4, pp.
2167-2170, Yokohama, Japan, September 18-22, 1994.
- L. Deng and H. Sameti, Articulatory Phonology and
Speech Recognition: A Case Study Involving Use of Dynamically Defined
Speech Primitives, Acoustical Society of America, 127th Meeting,
Cambridge, MA, June 1994.
- H. Sheikhzadeh, H. Sameti,
L. Deng, and R. L. Brennan, Comparative Performance of Spectral
Subtraction and HMM-Based Speech Enhancement Strategies with
Application to Hearing Aid Design, Proceedings of IEEE Int. Conf.
Acoustics, Speech, and Signal Processing (ICASSP), pp. I-13, I-16,
Adelaide, Australia, April 1994.

:: Courses
- Speech Recognition, Graduate
- Speech Processing, Graduate
- Speech Enhancement, Graduate
- Statistics

:: Students
Ph.D. Students
1. Babaali Bagher, Starting 2004
2. Bahrani Mohammad, Starting 2004

M.S. Students
1. Veisipour Saman, Starting 2004
Thesis: Confidence measure and out of vocabulary
2. Safayani Mehran, Starting 2004
Thesis: Using microphone array for improving speech recognition performance
3. Veisi Hadi, Nov. 2005
Thesis: Model based methods for robust speech recognition systems
4. Hosseinzadeh Khosro, Nov. 2004
Thesis: Improving the accuracy of speech recognition in noisy environments
5. Movasagh Hamed, Jan. 2003
Thesis: Design and Implementation of Optimized Search for Persian Speech
Recognition by HMM

:: Speech Processing Lab (SPL)
SPL is one of the research laboratories of the Computer Engineering
Department of Sharif University of Technology. The lab's main research
area is digital signal processing, especially of speech signals. Speech
recognition is the group's major field of activity, and it has developed a
speech recognition engine that incorporates recent, successful algorithms.
The engine is customized for Persian, with Persian language models
(statistical and grammatical), and its first version has been released as
NEVISA, the most robust and accurate Persian dictation system. SPL also
developed NEWSHA, a highly accurate Persian telephony speech recognition
system that includes command and digit recognition. The group's fields of
activity are:
- Digital Speech Processing
- Robust Speech Recognition
- Speech Synthesis (Text-To-Speech)
- Speech Enhancement
- Natural Language Processing (NLP)
- Speech Database Design
- Pattern Recognition
More information about this group is available at spl.ce.sharif.edu.