Course introduction | |
---|---|
Course name | Statistical Natural Language Processing |
Credits | 3 |
Instructor | Bahram Vazirnezhad- Ph.D. in Biomedical Engineering- Bio-Electric |
Schedules | Saturdays and Tuesdays 10:30-12:00, LLD 1st floor AVR and SLP Lab |
Office hours | See office hours |
General purpose | This course will focus on basic to advanced statistical techniques and methods in natural language and speech processing and its applications on speech and language technologies such as machine translation, question answering, language modeling, document clustering, speech processing. The syllabus contains mathematical foundations including probability theory, information theory, hypothesis testing, statistical inference, hidden markov models and statistical speech and language processing and its applications in part-of-speech tagging, machine translation, document clustering, document classification, speech enhancement, speech recognition and speech synthesis etc. |
Course outline | Seminars
Homeworks Projects Exams |
Required material | Electronic version of material are available HERE and hardcopies will be made available as required |
Main reference | Probability, Random Variables and Stochastic Processes, A. Papoulis
An introduction to Computational Linguistics, R. Grishman Foundations of Statistical Natural Language Processing, C. Manning, H. Schutze Advanced Digital Signal Processing and Noise Reduction, S. Vaseghi Spoken Language Processing, X. Huang, A. Acero, H. Hon Discrete Time Processing of Speech Signals, J. R. Deller, J. H. L. Hansen, J. G. Proakis Statistical Machine Translation, P. Koehn |
Evaluation | Class activity/ effective presence/ homeworks 25%
Seminars 20% Projects 35% Final exam 20% |
Notes | -
- |
Course syllabus | ||||
---|---|---|---|---|
Week | Description | Lecture | Homework | Notes |
1 | Introduction and course outline | lectures | homeworks | The purpose, content and organization of the course, required material; evaluation; preparing schedule for presentations etc. |
2 | Fundamentals- Probability, Statistics and Random Variables | lecture 1 | homework 1 | |
3 | Fundamentals- Statistics, Stochastic Processes and Fourier Transform | lecture 3 | homework 3 | |
4 | Fundamentals- Syntax, Chomsky hierarchy of languages | lecture 4 | homework 4 | |
5 | Fundamentals- Information theory and Entropy | lecture 5 | homework 5 | |
6 | Methods and Algorithms- Statistical pattern recognition, Baysian inference, Clustering algorithms (K-means, KNN) | lecture 6 | homework 6 | |
7 | Methods and Algorithms- Statistical pattern recognition, Decision trees, PCA, Least Square | lecture 7 | homework 7 | |
8 | Methods and Algorithms- Syntactic recognition, Finite-State-Machines, Push-Down-Automata | lecture 8 | homework 8 | |
9 | Methods and Algorithms- Search algorithms, Dynamic Programming, Dynamic Time Warping | lecture 9 | homework 9 | |
10 | Methods and Algorithms- Linear Prediction, Hidden Markov Models | lecture 10 | homework 10 | |
11 | Applications- POS tager | lecture 11 | homework 11 | |
12 | Applications- Text Classification | lecture 12 | homework 12 | |
13 | Applications- Speech Enhancement | lecture 13 | homework 13 | |
14 | Applications- Speech Recognition | lecture 14 | homework 14 | |
15 | Applications- Speech Synthesis | lecture 15 | homework 15 | |