

CSE507
| Course |
CSE507 |
| Title |
Introduction to Computational Linguistics |
| Description |
An introduction to the techniques, literature, technologies
and current challenges of computational linguistics. Survey covers
four areas of computational linguistics using English as example
language: mathematical foundations, syntax, semantics and discourse.
|
| Prerequisite |
CSE537 for Computer Science Students.
|
| Credit Information |
3 - credits |
| Course Topics |
- Mathematical Foundations and Words
- Regular expressions and finite-state automata
- Finite-state transducers and morphology
- Review of basic probability and statistics
- N-gram language models
- Part-of-speech tagging
- Syntax
- Context-free grammars
- Parsing using context-free grammars
- Features and unification
- Grammars of English
- Lexicalized and probabilistic parsing
- Semantics
- Propositional and first-order logics and extensions to
handle time, default reasoning, probabilistic reasoning
- Representing meaning
- Semantic analysis
- Lexical semantics
- Discourse
- Reference resolution
- Discourse structure: text and dialogue
- Dialogue systems
Special topics, which are interspersed with the regular course
topics, may include: information extraction; alternative grammar
formalisms (e.g. tree-adjoining grammars), machine translation,
natural language generation |
| Textbook(s) |
D. Jurafsky and J. Martin (2000), Speech and Language Processing,
Prentice Hall, NJ.
The textbook is augmented with selected chapters from other books:
Topic I. C. -- C. Manning and H. Schutz (2000), Statistical Natural
Language Processing, Chapter 2.
Topic II. D. -- chapter J. Allen, Natural Language Understanding,
Chp. 5.
Also, as the course progresses students are increasingly directed
to original research papers on the topics of discussion. A selection
of these papers is placed on reserve each semester. |
| Prerequisites by topic |
None. |
| Course Objectives |
- Make students familiar with the range of tools and techniques
used in computational linguistics. Show them where to find the
literature in this field, and give them confidence in reading
it. In short, improve their abilities as computational linguists.
- Give students the ability to look at a computational linguistics
task and identify useful techniques for solving it. Encourage
them to think creatively and give them experience in planning
and conducting experiments in this field. In short, improve their
abilities as researchers.
- Encourage students to collaborate with others in different
fields. Because this course is interdisciplinary, computer scientists
end up working with linguists and psycho-linguists, which is exactly
what happens in computational linguistics all the time.
|
| Computer Usage |
Perl, Prolog. Students may also choose to use other languages
and tools for projects, e.g. Java, HTK, C/C++. There are no labs
for this course. However, heavy use is made of the Blackboard system.
|
| Course Webpage |
http://www.cs.sunysb.edu/~cse507
|
| Course Coordinator |
Dr. Amanda Stent |
|
|