 and be ready to write some very simple Regex to identify strings in texts. Think of more high-level (with lexical/semantic constraints) RegEx to recognize Location names. 
Homework 3
Read IBM's Watson paper IBM Question Answering and Poon and Domingo's USP paper Unsupervised Semantic Parsing. Be prepared to answer some questions on these two papers. 
Homework 4
Install Sphinx and Stanford Parser - Read: Sphinx-4: A Flexible Open Source Framework for Speech Recognition  whitepaper and Stanford parses whitepaper.  You may also want to read: An Introduction to Speech Recognition, B. Plannerer, 2005 and Understanding the CMU Sphinx Speech Recognition System, Chun-Feng Liao, National Chengchi University. The Test will be on May 24th (May 10 lesson wil be held by Prof. Roberto Basili on Machine learning methods for NLP).
 and be ready to write some very simple Regex to identify strings in texts. Think of more high-level (with lexical/semantic constraints) RegEx to recognize Location names. 
Homework 3
Read IBM's Watson paper IBM Question Answering and Poon and Domingo's USP paper Unsupervised Semantic Parsing. Be prepared to answer some questions on these two papers. 
Homework 4
Install Sphinx and Stanford Parser - Read: Sphinx-4: A Flexible Open Source Framework for Speech Recognition  whitepaper and Stanford parses whitepaper.  You may also want to read: An Introduction to Speech Recognition, B. Plannerer, 2005 and Understanding the CMU Sphinx Speech Recognition System, Chun-Feng Liao, National Chengchi University. The Test will be on May 24th (May 10 lesson wil be held by Prof. Roberto Basili on Machine learning methods for NLP). 
 .  
Here you can find  a previous students' project on Sphinx.
  .  
Here you can find  a previous students' project on Sphinx. 
 ). Define a small corpus of 100 short sentences. Measure the performances using the performance indicators presented during the Speech Recognition lesson.  Then process the recognized texts with the Stanford parser, and again measure the performances. How many errors are induced by the speech recognizer, how many by the parser? Write a 5-10 page report and comment. 
Students that, at the end of the course, did not pass the classroom test will additionally read and present one of the papers in the above list (they can choose the preferred paper, but every student must have a different paper).
NOTE: Students can coordinate their work creating and joining a Google group NLP - 2013
). Define a small corpus of 100 short sentences. Measure the performances using the performance indicators presented during the Speech Recognition lesson.  Then process the recognized texts with the Stanford parser, and again measure the performances. How many errors are induced by the speech recognizer, how many by the parser? Write a 5-10 page report and comment. 
Students that, at the end of the course, did not pass the classroom test will additionally read and present one of the papers in the above list (they can choose the preferred paper, but every student must have a different paper).
NOTE: Students can coordinate their work creating and joining a Google group NLP - 2013
 * The Treetagger (POS tagger and Chunker for many   languages including Estonian, Romenian, Latin, Swaili  and more) http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
 * The Treetagger (POS tagger and Chunker for many   languages including Estonian, Romenian, Latin, Swaili  and more) http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html * WordNet-  http://wordnet.princeton.edu/
 * WordNet-  http://wordnet.princeton.edu/ * FrameNet-    http://framenet.icsi.berkeley.edu/
 * FrameNet-    http://framenet.icsi.berkeley.edu/ * VerbNet-    http://verbs.colorado.edu/~mpalmer/projects/verbnet.html
 * VerbNet-    http://verbs.colorado.edu/~mpalmer/projects/verbnet.html * A Word Sense Disambiguation tool based on WordNet-    http://lcl.uniroma1.it/ssi/
 * A Word Sense Disambiguation tool based on WordNet-    http://lcl.uniroma1.it/ssi/ * Machine Translation: Google Translate -  http://translate.google.com/#
 * Machine Translation: Google Translate -  http://translate.google.com/# * Multilingual electronic dictionary of football (soccer) language. It is based on FrameNet http://www.kicktionary.de/
 * Multilingual electronic dictionary of football (soccer) language. It is based on FrameNet http://www.kicktionary.de/ *Transcription of the IBL speech corpus (route directions in a University Campus) http://www.ltg.ed.ac.uk/ibl/ibl-transcription.txt
  *Transcription of the IBL speech corpus (route directions in a University Campus) http://www.ltg.ed.ac.uk/ibl/ibl-transcription.txt NOTE: details on the IBL corpus are found on http://www.tech.plym.ac.uk/soc/staff/guidbugm/ibl/readme1.html
  NOTE: details on the IBL corpus are found on http://www.tech.plym.ac.uk/soc/staff/guidbugm/ibl/readme1.html It is a useful guide on how to build a corpus of expressions and words used by humans instructing an agent moving an artificial environment.
  * Sphinx,  open source speech recognition http://cmusphinx.sourceforge.net/
   It is a useful guide on how to build a corpus of expressions and words used by humans instructing an agent moving an artificial environment.
  * Sphinx,  open source speech recognition http://cmusphinx.sourceforge.net/ * BabelNet, A very large multilingual ontology  http://lcl.uniroma1.it/babelnet/
 
  * BabelNet, A very large multilingual ontology  http://lcl.uniroma1.it/babelnet/ 
| I | Attachment   | History | Action | Size | Date | Who | Comment | 
|---|---|---|---|---|---|---|---|
|  pptx | Project_NLP.pptx | r1 | manage | 213.3 K | 2012-03-17 - 13:09 | PaolaVelardi | 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
           
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
|  |  |  
      Questo sito usa cookies, usandolo ne accettate la presenza. (CookiePolicy)
       Torna al Dipartimento di Informatica   |  |  |  | 
 
  Copyright © 2008-2025 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Copyright © 2008-2025 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.