| |
General Information
This syllabus can be expected to change as the course progresses.
Classes are divided into two formats.
- Lectures
- Each week there will be two lectures,
on Tuesday and Thursday. PowerPoint slides will be available on
the Web.
Related chapters of the book by Manning, Raghavan, and Schütze are indicated, "MRS".
- Discussions
- Class discussions on Wednesday evenings are based on readings from
journal articles or other papers to be read before the class.
It is essential that that everybody comes to class well prepared.
Week 1
| Date |
Event |
Topic |
| Thursday 8/28 |
Lecture 1 |
Searching full text 1: Basic concepts
MRS: 1 Boolean retrieval
[PowerPoint, HTML]
|
Week 2
| Date |
Event |
Topic |
| Tuesday 9/2 |
Lecture 2 |
Searching full text 2: Dictionaries, inverted files, postings
MRS: 2 The term vocabulary and postings lists
[PowerPoint, HTML] |
| Wednesday 9/3 |
Discussion 1 |
Examples of information retrieval [instructions]
[PowerPoint, HTML] |
| Thursday 9/4 |
Lecture 3 |
Searching full text 3: Term weighting
MRS: 6 Scoring, term weighting, and the vector space model
[PowerPoint, HTML] |
Week 3
| Date |
Event |
Topic |
| Tuesday 9/9 |
Lecture 4 |
Searching full text 4: Similarity, ranking and the vector space model
MRS: 7 Computing scores in a complete search system
[PowerPoint, HTML] |
| Wednesday 9/10 |
Discussion 2 |
SMART [reading]
[PowerPoint, HTML] |
| Thursday 9/11 |
Lecture 5 |
Searching full text 5: Index construction
MRS: 4 Index construction
[PowerPoint, HTML] |
Week 4
| Date |
Event |
Topic |
| Tuesday 9/16 |
Lecture 6 |
String processing 1:
Wild cards, stemming, and spelling
MRS: 3 Dictionaries and tolerant retrieval
[PowerPoint, HTML] |
| Wednesday 9/17 |
Discussion 3 |
IDF [reading]
[PowerPoint HTML] |
| Thursday 9/18 |
Lecture 7 |
String processing: String search
MRS: 5 Index compression
[PowerPoint, HTML] |
| Sunday 9/21 |
Assignment 1 due |
Building an index |
Week 5
| Date |
Event |
Topic |
| Tuesday 9/23 |
Lecture 8 |
Relevance feedback and query refinement
MRS: 9 Relevance feedback and query expansion
[PowerPoint, HTML] |
| Wednesday 9/24 |
Discussion 4 |
Latent semantic indexing [reading]
[PowerPoint, HTML] |
| Thursday 9/25 |
Lecture 9 |
Latent semantic indexing
MRS: 18 Matrix decompositions and latent semantic indexing
[PowerPoint, HTML] |
Week 6
| Date |
Event |
Topic |
| Tuesday 9/30 |
Lecture 10 |
Evaluation of retrieval effectiveness 1: Measures of effectiveness
MRS: 8. Evaluation in information retrieval
[PowerPoint, HTML] |
| Wednesday 10/1 |
Discussion 5 |
TREC [reading]
[PowerPoint, HTML] |
| Thursday 10/2 |
Lecture 11 |
Evaluation of retrieval effectiveness 2: TREC
[PowerPoint, HTML]
|
Week 7
| Date |
Event |
Topic |
| Tuesday 10/7 |
Lecture 12 |
[No class] |
| Wednesday 10/8 |
Discussion 6 |
Google [reading]
[PowerPoint, HTML] |
| Thursday 10/9 |
Lecture 13 |
Web searching 1: Web crawling
MRS: 19 Web search basics
MRS: 20 Web crawling and indexes
[PowerPoint, HTML] |
| Saturday 10/11 |
Assignment 2 due |
Latent semantic indexing |
Week 8
| Date |
Event |
Topic |
| Tuesday 10/14 |
[fall break] |
|
| Wednesday 10/15 |
Midterm examination |
|
| Thursday 10/16 |
Lecture 14 |
Web searching 2: Links and anchor text
MRS: 21 Link analysis
[PowerPoint, HTML]
|
Week 9
| Date |
Event |
Topic |
| Tuesday 10/21 |
Lecture 15 |
Web searching 3: Systems aspects of web
searching
[PowerPoint, HTML] |
| Wednesday 10/22 |
Discussion 7 |
MapReduce [reading]
[PowerPoint,
HTML] |
| Thursday 10/23 |
Lecture 16 |
MapReduce programming workshop
[PowerPoint, HTML] |
Week 10
| Date |
Event |
Topic |
| Tuesday 10/28 |
Lecture 17 |
Web searching 4: Spam and advertising
[PowerPoint, HTML] |
| Wednesday 10/29 |
Discussion 8 |
The Google File System [reading]
[PowerPoint, HTML] |
| Thursday 10/30 |
Lecture 18 |
Probabilistic information retrieval
MRS: 11 Probabilistic information retrieval
[PowerPoint, HTML]
|
Week 11
| Date |
Event |
Topic |
| Tuesday 11/4 |
Lecture 19 |
Usability 1: Interfaces for browsing and searching
[PowerPoint, HTML] |
| Wednesday11/5 |
Discussion 9 |
Snippets [reading]
[PowerPoint, HTML] |
| Thursday 11/6 |
Lecture 20 |
Usability 2: Evaluation with human in the loop
[PowerPoint, HTML] |
| Sunday 11/9 |
Assignment 3 due |
|
Week 12
Week 13
| Date |
Event |
Topic |
| Tuesday 11/18 |
Lecture 23 |
Metadata 3: Thesauruses
[PowerPoint, HTML] |
| Wednesday 11/19 |
Discussion 11 |
Informedia [reading]
[PowerPoint, HTML] |
| Thursday 11/20 |
Lecture 24 |
Classification and categorization 1
MRS: 14 Vector space classification
[PowerPoint, HTML] |
Week 14
| Date |
Event |
Topic |
| Tuesday 11/25 |
Lecture 25 |
Classification and categorization 2
MRS: 16 Flat clustering
MRS: 17 Hierarcical clustering
[PowerPoint, HTML] |
| Wednesday 11/26 |
[Thanksgiving] |
|
| Thursday 11/27 |
[Thanksgiving] |
|
Week 15
| Date |
Event |
Topic |
| Tuesday 12/2 |
Lecture 26 |
Searching library collections 1: books
MRS: 10 XML Retrieval
[PowerPoint, HTML |
| Wednesday 12/3 |
Discussion 12 |
Mixed content and metadata[reading]
[PowerPoint,
HTML] |
| Thursday 12/4 |
Lecture 27 |
Searching library collections 2: mixed content
[PowerPoint, HTML] |
| Friday 12/5 |
Assignment 4 due |
|
Examination
| Date |
Event |
| TBA |
Final examination. (See the examinations page for information about taking the final exam early.) |
|