THE CITY COLLEGE

Department of Computer Science


CSc I0810. Topics in Software and Systems

(Modern Information Retrieval: Search Technologies)


Fall 2006 Office: NAC 7/244 Hours: M/W

Prof. Abbe Mowshowitz Phone: (212) 650-6161 email: abbe@cs.ccny.cuny.edu


TEXT: Baeza-Yates & Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, 1999.

GRADING: Midterm (20%), term project (40%), final exam (40%).


This course is an advanced introduction to information retrieval with a special emphasis on Web searching. The course will begin with a presentation of retrieval strategies (i.e., vector space, probabilistic, inference networks, extended Boolean, and others). Next, retrieval evaluation methods (i.e., relevance-based measures and alternatives) will be discussed, followed by query languages and operations, and indexing and searching. The latter part of the course will focus on Web searching (i.e., search engine architectures, user interfaces, ranking, crawling and indexing).


TERM PROJECT: Each student will be assigned a research or programming task related to Web searching. You are expected to prepare a written report on the research or the program, and to present a summary of your research or to demonstrate your program in class.


Course Outline


Part I. Retrieval Strategies and Evaluation

Session 1. Classic vector space model

Session 2. Set theoretic and probabilistic models

Session 3. Language and other models

Session 4. Relevance-based and other performance measures


Midterm


Part III. Queries, Indexing and Searching

Sessions 5. Query languages and operations

Session 6. Indexing

Session 7. Searching


Part V. Web Searching

Session 8. Models of the Web and search engine architecture

Session 9. Ranking

Session 10. Crawling and indexing

Session 11. Result fusion

Session 12. Metasearchers and searching using hyperlinks


Sessions 13 & 14. Presentation of student project results