Search Engine Technology


Course description

A significant portion of the information that surrounds us is in textual format. A number of techniques for accessing such information exist, ranging from databases to natural language processing.
Some of the most prestigious companies these days spend large amounts of money to build intelligent search engines that allow casual users to find what they want anytime, from anywhere, and in any language.
In this course, we will cover the theory and practice behind the implementation of search engines, focusing on a wide range of topics including methods for text storage and retrieval, the structure of the Web as a graph, evaluation of systems, and user interfaces.

Learning objectives

  • Understand how search engines work
  • Understand the limits of existing search technology
  • Learn to appreciate the sheer size of the Web
  • Learn to wrote code for text indexing and retrieval
  • Learn about the state of the art in IR research
  • Learn to analyze textual and semi-structured data sets
  • Learn to appreciate the diversity of texts on the Web
  • Learn to evaluate information retrieval
  • Learn about standardized document collections
  • Learn about text similarity measures
  • Learn about semantic dimensionality reduction
  • Learn about the idiosyncrasies of hyperlinked document collections
  • Learn about web crawling
  • Learn to use existing software
  • Understand the dynamics of the Web by building appropriate mathematical models
  • Build working systems that assist users in finding useful information on the Web
  • Download PPT  Lectures
    Size
    576K
    199K
    265K
    488K
    135K
    530K
    233K
    1.2M
    153K
    2.6M
    1.4M
    1.2M
    545K

No comments:

free counters