O'Reilly Open Source Convention.
Books Safari Bookshelf Conferences O'Reilly Network

Arrow Home
Arrow Registration
Arrow Speakers
Arrow Keynotes
Arrow Tutorials
Arrow Sessions
Arrow At-a-Glance
Arrow BOFs
Arrow Events
Arrow Community
Arrow Exhibitors
Arrow Sponsors
Arrow Hotel/Travel
Arrow Venue Map
Arrow See & Do
Arrow Press
Arrow Mail List

O'Reilly Open Source Convention


Building a Smarter Search Engine: Artificial Stupidity
Maciej Ceglowski, National Institute for Technology and Liberal Education

Track: Emerging Topics
Date: Friday, July 11
Time: 10:30am - 11:15am
Location: Salon D

Statistical natural language processing is artifical intelligence for the lazy. It lets you build search engines that "do what you mean" without all the hard work of teaching the computer to understand human language. Statistical techniques use word counting and other tricks to make surprisingly intelligent guesses about document similarity, giving results that can surpass those of heavyweight expert systems. The authors have been working on an open source (Perl) implementation of two statistical techniques, latent semantic analysis (LSA) and contextual network graphs. The talk is intended for anyone who wants to go beyond keyword search - you'll leave armed with links, code, and an understanding of where to learn more.

O'Reilly Home | Privacy Policy

© 2003, O'Reilly Media, Inc.