The O'Reilly Peer-to-Peer and Web Services Conference
oreilly.comO'Reilly Network
ConferencesInternationalSafari: Books Online

Arrow Home
Arrow Registration
Arrow Speakers
Arrow Keynotes
Arrow Tutorials
Arrow Sessions
Arrow BOFs
Arrow Community Meetings
Arrow Events
Arrow Exhibitors
Arrow Sponsors
Arrow Hotel/Travel
Arrow See & Do
Arrow Press
Arrow Mail List

Practical Tools For Innovation
O'Reilly Bioinformatics Technology Conference
January 28-31, 2002 -- Tucson, AZ
Chambered Nautilus


AI Techniques for Microarray Data Analysis

Ben Goertzel, CEO, Cognitive Bioscience

Track: Analysis
Date: Tuesday, January 29
Time: 2:15pm - 3:45pm
Location: Canyon I

Microarray technology (including gene chips, spotted microarrays, and other related devices) provides unprecedented powerful insights into gene expression; and it can soon be expected to do the same for protein expression. However, the practical analysis of the data derived from microarray experiments is fraught with difficulties, stemming from the large volume and high noise level of the data sets, and the subtlety of the underlying data patterns. Standard statistical techniques, such as clustering and principal components analysis, are widely used but give highly uneven results. However, more advanced data analysis methods can yield greater insights, being less easily misled by noise, and more able to draw diverse background knowledge into the data analysis process.

In this talk we will discuss three particular examples of data analysis techniques from the AI field that can be fruitfully applied to the analysis of microarray data:

  1. Supervised learning techniques for data categorization. Well-known methods like SVM’s and decision trees can be used, for instance, to learn rules distinguishing cancerous from noncancerous cells. Appropriate construction of “feature vectors” makes a large difference in performance.

  2. Clustering using a priori background knowledge. Though popular, clustering is generally a poor data analysis method, but invocation and appropriate merging of knowledge from relevant databases can significantly improve its performance.

  3. Probabilistic rule inference from microarray time series data. While exact dynamical rules cannot be inferred from noisy and incomplete data, probabilistic rules approximately governing the dynamics can be. Home | Conferences Home | Bioinformatics Conference Home
Registration | Speakers | Keynotes | Tutorials | Sessions | BOFs
Exhibitors | Sponsors | Hotel/Travel | Press | Mail List

© 2001, O'Reilly Media, Inc.