----------------------------------------------------------------------- BIOINFORMATICS COLLOQUIUM School of Computational Sciences George Mason University ----------------------------------------------------------------------- Iterative Denoising for Cross-Corpus Discovery David Marchette, Ph.D. John Hopkins University, Naval Surface Warfare Center Tuesday, September 21, 2004 4:30 pm Verizon Auditorium, Prince William Campus Given two disparate corpora we wish to identify meaningful cross-corpus associations; e.g., observations in different corpora satisfying, perhaps, a dictionary definition of serendipity: a meaningful discovery not explicitly sought. Toward this end, we introduce an iterative denoising methodology for cross-corpus discovery. This is a method for dimensionality reduction and search that utilizes corpus-dependent projections. We take a (perhaps overly) broad definition of corpus; we will illustrate the methodology on hyperspectral data analysis, text document processing, and analyzing user login sessions. Some potential applications of these techniques to biological problems will be discussed. ---------------------------------------------------------------------- Refreshments are served at 4:00 pm. Find the schedule and directions at http://www.binf.gmu.edu/colloq.html