DIMACS Workshop on Data Quality, Data Cleaning and Treatment of Noisy Data

November 3 - 4, 2003
DIMACS Center, CoRE Building, Rutgers University, Piscataway, NJ

Parni Dasu, AT&T Labs, tamr@research.att.com
Presented under the auspices of the Special Focus on Data Analysis and Mining.

Workshop Program:

 Monday, November 3, 2003

 9:00 -  9:40  Breakfast and Registration

 9:40 -  9:50  Welcome and Opening Remarks
               Brenda Latka, DIMACS Associate Director

 9:50 - 10:00  Opening Remarks
               Tamraparni Dasu, AT&T Labs - Research

10:00 - 10:50  Managing Inconsistency in Data Exchange and Integration
               Renée Miller, University of Toronto

10:50 - 11:40  Data Quality in Trading Surveillance
               Grace Zhang, Morgan Stanley

11:40 - 12:30  Bellman - A Data Quality Browser
               Ted Johnson, AT&T Labs

12:30 -  2:00  Lunch

 2:00 -  3:00  The Data Cleaning Problem --
               Some Key Issues and Practical Approaches
               Ron Pearson, Daniel Baugh Institute for Functional Genomics and
               Computational Biology, Thomas Jefferson University

 3:00 -  3:50  Pre-processing of Microarray Data
               Dhammika Amaratunga, Javier Cabrera, Nandini Raghavan
               Johnson & Johnson, Rutgers, Johnson & Johnson

 3:50 -  4:00  Break

 4:00 -  4:50  Checks and Balances: Monitoring Data Quality Problems in Network
               Traffic Databases
               S. Muthukrishnan, Rutgers University

 4:50 -  5:30  Maximum Patterns and Outliers in the Logical Analysis of
               Data (LAD)
               T. Bonates, P. Hammer, A. Kogan, and I. Lozina
               RUTCOR, Rutgers University

 5:30          Banquet Dinner

 Tuesday, November 4, 2003

 9:30 -  9:50  Breakfast and Registration

 9:50 - 10:00  Opening Remarks

10:00 - 11:00  Data Mining: A Powerful Tool for Data Cleaning
               Jiawei Han, University of Illinois at Urbana-Champaign

11:00 - 12:00  A $220 Million Success Story
               Jon Hill, British Telecommunications

12:00 -  1:00  Life Cycle Datamining
               Gregg Vesonder, Jon Wright and Parni Dasu, AT&T Labs - Research

 1:00 -  2:30  Lunch

 2:30 -  3:20  Managing Data Streams
               Andrew Hume, AT&T Labs

 3:20 -  4:10  Web page cleaning for web data mining
               Bing Liu, University of Illinois at Chicago

 4:10 -  4:20  Break

 4:20 -  5:10  Relational Nonlinear FIR Filters
               R.K. Pearson and M. Gabbouj
               Daniel Baugh Institute for Functional Genomics and
               Computational Biology, Thomas Jefferson University
               and Tampere University of Technology

Document last modified on October 28, 2003.