### Session on Mathematical, Statistical, and Algorithmic
Problems of Very Large Data Sets - American
Mathematical Society Meeting

#### Jointly sponsored by DIMACS and Division of Mathematical Sciences, NSF

#### January 8, 1997, 2:15 - 6:00 P.M.

San Diego, CA

- Organizer for session:
**Joan Feigenbaum**, AT&T Labs jf@research.att.com

The emergence of the computer as an essential tool in scientific research
and as an essential ingredient in commercial systems has led to the
generation of massive amounts of data. These data sets are of critical
importance for a broad variety of applications, including (but certainly
not limited to) astrophysical models, genetic sequencing, geographic
information systems, ecological monitoring, weather prediction,
telecommunications applications, commercial digital video and audio,
digital libraries, government information systems, and biological models
for medical applications. Researchers in all of these applications areas
currently face daunting computational problems in organizing and extracting
useful information from these massive data sets.

In an effort to acquaint the mathematical sciences community with the
problems and challenges is this area, the Division of Mathematical Sciences
and DIMACS have organized a session on massive data sets at the San Diego
meeting of the American Mathematical Society in January. This session will
describe some of the challenging mathematical, statistical, and algorithmic
problems inherent in organizing and using enormous amounts of data. The
emphasis will be on basic issues that transcend particular applications.
Speakers will explain why existing mathematical, statistical, and
algorithmic methods break down on the enormous data sets that scientists
and technologists now encounter regularly and will attempt to delineate the
boundaries at which these breakdowns occur. The session will include a
discussion of specific DIMACS and DMS programs that focus on huge data
sets.

### List of Speakers:

- Laney Kulsrud, Center for Communications Research, Princeton
"Mathematical Methods for Mining Massive Data Sets"
- David Karger, Massachusetts Institute of Technology
"Information Retrieval: Challenges in Interactive-Time Manipulation of
Massive Text Collections"
- Sarbani Basu, Aarhus University
"Helioseismic Inversions: Massive Data Sets and the Determination of Solar
Structure"
- Ken Church, AT&T Laboratories
"Massive Data Sets and Graph Algorithms in Telecommunications Systems"

The session has been organized by Joan Feigenbaum at AT&T Labs.
Abstracts of the talks and the schedule of the session
will be available on the DIMACS Web page, http://dimacs.rutgers.edu.

Program

Workshop Index

DIMACS Homepage

Contacting the Center

Document last modified on November 26, 1996