Cornell Information Science contact | cis home
   home  about us  undergrad programs  grad programs  research  faculty and researchers
 About Us
 Overview
 Seminar Series
 Facilities
 Contact
 INFORMATION SCIENCE SEMINAR

The Web Laboratory: A petabyte collection of data for research on the content, structure, and evolution of the Web

 

Speaker: Bill Arms, Professor & Co-Director of Information Science, Cornell University

Date: Wednesday, April 27, 2005 4:15-5:15p

Location: 301 College Avenue, Seminar Room

 

Abstract -

The petabyte data store is an NSF-funded project by Computer Science and the Cornell Theory Center to support research projects that have very large collections of data. We plan to mount a set of complete web crawls, dating back to 1996, obtained from the Internet Archive. This treasure trove of data can be used in a bewildering variety of ways for research on the content, structure, and evolution of the Web. The talk will describe some of the novel research that the laboratory will enable, and the technical challenges involved in managing data on this scale.

Bio -

William Arms is Professor of Computer Science and Co-Director of the Information Science program. His research interests are in digital libraries.

 

If you would like to meet with Bill Arms, please contact Anat Nidar-Levi.


For more information please contact Jeff Hancock.