INFORMATION SCIENCE SEMINAR
The Web Laboratory: A petabyte collection of data for research on the content, structure, and evolution of the Web
Speaker: Bill Arms, Professor & Co-Director of Information Science, Cornell University
Date: Wednesday, April 27, 2005, 4:15-5:15 p.m.
Location: 301 College Avenue, Seminar Room
Abstract - The petabyte data store is an NSF-funded project by Computer Science and the
Cornell Theory Center to support research projects that have very large
collections of data. We plan to mount a set of complete web crawls, dating
back to 1996, obtained from the Internet Archive. This treasure trove of
data can be used in a bewildering variety of ways for research on the
content, structure, and evolution of the Web. The talk will describe some of
the novel research that the laboratory will enable, and the technical
challenges involved in managing data on this scale.
If you would like to meet with Bill Arms, please contact Anat Nidar-Levi.
©2004 Cornell University