MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Austrian on-line archive processing: Analyzing archives of the world wide web (2002) [7 citations — 0 self]

Download:
Download as a PDF | Download as a PS
by Andreas Rauber, Andreas Aschenbrenner, Oliver Witvoet
In Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2002
http://www.ifs.tuwien.ac.at/ifs/research/pub_ps/rau_ecdl02.ps.gz
Add To MetaCart

Abstract:

Abstract. With the popularity of the World Wide Web and the recognition of its worthiness of being archived we find numerous projects aiming at creating large-scale repositories containing excerpts and snapshots of Web data. Interfaces are being created that allow users to surf through time, analyzing the evolution of Web pages, or retrieving information using search interfaces. Yet, with the timeline and metadata available in such a Web archive, additional analyzes that go beyond mere information exploration, become possible. In this paper we present the AOLAP project building a Data Warehouse of such a Web archive, allowing its analysis and exploration from di#erent points of view using OLAP technologies. Specifically, technological aspects such as operating systems and Web servers used, geographic location, and Web technology such as the use of file types, forms or scripting languages, may be used to infer e.g. technology maturation or impact.

Citations

55 Preserving the Internet – Kahle - 1997
45 The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling – Kimball, Ross - 2002
44 Computing geographical scopes of web resources – Ding, Gravano, et al. - 2000
14 Multidimensional database technology – Pedersen, Jensen - 2001
10 The Kulturarw3 project - The Royal Swedish Web Archiw3e - An example of "complete" collection of web pages – Arvidson, Persson, et al. - 2000
7 Collecting and preserving the web: Developing and testing the NEDLIB harvester – Hakala - 2001
6 Managing time consistency for active data warehouse environments – Bruckner, Tjoa - 2001
6 Cost-Driven Design for Archival Repositories – Crespo, Garcia-Molin - 2001
5 Towards web-scale web archeology – Leung, Perl, et al. - 2001
4 Metadata for digital preservation: A review of recent developments – Day
3 Long-term preservation of electronic publications: The NEDLIB project. D-Lib Magazine – Werf-Davelaar - 1999
2 Web schemas in WHOWEDA – Bhowmick, Keong, et al. - 2000
1 Webbase: A repositoru of web pages – Hirai, Raghavan, et al.
1 Austrian On-Line Archive: Current status and next steps. Presentation given at the – Rauber - 2001
1 Part of our culture is born digital - On e#orts to preserve it for future generations – Rauber, Aschenbrenner - 2001