Skip to main navigation Skip to search Skip to main content

XBase: Making Your Gigabyte Disk Queriable

  • Hongjun Lu
  • , Guoren Wang
  • , Ge Yu
  • , Yubin Bao
  • , Jianhua Lv
  • , Yaxin Yu
  • Hong Kong University of Science and Technology
  • Northeastern University China

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid development of the Internet and the World Wide Web (WWW), very large amount of information is available and ready for downloading, most of which are free of charge. At the same time, hard disks with large capacity are available at affordable prices. Most of us nowadays often dump a large number of various types of documents into our computers without much thinking. On the other hand, file systems have not changed too much during the past decades. Most of them organize files in directories that form a tree structure, and a file is identified by its name and pathname in the directory tree. Remembering name of files created sometime ago and digging them out from a disk with dozen gigabytes of data in hundred thousands of files becomes never an easy task. Tools available for helping such a search are still far from satisfactory.Xbase (XML-based document BASE) is a prototype system aiming at addressing the above problem. By XML-based, we meant that XML is used to define the metadata. The current version of XBase stores text-based files, including semi-structured data such as XML, HTML, plain text documents (e.g., tex files, computer programs) and those files that can be converted into text (e.g., postscript files, PDF files). In XBase, file name is optional. Users can just load a file into XBase without giving a name and the directory where it should be stored. XBase will automatically associate it with attributes such as the time when the file was saved, its source, its size and type, and etc., To retrieve those files, XBase provides three access methods, explorative browsing, querying using query languages, and keyword based search.

Original languageEnglish
Title of host publicationProceedings of the 2002 ACM SIGMOD International Conference on Management of Data, SIGMOD 2002
PublisherAssociation for Computing Machinery, Inc
Pages630
Number of pages1
ISBN (Electronic)1581134975, 9781581134971
DOIs
Publication statusPublished - 3 Jun 2002
Externally publishedYes
Event2002 ACM SIGMOD International Conference on Management of Data, SIGMOD 2002 - Madison, United States
Duration: 3 Jun 20026 Jun 2002

Publication series

NameProceedings of the 2002 ACM SIGMOD International Conference on Management of Data, SIGMOD 2002

Conference

Conference2002 ACM SIGMOD International Conference on Management of Data, SIGMOD 2002
Country/TerritoryUnited States
CityMadison
Period3/06/026/06/02

Keywords

  • DOM
  • XML query processing
  • multidimensional browsing

Fingerprint

Dive into the research topics of 'XBase: Making Your Gigabyte Disk Queriable'. Together they form a unique fingerprint.

Cite this