UMBC High Performance Computing Facility
Evaluating Accumulo as a Distributed Semantic
Web Triple Store
Tim Finin, Computer Science and Electrical Engineering
Anupam Joshi, Computer Science and Electrical Engineering
Steve Troxell, Computer Science and Electrical Engineering
Semantic Web technologies provide a new approach to to creating,
accessing and using distributed, open knowledge using sophisticated
knowledge representation concepts coupled with common Web languages and
protocols. Since the amount of knowledge available in this form is
already large (tens of billions of facts) and expected to grow rapidly,
it is important to develop tools that can work at "Web scale".
This project will evaluate parallel approaches to storing, managing and
querying Semantic Web data in RDF in a cloud computing environment.
Specifically, we will use a set of benchmark datasets to evaluate
Accumulo, a distributed key/value store modeled after Google's BigTable,
as a store for distributed RDF data when used with an environment
provided by Apache Hadoop and Apache Zookeeper.