UNBBAYES-MEBN: COMMENTS ON IMPLEMENTING A PROBABILISTIC ONTOLOGY TOOL Rommel N. Carvalho, Marcelo Ladeira, Laécio L. Santos, Shou Matsumoto Universidade de Brasília – Departamento de Ciência da Computação Brasília, DF - Brazil {rommel.carvalho, laecio, cardialfly}@gmail.com; mladeira@unb.br
Paulo C. G. Costa George Mason University – C4I Center Fairfax, VA - USA pcosta@gmu.edu
ABSTRACT The quest for principled approaches to represent and reason under uncertainty in the Semantic Web (SW) is a very active research subject. Recently, the World Wide Web Consortium (W3C) created the Uncertainty Reasoning for the World Wide Web Incubator Group - URW3-XG [Laskey, K.J. et al., 2007] to better define the challenge of reasoning with and representing uncertain information available through the World Wide Web and related WWW technologies. One of the most promising approaches is the use of a Bayesian framework to handle uncertainty in SW ontologies. Working within this approach, Costa [2005] proposed a probabilistic ontology language, denoted PR-OWL, to represent and to reason with probabilistic ontologies. PR-OWL language is based on MEBN – Multi-Entity Bayesian Network [Laskey & Mahoney, 1997; Laskey & Costa, 2005; Laskey, 2007], a formalism that brings together the expressiveness of first-order logic (FOL) and the inferential power of Bayesian Networks (BN) to support probabilistic reasoning. Since both MEBN and PR-OWL are still under development, there is no tool that implements MEBN/PR-OWL as a knowledge representation formalism and probabilistic reasoner. This paper discusses the technical problems encountered, as well as how they were addressed in such an implementation that is currently under development at the University of Brasilia, with technical support from the C4I Center at George Mason University. KEYWORDS Multi-Entity Bayesian Network, Bayesian networks, probabilistic ontology Web, probabilistic reasoning, Semantic Web.
1. INTRODUCTION What was first thought to be a great advantage for decision makers is now becoming a bottleneck. The huge amount of information available due to the increasing connectivity across the globe, has been making its timely processing by humans into knowledge almost impossible. This information overload needs to be overcomed by IT techniques, enabling the jump from the “information technology revolution” to the “knowledge revolution”, a natural sequence predicted in Alvin Toffler’s The Third Wave [Toffler, 1980]. The “knowledge revolution” will be seen, in the future, as the phase where the arduous and manual task of identifying, accessing and utilizing the information was assigned successfully to computers, allowing human beings to change their focus from data to knowledge driven activities. However, how will IT accomplish such a daunting task? One widely shared response is that the solution lies with semantics. Technologies for making semantic information explicit and computationally accessible are key to effective exploitation of data from disparate sources [Laskey et al, 2007]. In this context emerges the Semantic Web (SW) to define common formats for integration and combination of data drawn from diverse sources, and a language for recording how data relates to real world objects. Explicit semantics is essential for appropriate processing of syntactically identical but semantically different terms (e.g., “Washington” the President, the city, or the football team). Ontologies, or shared repositories of precisely defined concepts expressed in standardized languages, are a vital tool for enabling semantic interoperability among web