Data analysis — from science to enterprise
It’s a little known fact that the same uniquely scalable shared memory architecture that enables SGI Altix® 4700 with Intel® Itanium® processors to power technical and scientific breakthroughs also excels at high-performance reasoning on ontologies for enterprise data analytics applications. Ontologies are knowledge models or formal representations of a set of relevant concepts within a domain and the relationship between these concepts.
A major advantage of ontologies for data analytics is the ability to share the meaning (semantics) of information in a knowledge model, capture complex relationships and integrate heterogeneous data sources. These characteristics can only be achieved if the ontology run-time is able to scale with a growing number of facts.
Silicon Graphics and Ontoprise GmbH have demonstrated that OntoBroker® inference engine can load and process large ontologies in main memory. The SGI Altix 4700 platform has a unique configuration elasticity that allows adding processors and memory independent of each other, and thus easily accommodating user growth and securing hardware investment with a lower TCO. Here are some benchmark examples using the OntoBroker inference engine on a SGI Altix 4700 server:
–Complex graph traversal. Finding the possible traversable paths between the nodes in a graph comprising of 1 million nodes takes only 15.7 seconds, thus making complex reasoning possible
–Semantic retrieval and query processing with 104 facts. Five classes of 195 queries with varying complexity run in less than 18 milliseconds on the SGI Altix® 4700
–SmartWeb®. The longest queries for multimedia web content with 95 queries and 60 rules takes an average of 17 seconds from a disk-based database but only an average of 73 milliseconds from a non-materialized in-memory model.
–Wikipedia® knowledge base search. The most complex query with average result sets from Wikipedia® knowledge base takes less than 35 ms with a trade-off of load time of 48 million wiki facts in 138 minutes
–Automotive test. A large ontology comprising of 1.4 million facts takes only 133 seconds for in-memory load while queries over the indexed database model, and the database load took 381 seconds. The query times showed that large result sets benefit most from an in-memory model
So with a professional reasoning engine like OntoBroker, and scalable servers like SGI Altix 4700, users of many applications can find required information much more quickly and easily.