For an application to be more intelligent, it needs to know more. To know more, an application needs to collect more information. When collecting and integrating more information, the resulting dataset becomes increasingly complex.
What makes a dataset complex?
Data models make datasets complex. Real world models are filled with hierarchies and hyper-relationships, but our current modelling techniques are all based on binary relations. Querying these legacy datasets is challenging, because query languages are only able to retrieve explicitly stored data, and not implicitly derived information.
GRAKN.AI is the database solution for working with complex data that intelligent applications rely on. Grakn allows an organisation to grow its competitive advantage by uncovering hidden knowledge that is too complex for human cognition. The data model can evolve as a business “learns” all while reducing engineering time, cost and complexity.
Meet Grakn and Graql
GRAKN.AI is composed of two parts: 1. Grakn (the storage), and 2. Graql (the language).
Grakn is a hyper-relational database, that provides an Enhanced-Entity-Relational (EER) schema to model extremely complex datasets. Grakn uses the EER model to understand the complete context of their relationships. Consequently, Grakn allows computers to process complex information more intelligently, through a higher level of abstraction.
Grakn allows you to model the real world and all the hierarchies and hyper-relationships contained in it.
Grakn’s schema/ontology modelling constructs include, but are not limited to, data type hierarchy, relation type hierarchy, bi-directional relationships, multi-type relationships, N-ary relationships, relationships in relationships, conditional relationships, virtual relationships, dynamic relationships, and so on. In other words, Grakn allows you to model the real world and all the hierarchies and hyper-relationships contained within it.
Grakn’s ontology functions as a data schema constraint that guarantees information consistency.
Grakn’s ontology is flexible, allowing a data model to evolve. Later in the year, we will show how you can have a machine learning system adapt and grow your Grakn ontology. Our ontology is a key feature in maintaining data quality, as it functions as a data schema constraint that guarantees information consistency.
Grakn is built using several distributed computing platforms, such as Apache Cassandra, Apache TinkerPop, Apache Spark, and Apache Hadoop. Grakn is designed to be sharded and replicated over a network of distributed machines.
Graql is Grakn’s declarative, reasoning (through OLTP) and analytics (through OLAP) query language that creates a much higher level abstraction over complex relationships. Graql knowledge-oriented query language.
Graql allows you to derive implicit information that is hidden in your dataset, as well as reduce the complexity of that information.
When using legacy systems, database queries have to define the data patterns they are looking for explicitly. Graql, on the other hand, will translate a query pattern into all its logical equivalents and evaluate them against the database. This includes but is not limited to the inference of types, relationships, context, and pattern combination. In other words, Graql allows you to derive implicit information that is hidden in your dataset, as well as reduce the complexity of expressing intelligent questions. In effect, Graql helps finding new knowledge easy with concise and intuitive statements.
Graql is also capable of performing distributed analytics as part of the language, which allows you to perform analytics over large out of the box. These types of analytics are usually not possible without developing custom distributed graph algorithms that are unique to every use case.
One may consider Graql as an OLKP (OnLine Knowledge Processing) language, which combines both OLTP (OnLine Transaction Processing) and OLAP (OnLine Analytical Processing).