G-CORE: A Core for Future Graph Query Languages

Published 5 Dec 2017 in cs.DB | (1712.01550v2)

Abstract: We report on a community effort between industry and academia to shape the future of graph query languages. We argue that existing graph database management systems should consider supporting a query language with two key characteristics. First, it should be composable, meaning, that graphs are the input and the output of queries. Second, the graph query language should treat paths as first-class citizens. Our result is G-CORE, a powerful graph query language design that fulfills these goals, and strikes a careful balance between path query expressivity and evaluation complexity.

Abstract PDF Upgrade to Chat

Citations (188)

View on Semantic Scholar

Summary

An Academic Insight into G-CORE: A Core for Future Graph Query Languages

The paper "G-CORE: A Core for Future Graph Query Languages" is authored by a collaboration of experts from both industry and academia, underlining the collective effort within the Linked Data Benchmark Council's (LDBC) Graph Query Language Task Force. It seeks to address notable gaps in existing graph query languages by proposing G-CORE, a language aimed at setting foundational principles that guide future developments in graph query capabilities.

Core Attributes of G-CORE

The paper details the essential elements believed to be critical for advancing graph query languages. Two primary characteristics are emphasized: composability and the treatment of paths as first-class citizens. Composability refers to the ability of graph queries to seamlessly integrate both the input and output as graphs, fostering greater interoperability, simplification in abstraction, and productivity through query reuse. Meanwhile, treating paths as primary entities elevates their significance in analyzing relationships within graphs, which can be manipulated, returned, and annotated directly.

Structuring G-CORE

The authors elaborate on the Path Property Graph model, extending the prevalent property graph model by incorporating paths directly into the data structure. In this model, paths—not merely nodes and edges—are endowed with properties and identities, enhancing the descriptive power of property graphs to also include complex routes. Notably, this extension is backward-compatible, ensuring that existing property graphs can easily transition to this advanced framework.

Within G-CORE, paths can be retrieved and analyzed with sophisticated queries initially targeting specific search patterns, which have applications like social network analysis, fraud detection, or logistics optimization. The careful design retains usability on large data sets by ensuring polynomial-time evaluation of queries.

Addressing Challenges

By leveraging a foundation of straightforward path expressions and cost evaluation mechanisms, G-CORE provides sound complexity assurances in data processing. The language is structured to manage complex graphs without succumbing to computational intractability, thereby merging practical implementations in existing systems with theoretical data complexities.

Graph Operations and Extensions

The document rigorously defines various components, including the operations of intersection, union, and difference on property graphs, necessary for complex graph manipulations. Also proposed are future extensions, contemplating aspects like tabular data integration and tabular projections. These would facilitate interactions with conventional data management systems, widening practical application scenarios without deviating from the language's graph-centric core.

Implications and Future Directions

The practical implications of G-CORE stretch towards streamlining processes across industries — from logistics and telecommunications to healthcare and financial sectors — where graph data structures are integral. On a theoretical plane, this language stands to catalyze standardization efforts in graph data querying, encouraging uniformity without stifling innovation.

Envisioned as a non-proprietary guide rather than a formal standard proposal, G-CORE's framework is positioned as a pivotal influence in refining and enhancing the efficacy and expressive power of graph data handling. As graph databases continue to expand in scope and adoption, integrating the G-CORE principles could introduce a coherent and sustainable approach to querying complex networks.

In conclusion, the evolution prompted by G-CORE not only facilitates advanced querying but also ensures analytical breadth and depth while navigating the burgeoning graph databases landscape. This paper, in essence, is both a comprehensive study and a roadmap for future explorations in graph query languages, unifying theoretical depth with practical applicability.

Markdown Report Issue