Querying and constructing multiple graphs #241

Open

boggle wants to merge 83 commits into opencypher:master from boggle:CIP2017-06-18-multiple-graphs

Contributor

boggle commented Jul 2, 2017 (edited)

This is a proposal for making Cypher work with multiple graphs.

It is part of the Cypher redesign targeting Cypher 10, which adds support for working with multiple graphs.

View latest version of CIP from associated branch

boggle added 2 commits

June 27, 2017 19:36


          First early draft for Cypher support for working with multiple graphs

56f2d90

This covers a lot of ground:

* Data model
* Language execution model
* Working with named graphs
* Declarative Graph Construction
* Graph composition
* New Patterns: Optional Copy Patterns
* New Patterns: Merge Patterns
* Create, update, modify persistent graphs


          Reflect recent discussions

ce09cf5

boggle added CIP enhancement NOT READY FOR MERGE NOT READY FOR REVIEW labels

boggle changed the title ~~CIP2017-06-18 Multiple Graphs~~ CIP2017-06-18: Multiple Graphs

boggle force-pushed the CIP2017-06-18-multiple-graphs branch 3 times, most recently from 2498907 to 7332c02 Compare

July 2, 2017 23:21


          Minor changes and clarifications

98d78b6

boggle force-pushed the CIP2017-06-18-multiple-graphs branch from 7332c02 to 8459014 Compare

July 3, 2017 07:42


          Remove graph space notion and consolidate update semantics section

4714ca6

boggle force-pushed the CIP2017-06-18-multiple-graphs branch from 8459014 to 4714ca6 Compare

July 3, 2017 08:12

Mats-SX reviewed

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              === (Property) Graph
+              _Definition_ A *property graph* is a set of labeled nodes and typed relationships both together with their properties (a property is a tuple of a named key and a value).
+              Graphs may be updatable, i.e. the set of contained nodes and relationships may change during the lifetime of the graph.

Member

Mats-SX Jul 3, 2017

This section should probably link to the PGM spec in our repo.

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              It is an error to attempt to update a read-only graph.
+              The same node or relationship may be part of many graphs.
+              A relationship may only be part of a graph if it's start node and it's end node are both also part of the same graph.

Member

Mats-SX Jul 3, 2017

it's -> its

or rephrased:

if its source and target nodes are both also ...

Contributor Author

boggle Jul 3, 2017

That always trips me up :)

Member

Mats-SX Jul 3, 2017

Yeah, used to trip me up too, but then I learned that it's == it is, so in case you're unsure, just spell it out and it'll become apparent :)

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              The same node or relationship may be part of many graphs.
+              A relationship may only be part of a graph if it's start node and it's end node are both also part of the same graph.
+              Therefore removing a node from a graph may require removing some of it's relationships from the graph, too.

Member

Mats-SX Jul 3, 2017

it's -> its

It not only may, it will require removing all of them. Or rephrased:

Thus, removing a node from a graph will require removing all of its relationships from that graph, too.
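The invariant discussed in this thread can be sketched in a few lines of Python. This is only an illustration and not part of the CIP: the `Graph` class, its method names, and the string identifiers are all hypothetical, and properties and labels are left out.

```python
class Graph:
    """Toy graph (hypothetical): node ids plus relationships keyed by id."""

    def __init__(self):
        self.nodes = set()
        self.rels = {}  # rel_id -> (start_node, end_node)

    def add_node(self, node_id):
        self.nodes.add(node_id)

    def add_rel(self, rel_id, start, end):
        # A relationship may only be part of a graph if both its start
        # node and its end node are part of the same graph.
        if start not in self.nodes or end not in self.nodes:
            raise ValueError("both endpoints must be part of the graph")
        self.rels[rel_id] = (start, end)

    def remove_node(self, node_id):
        # Removing a node removes every relationship attached to it;
        # otherwise the endpoint invariant above would be violated.
        self.rels = {
            rel_id: (start, end)
            for rel_id, (start, end) in self.rels.items()
            if node_id not in (start, end)
        }
        self.nodes.discard(node_id)


g = Graph()
for n in ("a", "b", "c"):
    g.add_node(n)
g.add_rel("r1", "a", "b")
g.add_rel("r2", "b", "c")
g.remove_node("a")  # also drops r1, which touched node "a"
```

After the last call, only `r2` remains, since it is the only relationship whose endpoints are both still in the graph.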

cip/CIP2017-06-18-multiple-graphs.adoc Outdated


		Graphs do not expose an identity like nodes or relationships do.

		Graphs may be made addressable through other means by a conforming implementation (e.g. through exposing the graph under a _graph URL_ for referencing and loading it).

Member

Mats-SX Jul 3, 2017

I suggest unwrapping the example from parentheses.

cip/CIP2017-06-18-multiple-graphs.adoc Outdated


		With this terminology in place, execution of a parameterized Cypher query in the single graph execution model can be described as executing within (and operating on) a given execution context and an initial query context and finally returning the query context produced as output for the top-most `RETURN` clause.

		Note: This formulation is introduced to describe a high-level model for the execution of queries; A real world implementation is free to choose any other internal representation (e.g. based on an algebra) as long as it does not violate the specified semantics.

Member

Mats-SX Jul 3, 2017

A -> a (not capitalised)

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              * `<graph-specifier-list>`: A comma separated list of `<graph-specifier>` that are to be passed on
+              * `*`: All named graphs are to be passed on
+              * `*, <graph-specifier-list>`: All named graphs are to be passed on together with any additional named graphs that are newly bound in `<graph-specifier-list>`
+              * `-`: No named graphs are to be passed on

Member

Mats-SX Jul 3, 2017

I'm reading this as `GRAPHS` being optional (which I support). What is the point of `GRAPHS -` if we can just leave it out?

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              This in essence mirrors the semantics for tabular data returned by Cypher.
+              Both `WITH ... GRAPHS ...` and `RETURN ... GRAPHS ...` will pass on (or return respectively) exactly the set of described named graphs.
+              To simplify passing on available graphs it is proposed by this CIP that regular `WITH <return-items>` is taken to be syntactic sugar for `WITH <return-items> GRAPHS -` and that regular `RETURN <return-items>` is taken to be syntactic sugar for `RETURN <return-items> GRAPHS -`.

Member

Mats-SX Jul 3, 2017

What is the point of having the long form `GRAPHS -` as the normal form and calling its omission syntactic sugar? Why not say that leaving it out is the normal form, and that the other forms modify that?

Contributor Author

boggle Jul 3, 2017 (edited)

I think the procedures CIP (?) added `-` for procedures that do not return any columns. Similarly, researchers have suggested that it is an omission on SQL's part to be unable to return no columns (more so for Cypher, where the single-row, field-less table plays a special role in starting off queries). In light of this, I added it for no reason other than consistency with these other decisions.

Member

Mats-SX Jul 3, 2017

But for procedures there was actually a need for `YIELD -`, in order to avoid implicit conflicts with variables that were in scope, as I recall. My personal preference would be to use the empty string to denote the intention in this proposal.
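For readers following the thread, the sugar rule in the quoted draft text (a plain `WITH <return-items>` or `RETURN <return-items>` standing for the same clause with `GRAPHS -` appended) can be sketched as a small rewrite step. This is a rough Python illustration under stated assumptions, not how any implementation works: `desugar_projection` is a hypothetical name, and the whitespace-based keyword check stands in for a real parser.

```python
def desugar_projection(clause: str) -> str:
    """Expand `WITH <items>` / `RETURN <items>` to the explicit `GRAPHS -` form.

    Hypothetical helper that mirrors the draft wording under review;
    it operates on plain strings rather than a parsed query.
    """
    keyword, _, rest = clause.strip().partition(" ")
    if keyword.upper() not in ("WITH", "RETURN"):
        raise ValueError("expected a WITH or RETURN clause")
    # Naive check: a real parser would inspect tokens, not split on whitespace.
    if "GRAPHS" in rest.upper().split():
        return f"{keyword} {rest}"  # already in the long form
    return f"{keyword} {rest} GRAPHS -"  # no named graphs are passed on


plain = desugar_projection("WITH a, b")
explicit = desugar_projection("RETURN a GRAPHS foo")
```

Under this reading, the short form and the long form denote the same thing, which is why the thread debates which of the two should be treated as the normal form.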

cip/CIP2017-06-18-multiple-graphs.adoc Outdated

+              Both `WITH ... GRAPHS ...` and `RETURN ... GRAPHS ...` will pass on (or return respectively) exactly the set of described named graphs.
+              To simplify passing on available graphs it is proposed by this CIP that regular `WITH <return-items>` is taken to be syntactic sugar for `WITH <return-items> GRAPHS -` and that regular `RETURN <return-items>` is taken to be syntactic sugar for `RETURN <return-items> GRAPHS -`.
+              To even further simplify, it is additionally proposed that `WITH|RETURN <return-items> INPUT GRAPHS <graph-return-items>` is to be syntactic sugar for `WITH|RETURN <return-items> GRAPHS <graph-return-items>, SOURCE GRAPH, TARGET GRAPH`.

Member

Mats-SX Jul 3, 2017

I'm not convinced of the usefulness of this syntactic sugar -- I find that it is hard to know what kind of queries will be prominent in this new model. In general, I think that it would be useful to have a little less focus on the syntactic sugar bits, and more on the core model. Syntactic sugar additions could always follow later.

Contributor Author

boggle Jul 3, 2017

Let's revisit how default graphs are handled as a group first - this may very well remove the need for this. In short I added this as a simple way for a query to say: "I'm ok to run on any incoming graphs and am happy to pass those on, just give 'em some names for me". Without this sugar, expressing this becomes rather verbose.

Member

Mats-SX Jul 3, 2017

I'm not against the sugar per se, I just find it difficult to assess whether a particular piece of sugar is valuable this early in the process of defining these very new concepts, and so I'm leaning towards skepticism in general. I find it is peripheral to the contents of the CIP anyway.

cip/CIP2017-06-18-multiple-graphs.adoc Outdated


		=== Discarding available tabular data

		It is additionally proposed that both `WITH GRAPHS <graph-return-items>` and `RETURN GRAPHS <graph-return-items>` are syntactic sugar for `WITH - GRAPHS <graph-return-items>` (and `RETURN - GRAPHS <graph-return-items>` respectively).

Member

Mats-SX Jul 3, 2017

I feel the same about this as about `GRAPHS -`; I prefer the absence of `-` to its presence in this context.

cip/CIP2017-06-18-multiple-graphs.adoc Outdated


		However, the change has been carefully designed to not change the semantics of existing queries.

		== Alternatives

Member

Mats-SX Jul 3, 2017

I think this and subsequent sections are superfluous since the introduction of CIRs. We should modify our template.

Member

Mats-SX commented Jul 3, 2017

Great work putting these concepts into spec!

Mats-SX mentioned this pull request

Syntax and AST for Multiple Graphs Cypher neo4j/neo4j#9629

Merged

boggle and others added 8 commits

August 3, 2017 14:44


          Move into correct directory

581f192


          Language changes

343ff7d


          Clarified semantics of UNION

4cdb3dc


          Removed INPUT GRAPHS syntax

a756fcf


          Reworked the syntax to be a little more flexible

71ad96e

- Homogenized graph specifier syntax
- Added DEFAULT GRAPH
- WITH, RETURN can also return comma separated list of graphs without
  leading `GRAPHS` if bound graphs are prefixed with `GRAPH`,
  i.e. RETURN a, b, c COPY OF GRAPH foo is possible


          Simplified aliasing and returning graphs syntax definition

be361f6


          Tweaked syntax as per internal discussion

972ceaf

- COPY .. TO ..
- Allow FROM <name> AS <new-name> (wo leading GRAPH)
- Allow INTO <name> AS <new-name> (wo leading GRAPH)


          Added more meaningful titles to existing examples

4c0d1d2

petraselmer force-pushed the CIP2017-06-18-multiple-graphs branch 2 times, most recently from 54286fa to b402f1d Compare

August 4, 2017 21:23


          Added first draft of detailed example

f8fcc8e

- The jpg files ought to be moved elsewhere at a later stage

petraselmer force-pushed the CIP2017-06-18-multiple-graphs branch 2 times, most recently from 335a474 to 3258b3b Compare

August 5, 2017 08:36

boggle and others added 24 commits

April 30, 2018 17:27


          Reworked references vs identity a bit

2327c3b

WIP

4bc50dd


          Updated definitions

8ff91a1


          Shuffle structure

fa9e85f


          More structure work

4da00d2


          Added Data model text

85c3740


          Tiny fixes

3df1eb8


          Clone => Replicate

d831bfc


          Juxtaposition, simplification, introspection functions

3af673a


          Fixed composite statement definition

93517e5


          Fix definitions around composite statements

cde04b7


          Polished Abstract, Intro, Data Model

7b2237c


          Removed catalog section from CIP

a18a2a2


          Polished Query structure and Execution model

11d9531


          Added Overview subsection

bdb3df3


          local declarations and simple statement chains

fec7c6e


          Polished Basic graph operations

5cc373e


          Merge branch 'CIP2017-06-18-multiple-graphs-devel' of github.com:boggle/openCypher into CIP2017-06-18-multiple-graphs-devel

97cd398


          Links in Graph construction

739a435


          Polishing of Graph construction - Take I

53257da


          Polishing of Graph construction - Take II

ed1a1aa


          Addressed TODOs

1c06ab9


          Updated image

d96f928


          Fix up errors

c5b8e42

boggle force-pushed the CIP2017-06-18-multiple-graphs branch from 7f258be to c5b8e42 Compare

May 8, 2018 07:56

boggle added oCIG cypher10 and removed NOT READY FOR REVIEW labels

boggle changed the title ~~CIP2017-06-18: Multiple Graphs~~ Querying and constructing multiple graphs

linsimiao commented Dec 26, 2019

Hi all, I have read your documentation and found it easy to confuse CONSTRUCT with UPDATE. I wonder whether the following two Cypher statements mean the same thing.

FROM xxx match (a:Person) UPDATE GRAPH merge (b:Student{name:a.name})

and

FROM xxx match (a:Person) CONSTRUCT merge (b:Student{name:a.name})

Suppose the working graph is yyy: will both statements create nodes with the label Student in graph yyy?

Thanks for your reply.
