Query Language
Overview
Language concepts
For the query language, we reuse the concepts of graph patterns (which is a key concept in many graph transformation tools) as a concise and easy way to specify complex structural model queries. These graph-based queries can capture interrelated constellations of EMF objects, with the following benefits:
- the language is expressive and provides powerful features such as negation or counting,
- graph patterns are composable and reusable,
- queries can be evaluated with great freedom, i.e. input and output parameters can be selected at run-time,
-
some frequently encountered shortcomings of EMF’s interfaces are addressed:
- easy and efficient enumeration of all instances of a class regardless of location,
- simple backwards navigation along all kinds of references (even without eOpposite)
- finding objects based on attribute value.
Basic documentation
The current version of the IncQuery Graph Pattern language (IQPL) owes many of its syntax to the VTCL language of model transformation framework VIATRA2. If you would like to read more on the foundations of the new language, we kindly point you to our ICMT 2011 paper (important note: the most up-to-date IncQuery language syntax differs slightly from the examples of the ICMT paper).
Short syntax guide
See also the language tutorial and the School example.
-
Import declarations are required to indicate which EMF packages are referenced in the query definitions.
-
Use the registered namespace URI for import declarations. Content assist is available:
import "http://my.own.ePackage.nsUri/1.0"
-
Use the registered namespace URI for import declarations. Content assist is available:
-
Enclose pattern definitions in a package:
package my.own.patterns
-
Introduce a pattern by the pattern keyword, a pattern name, and a list of parameter variables. Then enclose in curly braces a list of constraints that define when the pattern should match.
pattern myPattern(A,B,C) = {...pattern contraints...}
-
Disjunction ("or") can be expressed by linking several pattern bodies with the or keyword:
pattern myPattern(A,B,C) = {... pattern contraints ...} or {... pattern constraints ...}
-
The most basic pattern constraints are type declarations: use EClasses, ERelations and EAttributes. The data types should also be fine.
-
An EClass constraint expressing that the variable MyEntityVariable must take a value that is an EObject of the class MyClass (from EPackage my.own.ePackage, as imported above) looks like this:
MyClass(MyEntityVariable);
-
A relation constraint for the EReference MyReference expressing that MyEntityVariable is of eClass MyClass and its MyReference EReference is pointing to TheReferencedEntity (or if MyReference is many-valued, then it is one of the target object contained in the EList), as seen below:
MyClass.MyReference(MyEntityVariable, TheReferencedEntity);
-
A relation constraint for an EAttribute, asserting that TheAttributeVariable is the String/Integer/etc. object that is the MyAttribute value of MyEntityVariable, looks exactly the same as the EReference constraint:
MyClass.MyAttribute(MyEntityVariable, TheAttributeVariable);
-
Such reference navigations can be chained; the last step may either be a reference or attribute traversal:
MyClass.MyReference.ReferenceFromThere.AnotherReference.MyAttribute(MyEntityVariable, MyString);
-
Pattern parameters can be suffixed by a type declaration, that will be valid in each pattern body. Here is an alternative way to express the type of variable B:
pattern myPattern(A,B : MyClass,C) = {...pattern contraints...}; -
(You will probably not need this, but EDatatype type constraints can be applied on attribute values, with a syntax similar to that used for EObjects, and with the additional semantics that the attribute value must come from the model, not just any int/String/etc. computed e.g. by counting):
MyDatatype(MyAttributeVariable);
or for the built-in datatypes (import the Ecore metamodel):
EString(MyAttributeVariable);
-
An EClass constraint expressing that the variable MyEntityVariable must take a value that is an EObject of the class MyClass (from EPackage my.own.ePackage, as imported above) looks like this:
-
By default, each variable you define may be equal to each other variable in a query. This is especially important to know when using attributes or multiple variables with the same type (or supertype).
- For example, if you have two variables MyClass(SomeObj1), MyClass(SomeObj2), SomeObj1 and SomeObj2 may match to the same EObject.
-
If you want to declare that two variables mustn't be equal, you can write:
SomeObj1 != SomeObj2;
-
If you want to declare, that two variables must be the same, you can write:
SomeObj1 == SomeObj2;
-
Pattern composition: you can reuse a previously define pattern by caling it in a different pattern's body:
find otherPattern(OneParameter, OtherParameter, ThirdParameter);
-
You can express negation with the neg keyword. Those actual parameters of the negative pattern call that are not used elsewhere in the calling body will be quantified; this means that the calling pattern only matches if no substitution of these calling variables could be found. See examples in order to understand. The below constraint asserts that for the given value of the (elsewhere defined) variable MyEntityVariable, the pattern neighborPattern does not match for any values of OtherParameter (not mentioned elsewhere).
neg find neighborPattern(MyEntityVariable, OtherParameter);
-
In the above constraints, wherever you could use an (attribute) variable in a pattern body, you can also use:
-
Constant literals of primitive types, such as
10
, or"Hello World"
. -
Constant literals of enumeration types, such as
MyEEnum::MY_LITERAL
-
Aggregation of multiple matches of a called pattern into a single value. Currently match counting is supported, in a syntax analogous to negative pattern calls:
HowManyNeighbors == count find neighborPattern(MyEntityVariable, OtherParameter);
- Attribute expression evaluation: coming soon.
-
Constant literals of primitive types, such as
-
Additional attribute constraints using the check() construct:
check((A as Integer) > (S as String).length());
-
One can also use the transitive closure of binary patterns in a pattern call, such as the transitive closure of pattern friend:
find friend+(MyGuy, FriendOfAFriendOfAFriend);
Limitations (as of IncQuery 0.6)
- Meta-level queries (instanceOf etc.) will not currently work (although Ecore models can be processed as any other model).
-
Make sure that the result of the check() expressions can change ONLY IF one of the variables defined in the query changes.
- In practice, a good rule of thumb is to only use attribute variables or other scalar values in a check(), no EObjects.
-
In particular, do not call non-constant methods of EObjects in a check(). Use attribute values instead, if necessary converted to the native type using (SomeInt as Integer) and co, so as to help the type inference.
- For example, you CAN use check((Name as String).contains(SomeString as String)).
- But You MUSTN'T use check((SomeObject as MyClass).name.contains((SomeString as String)) as the name of SomeObject can change without SomeObject changing!
- Use only well-behaving derived references or attributes. Better yet, reimplement the derived feature using queries. Regular derived features are not supported in patterns (except the ones in the Ecore metamodel, which are well-behaving by default) as they can have arbitrary Java implementations and EMF-IncQuery is unable to predict when their value will change.
See advanced issues for additional topics, such as attribute handling.
Step-by-step tutorial
Note: the built-in cheat sheet of EMF-IncQuery should also provide help in getting started.
Creating your first query
- src (for manually written code)
- src-gen (for generated code)
Using other structural constraints
Note, that currently this will cause type inference for the generated Match class to fail, so the parameters Cls and Name will be returned as Objects.
Check expressions and constants
It is possible to add constraints for the values of attribute variables in a query, these are called check expressions in EMF-IncQuery. For example, if you would like to ensure, that the name of the selected EClass is "MyClass", it can be done as follows:
Similarly, you can use other methods inside a check, for example compare numbers (after similar casting) or even use methods of the attribute values, such as String.contains(). Note, that you should only use methods which (1) have deterministic results, so they return the same value for the same EObject and (2) their results only change when the EObject itself changes, otherwise incremental updates will not work correctly.
If you only need equality checks for attribute values, you can simply replace the variable with the constant value, e.g.:
Injectivity constraints
By default, if you have two variables in the same query with the same type, then these variables may be matched to the same EObject during evaluation. This behavior is called injectivity and can be demonstrated with the following example:
In this case, if the MyClass EClass has an attribute, then the query will have a match where Cls=MyClass and Cls2=MyClass.
If you want to explicitely declare, that two variables must always take on the same value, you can write:
Although in such cases you can just remove Cls2 and use Cls in each occurrence.
It is a more interesting case, when you declare, that two variables must never be the same. This can be written like this:
Query composition
It is possible to create query compositions, where a complex query only matches if other queries have matches with given parameters. This allows the separation of concerns in a complex query and even increases the performance of the evaluation if done correctly.
For example, the examples from the previous section can be implemented by reusing queries from earlier sections, e.g.:
It is important, that you can use the same variable in multiple find expressions, e.g.:
Negative application conditions
Finally, you can use negative application conditions (NAC), for indicating that if a given query matches with the given parameters, then the query containing the NAC must not match. For example:
Here, the MyClass EClass must not have any attributes, since that would mean a match for the negative application condition. Note that the Attr variable in the NAC is not used anywhere else in the query. In such cases, the Attr variable will not have a concrete value inside the query, as no positive constraints for its existence is given. For illustration, the following query is NOT equal to the previous one:
Note, that here, Attr must exist, but it must not be an eAttribute of Cls, since that would mean the NAC matches.
Non-existence of attributes
Unset or null-valued attributes (or references) simply won't match, as there is no referenced EObject or attribute value to substitute in the target pattern variable. If you are especially looking for these, use a negative application condition, e.g.:
Aggregation of match sets
In some cases, you may want to check that there is a given number of matches for a query and you are not interested what exactly the matches are. In these cases, you can use aggregation to assign the number of matches to a variable. For example, you can count the number of attributes for a class:
The result of count is always an Integer, so you can use it as a regular variable in other parts of the query, e.g. in a check:
More complex Ecore query example
For a more complex example on Ecore queries, see our dedicated example.