CQP Dialect used for CQP2SPARQL

The CQP dialect for this project is mostly based on the SketchEngine CQP dialect, which is described here: https://www.sketchengine.eu/documentation/corpus-querying/

Key differences from various standards

Logical operators can be both words and symbols:

Conjunction: and, &
Disjunction: or, |
Negation is currently only !

Tokens can have symbolic labels, not only numerical
Global constraints can be separated by any AND variant, or ::
Comments are allowed after the global constraints after the symbol #
Quantifying regular expressions can be applied to bracketed groups of tokens: ([] [])*

Names and conventions

Attributes are represented as :<attr_name>, where namespaces are predefined in config files of a specific tool: conll, rdfs, powla, etc.
Segments names are represented as :<type_name>, where namespaces are predefined in config files, and type_name corresponds to a specific type of which should be the enclosing segments.

Examples

Sentences that contain exactly two verbs: one with a lemma է (to be), and another one which is either a perfective converb or an imperfective converb:

(    v1:[conll:LEM='է' and rdfs:type='olia:Verb'] 
     v2:[a='eanc:ImperfectiveConverb' or rdfs:type='eanc:PerfectiveConverb']) 
     within 
         (<powla:sentence/> !containing v3:[rdfs:type='olia:Verb'])
) & v1.ID != v3.ID and v2.ID != v3.ID

Sentences that contain more than one recursive clauses with that.

(
	[conll:WORD='that']
	[]+
){2,}
within <powla:sentence/>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cqp_dialect.md

cqp_dialect.md

CQP Dialect used for CQP2SPARQL

Key differences from various standards

Names and conventions

Examples

Files

cqp_dialect.md

Latest commit

History

cqp_dialect.md

File metadata and controls

CQP Dialect used for CQP2SPARQL

Key differences from various standards

Names and conventions

Examples