The query builder can be accessed via "My Corpora" or "Corpus Analysis" in the sidebar options. Select the desired corpus and click on the "Analyze" and then "Concordance" buttons to open the query builder.

The query builder uses the Corpus Query Language (CQL) to help you make a query for analyzing your texts. In this way, it is possible to filter out various types of text parameters, for example, a specific word, a lemma, or you can set part-of-speech tags (pos) that indicate the type of word you are looking for (a noun, an adjective, etc.). In addition, you can also search for structural attributes, or specify your query for a token (word, lemma, pos) via entity typing. And of course, the different text parameters can be combined.

Tokens and structural attributes can be added by clicking on the "+" button (the "input marker") in the input field or the labeled buttons below it. Elements added are shown as chips. These can be reorganized using drag and drop. The input marker can also be moved in this way. Its position shows where new elements will be added.
A "translation" of your query into Corpus Query Language (CQL) is shown below.

Advanced users can make direct use of the Corpus Query Language (CQL) by switching to "expert mode" via the toggle button.

Add new token to your Query

If you are only looking for a specific token, you can click on the left button and select the type of token you are looking for from the drop-down menu. By default "Word" is selected.

Word and Lemma

If you want to search for a specific word or lemma and the respective category is selected in the drop-down menu, you can type in the word or lemma of your choice in the input field. You can confirm your entry by clicking the Submit button on the right. You can also use the options below to modify your token request before pressing the submit button. These options are explained further here.

English pos, german pos or simple_pos

You can choose between the options "english pos", "german pos" and "simple_pos" to search for different parts-of-speech. You can find an overview of all tags under the "Tagsets" tab.

Empty Token

Here you can search for a token with unspecified attributes (also called wildcard token). This selection should never stand alone and should always be extended with an incidence modifier or stand in a larger query, because otherwise all possible tokens would be searched for and the program would crash.

Options for editing your query

You have the possibility to extend or specify the token you are searching for with certain factors. For this the query builder offers some fixed options. You can find more information about the options in the Corpus Query Language Tutorial.

Wildcard Character

A wildcard character replaces any character and is represented in the form of a dot.

Option Group

With an option group you can search for different variants of a token. The variants are not limited, so you can manually enter more options in the same format. "Option1" and "option2" must be replaced accordingly.

Incidence Modifiers

With the Incidence Modifiers you can determine the occurrence of single tokens. For example you can use "?" to indicate that the token occurs either not at all or once:
[word = "is"] [word="it"] [word="your"] [word="litte"]? [word = "dog"]
Here the word "little" should occur either once or not at all. With [word="dogs?"] the search is for "dog "or "dogs".

Ignore Case

With the check mark at Ignore Case the upper and lower case is ignored. This is marked with a "%c". By default (if not checked) it is case sensitive.

"or" & "and"

"Or" ( | ) and "and" ( & ) are conditions you can put on a token. With "or" one of the two conditions must be fulfilled, with "and" both conditions must be fulfilled. For example, the word must be called "will" AND be a verb, only then it will be displayed. Note that "and" is not responsible for lining up tokens in this case. For this you can simply string them together:
[word="I"] [word="will" & simple_pos="VERB"] [word="go"].

Tokens that have already been added can also be modified by clicking on the corresponding pen icon. Click on the "ignore case" box, for example, and the query builder will not differentiate between upper- and lower- case letters for that respective token. New conditions added apply to the most recent token information.

Add structural attributes to your query

You can use structural attributes to search specifically for structures in the text or to further narrow down your previous search query.

Sentence

With "Sentence" (<s></s>) you can search for sentences within your text. This search can of course be specified if you search for particular tokens or entities between the sentence tags (<s></s>). For example, you can search for sentences that contain only a noun, verb, and adjective.
Click on Sentence to add the sentence chips: Sentence Start and Sentence End. These mark where the sentence starts and ends. Use drag-and-drop to place them accordingly. When the Sentence attribute is added, the input marker will automatically be moved between the sentence chips. Use drag-and-drop as needed to continue your query at a different position.

Entities

With entities, i.e. units of meaning, you can search for text sections that contain more specific information, for example, persons, dates, or events. The codes for these categories can be selected using the drop-down menus. You can find an explanation of these abbreviations under the tab "Tagsets".
You can also search for unspecified entities by selecting "Add entity of any type".

Click on the Entity button to add the entity chips Entity Type= and Entity End.

The entity type can be changed by clicking on the pen symbol on the chip. When the Entity attribute is added, the input marker will automatically be moved between the entity chips. Use drag-and-drop as needed to continue your query at a different position.

Meta Data (currently unavailable)

The meta data function is being worked on and cannot currently be used!
With the meta data you can annotate your text and add specific conditions. You can select a category on the left and enter your desired value on the right. The selected metadata will apply to your entire request and will be added at the end.

General Options of the query builder

You have several options to edit your query after adding it to the preview.

Editing the elements

You can edit your query chips by clicking on the pen icon.

Deleting the elements

You can delete the added elements from the query by clicking the X behind the respective content.

Move the elements of your query

You can drag and drop elements to customize your query.

Setting an incidence modifier

With the incidence modifier option, you can specify the amount of times a token should appear in your query. This is particularly relevant for empty tokens (tokens with unspecified attributes). Click on a token (blue chip) and select the desired option from the list to add an incidence modifier. To close the list without adding anything, click on the token again.

Switching between Query Builder and Expert mode

To work with the plain Corpus Query Language instead of using the Query Builder, click on the "expert mode" switch. Your query can be entered into the input field. All elements previously added will be carried over into expert mode. Click on the switch again to switch back to the Query Builder if desired. All recognized elements will be parsed into chips; those not recognized will be deleted from the query.

Query Builder

Overview

Content