
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-IMST: POS Tags: AUX

There are 4 AUX lemmas (0%), 149 AUX types (1%) and 1121 AUX tokens (2%). Out of 14 observed tags, the rank of AUX is: 14 in number of lemmas, 8 in number of types and 11 in number of tokens.
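The lemma, type and token counts above can in principle be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, ...). The function names `iter_words` and `count_pos` and the tiny `SAMPLE` text are hypothetical, and forms are compared as written, so any case normalization applied by the actual statistics script is not modeled here.

```python
# Hypothetical miniature CoNLL-U text; not taken from UD_Turkish-IMST.
SAMPLE = """\
# text = hypothetical example 1
1\tevde\tev\tNOUN\t_\tCase=Loc\t0\troot\t_\t_
2\tydi\ti\tAUX\t_\tTense=Past\t1\tcop\t_\t_

# text = hypothetical example 2
1\tgüzel\tgüzel\tADJ\t_\t_\t0\troot\t_\t_
2\tdeğil\tdeğil\tAUX\t_\tPolarity=Neg\t1\tcop\t_\t_
3\tdi\ti\tAUX\t_\tTense=Past\t1\tcop\t_\t_
"""


def iter_words(conllu_text):
    """Yield the 10 columns of every syntactic word in a CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols


def count_pos(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in iter_words(conllu_text):
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


print(count_pos(SAMPLE, "AUX"))
```

On the real treebank the text would be read from the `.conllu` files instead of `SAMPLE`.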

The 10 most frequent AUX lemmas: i, mi, değil, ol

The 10 most frequent AUX types: mi, değil, mı, dır, dir, ydi, dı, ydı, tu, mu

The 10 most frequent ambiguous lemmas: i (AUX 785, CCONJ 35, NOUN 3), değil (AUX 106, CCONJ 30, VERB 15), ol (VERB 886, AUX 2)

The 10 most frequent ambiguous types: değil (AUX 63, CCONJ 30, VERB 7), dır (AUX 63, ADP 10), ise (CCONJ 45, AUX 21), değildir (AUX 16, VERB 2), değildi (AUX 15, VERB 1), ti (AUX 10, NOUN 1), değilim (AUX 8, VERB 5), lar (ADP 15, AUX 7), ler (ADP 11, AUX 6, NOUN 1), iz (AUX 4, NOUN 2)

Morphology

The form / lemma ratio of AUX is 37.250000 (the average of all parts of speech is 2.776562).
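The ratio above is consistent with dividing the number of AUX types (149) by the number of AUX lemmas (4). A short sketch of that arithmetic, assuming this is how the figure is derived; the helper name `form_lemma_ratio` is hypothetical:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms (types) per distinct lemma."""
    if n_lemmas == 0:
        raise ValueError("no lemmas observed")
    return n_types / n_lemmas


# Counts for AUX as reported on this page: 149 types, 4 lemmas.
print(f"{form_lemma_ratio(149, 4):.6f}")  # prints 37.250000
```

Note that the per-lemma form counts listed below (119, 23, 10, ...) add up to more than 149, which suggests that a form shared by two lemmas, such as "değil", is counted once as a type.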

The 1st highest number of forms (119) was observed with the lemma “i”: ‘di, ‘dı, ‘dır, ‘tı, ‘ydi, ‘ydı, akutistan’dayım, ayoşmuş, değil, değildi, değildir, değilim, değilmiş, di, dik, dim, din, dir, du, duk, dular, dum, dur, durlar, dü, düm, dür, dı, dım, dır, dırlar, edendir, edenmiş, edir, erdeydin, eredesin, eredesiniz, eredeydi, etinlerdir, eydi, eymiş, idi, im, immiş, imse, imsiniz, ir, irerken, ise, iz, ken, lar, lardır, ledir, ler, lerdir, lırsınız, miş, müş, mış, mışsın, okurken, ostakoviç’miş, s’ın, sa, se, sem, sin, sinizdir, siyse, sun, sunuz, sın, sınız, ti, tir, tu, tum, tur, tü, tür, tı, tılar, tım, tır, usevisin, ydi, ydik, ydiler, ydim, ydin, ydu, ydum, ydü, ydı, ydık, ydılar, ydım, ydınız, yim, yiz, yken, ymiş, ymişçesine, ymuş, ymış, ymışım, ymışız, ysa, yse, yum, yuz, yüz, yım, yız, üdür, üm, ım, ız.

The 2nd highest number of forms (23) was observed with the lemma “mi”: mi, misin, misiniz, miydi, miydin, miyim, miyiz, miymiş, mu, musun, musunuz, muydu, muyum, mü, mı, mıdır, mısın, mısınız, mıydı, mıydım, mıymış, mıyım, mıyız.

The 3rd highest number of forms (10) was observed with the lemma “değil”: değil, değildi, değildim, değildir, değilim, değiliz, değiller, değilmiş, değilse, değilsin.

AUX occurs with 8 features: Mood (1095; 98% instances), Tense (1094; 98% instances), Aspect (1092; 97% instances), Number (1075; 96% instances), Person (1075; 96% instances), Polarity (113; 10% instances), Evident (37; 3% instances), VerbForm (23; 2% instances)

AUX occurs with 17 feature-value pairs: Aspect=Perf, Evident=Nfh, Mood=Cnd, Mood=Des, Mood=Gen, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Part

AUX occurs with 38 feature combinations. The most frequent feature combination is Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past (260 tokens). Examples: ydi, dı, ydı, tu, ydu, tı, di, ti, du, mıydı
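The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The sketch below assumes the standard `Name=Value|Name=Value` encoding with `_` for "no features"; the names `parse_feats` and `feature_stats` and the example input are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Mood=Ind|Tense=Past' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_values):
    """Count feature names, feature=value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            names[name] += 1
            pairs[f"{name}={value}"] += 1
    return names, pairs, combos


# Hypothetical FEATS values for three AUX tokens.
example = [
    "Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past",
    "Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past",
    "Polarity=Neg",
]
names, pairs, combos = feature_stats(example)
print(combos.most_common(1))
```

Dividing each count in `names` by the number of tokens gives the percentage of instances reported per feature.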

Relations

AUX nodes are attached to their parents using 9 different relations: cop (849; 76% instances), aux:q (218; 19% instances), root (22; 2% instances), nmod (11; 1% instances), conj (8; 1% instances), aux (6; 1% instances), compound (5; 0% instances), ccomp (1; 0% instances), compound:lvc (1; 0% instances)
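As a sanity check, the relation counts above sum to the 1121 AUX tokens, and the percentages follow from rounding each share to the nearest integer. A small sketch using the counts from this page; the names `AUX_RELATIONS` and `shares` are hypothetical:

```python
from collections import Counter

# Relation counts for AUX nodes, copied from the list above.
AUX_RELATIONS = Counter({
    "cop": 849, "aux:q": 218, "root": 22, "nmod": 11, "conj": 8,
    "aux": 6, "compound": 5, "ccomp": 1, "compound:lvc": 1,
})


def shares(counts):
    """Map each relation to its percentage of all instances, rounded."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.most_common()}


print(sum(AUX_RELATIONS.values()))
for rel, pct in shares(AUX_RELATIONS).items():
    print(f"{rel}: {pct}%")
```

Relations with very few instances, such as compound (5 of 1121), round down to 0%, which is why they appear with "0% instances" in the list.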

Parents of AUX nodes belong to 12 different parts of speech: NOUN (411; 37% instances), ADJ (319; 28% instances), VERB (171; 15% instances), PRON (77; 7% instances), ADV (76; 7% instances), no part of speech, i.e. the artificial root (22; 2% instances), ADP (16; 1% instances), PROPN (13; 1% instances), NUM (8; 1% instances), AUX (5; 0% instances), CCONJ (2; 0% instances), DET (1; 0% instances)

1051 (94%) AUX nodes are leaves.

38 (3%) AUX nodes have one child.

12 (1%) AUX nodes have two children.

20 (2%) AUX nodes have three or more children.

The highest child degree of an AUX node is 5.
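The child-degree figures above (leaves, one child, two children, three or more) can be computed from the HEAD column: a node's degree is the number of words in the same sentence whose HEAD equals its ID. The sketch below assumes each sentence has already been reduced to `(id, upos, head)` tuples; the function names and the example sentence are hypothetical.

```python
from collections import Counter


def child_degrees(sentence_rows, upos):
    """Return the number of children of every node tagged `upos`.

    sentence_rows: list of (id, upos, head) tuples for ONE sentence.
    """
    n_children = Counter(head for _, _, head in sentence_rows)
    return [n_children[node_id]
            for node_id, pos, _ in sentence_rows if pos == upos]


def bucket(degrees):
    """Group degrees into the buckets '0', '1', '2' and '3+'."""
    buckets = Counter()
    for d in degrees:
        buckets["3+" if d >= 3 else str(d)] += 1
    return buckets


# Hypothetical sentence: node 2 (AUX) is the root and has two children.
rows = [(1, "NOUN", 2), (2, "AUX", 0), (3, "PUNCT", 2), (4, "AUX", 1)]
degrees = child_degrees(rows, "AUX")
print(degrees, dict(bucket(degrees)))
```

Degrees must be counted per sentence, because word IDs restart at 1 in every sentence. On this page the four buckets (1051, 38, 12 and 20) add up to the 1121 AUX tokens.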

Children of AUX nodes are attached using 16 different relations: punct (72; 51% instances), nsubj (18; 13% instances), obj (11; 8% instances), conj (10; 7% instances), advcl (5; 4% instances), aux:q (5; 4% instances), advmod (4; 3% instances), nmod (4; 3% instances), advmod:emph (2; 1% instances), case (2; 1% instances), cc (2; 1% instances), parataxis (2; 1% instances), amod (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), obl (1; 1% instances)

Children of AUX nodes belong to 11 different parts of speech: PUNCT (72; 51% instances), NOUN (25; 18% instances), VERB (13; 9% instances), ADJ (11; 8% instances), AUX (5; 4% instances), CCONJ (5; 4% instances), ADV (3; 2% instances), ADP (2; 1% instances), PRON (2; 1% instances), PROPN (2; 1% instances), DET (1; 1% instances)