Treebank Statistics: UD_Turkish-IMST: POS Tags: AUX
There are 4 AUX
lemmas (0%), 149 AUX
types (1%) and 1121 AUX
tokens (2%).
Out of 14 observed tags, the rank of AUX
is: 14 in number of lemmas, 8 in number of types and 11 in number of tokens.
The 10 most frequent AUX
lemmas: i, mi, değil, ol
The 10 most frequent AUX
types: mi, değil, mı, dır, dir, ydi, dı, ydı, tu, mu
The 10 most frequent ambiguous lemmas: i (AUX 785, CCONJ 35, NOUN 3), değil (AUX 106, CCONJ 30, VERB 15), ol (VERB 886, AUX 2)
The 10 most frequent ambiguous types: değil (AUX 63, CCONJ 30, VERB 7), dır (AUX 63, ADP 10), ise (CCONJ 45, AUX 21), değildir (AUX 16, VERB 2), değildi (AUX 15, VERB 1), ti (AUX 10, NOUN 1), değilim (AUX 8, VERB 5), lar (ADP 15, AUX 7), ler (ADP 11, AUX 6, NOUN 1), iz (AUX 4, NOUN 2)
- değil
- dır
- ise
- değildir
- değildi
- ti
- değilim
- lar
- ler
- iz
Morphology
The form / lemma ratio of AUX
is 37.250000 (the average of all parts of speech is 2.776562).
The 1st highest number of forms (119) was observed with the lemma “i”: ‘di, ‘dı, ‘dır, ‘tı, ‘ydi, ‘ydı, akutistan’dayım, ayoşmuş, değil, değildi, değildir, değilim, değilmiş, di, dik, dim, din, dir, du, duk, dular, dum, dur, durlar, dü, düm, dür, dı, dım, dır, dırlar, edendir, edenmiş, edir, erdeydin, eredesin, eredesiniz, eredeydi, etinlerdir, eydi, eymiş, idi, im, immiş, imse, imsiniz, ir, irerken, ise, iz, ken, lar, lardır, ledir, ler, lerdir, lırsınız, miş, müş, mış, mışsın, okurken, ostakoviç’miş, s’ın, sa, se, sem, sin, sinizdir, siyse, sun, sunuz, sın, sınız, ti, tir, tu, tum, tur, tü, tür, tı, tılar, tım, tır, usevisin, ydi, ydik, ydiler, ydim, ydin, ydu, ydum, ydü, ydı, ydık, ydılar, ydım, ydınız, yim, yiz, yken, ymiş, ymişçesine, ymuş, ymış, ymışım, ymışız, ysa, yse, yum, yuz, yüz, yım, yız, üdür, üm, ım, ız.
The 2nd highest number of forms (23) was observed with the lemma “mi”: mi, misin, misiniz, miydi, miydin, miyim, miyiz, miymiş, mu, musun, musunuz, muydu, muyum, mü, mı, mıdır, mısın, mısınız, mıydı, mıydım, mıymış, mıyım, mıyız.
The 3rd highest number of forms (10) was observed with the lemma “değil”: değil, değildi, değildim, değildir, değilim, değiliz, değiller, değilmiş, değilse, değilsin.
AUX
occurs with 8 features: Mood (1095; 98% instances), Tense (1094; 98% instances), Aspect (1092; 97% instances), Number (1075; 96% instances), Person (1075; 96% instances), Polarity (113; 10% instances), Evident (37; 3% instances), VerbForm (23; 2% instances)
AUX
occurs with 17 feature-value pairs: Aspect=Perf
, Evident=Nfh
, Mood=Cnd
, Mood=Des
, Mood=Gen
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Past
, Tense=Pres
, VerbForm=Conv
, VerbForm=Part
AUX
occurs with 38 feature combinations.
The most frequent feature combination is Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past
(260 tokens).
Examples: ydi, dı, ydı, tu, ydu, tı, di, ti, du, mıydı
Relations
AUX
nodes are attached to their parents using 9 different relations: cop (849; 76% instances), aux:q (218; 19% instances), root (22; 2% instances), nmod (11; 1% instances), conj (8; 1% instances), aux (6; 1% instances), compound (5; 0% instances), ccomp (1; 0% instances), compound:lvc (1; 0% instances)
Parents of AUX
nodes belong to 12 different parts of speech: NOUN (411; 37% instances), ADJ (319; 28% instances), VERB (171; 15% instances), PRON (77; 7% instances), ADV (76; 7% instances), (22; 2% instances), ADP (16; 1% instances), PROPN (13; 1% instances), NUM (8; 1% instances), AUX (5; 0% instances), CCONJ (2; 0% instances), DET (1; 0% instances)
1051 (94%) AUX
nodes are leaves.
38 (3%) AUX
nodes have one child.
12 (1%) AUX
nodes have two children.
20 (2%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 5.
Children of AUX
nodes are attached using 16 different relations: punct (72; 51% instances), nsubj (18; 13% instances), obj (11; 8% instances), conj (10; 7% instances), advcl (5; 4% instances), aux:q (5; 4% instances), advmod (4; 3% instances), nmod (4; 3% instances), advmod:emph (2; 1% instances), case (2; 1% instances), cc (2; 1% instances), parataxis (2; 1% instances), amod (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances), obl (1; 1% instances)
Children of AUX
nodes belong to 11 different parts of speech: PUNCT (72; 51% instances), NOUN (25; 18% instances), VERB (13; 9% instances), ADJ (11; 8% instances), AUX (5; 4% instances), CCONJ (5; 4% instances), ADV (3; 2% instances), ADP (2; 1% instances), PRON (2; 1% instances), PROPN (2; 1% instances), DET (1; 1% instances)