home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-TueCL: POS Tags: NOUN

There are 95 NOUN lemmas (37%), 131 NOUN types (34%) and 226 NOUN tokens (25%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: kitap, ev, okul, çocuk, ki, anne, araba, sabah, arkadaş, kardeş

The 10 most frequent NOUN types: evde, kitap, kitabı, okula, sabah, annesini, eve, çocuk, doktor, ekmek

The 10 most frequent ambiguous lemmas: ki (NOUN 7, SCONJ 2), saat (ADV 1, NOUN 1)

The 10 most frequent ambiguous types: ki (NOUN 2, SCONJ 2), saat (ADV 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.378947 (the average of all parts of speech is 1.503846).

The 1st highest number of forms (5) was observed with the lemma “ev”: Evin, ev, evde, evdeki, eve.

The 2nd highest number of forms (5) was observed with the lemma “ki”: ki, kiler, kinden, kini, kinin.

The 3rd highest number of forms (4) was observed with the lemma “araba”: araba, arabalık, arabam, arabayı.

NOUN occurs with 6 features: Case (223; 99% instances), Number (223; 99% instances), Person[psor] (51; 23% instances), Number[psor] (50; 22% instances), Polarity (3; 1% instances), Person (1; 0% instances)

NOUN occurs with 17 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos

NOUN occurs with 32 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (91 tokens). Examples: kitap, sabah, çocuk, doktor, ekmek, Öğretmen, alkol, araba, bisiklet, erkek

Relations

NOUN nodes are attached to their parents using 17 different relations: obj (57; 25% instances), obl (51; 23% instances), nsubj (41; 18% instances), root (17; 8% instances), nmod (14; 6% instances), obl:tmod (13; 6% instances), nmod:poss (8; 4% instances), compound:lvc (5; 2% instances), conj (4; 2% instances), orphan (4; 2% instances), nsubj:pass (3; 1% instances), amod (2; 1% instances), ccomp (2; 1% instances), nsubj:outer (2; 1% instances), compound (1; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances)

Parents of NOUN nodes belong to 7 different parts of speech: VERB (145; 64% instances), NOUN (29; 13% instances), ADJ (28; 12% instances), (17; 8% instances), PROPN (4; 2% instances), AUX (2; 1% instances), PRON (1; 0% instances)

135 (60%) NOUN nodes are leaves.

62 (27%) NOUN nodes have one child.

11 (5%) NOUN nodes have two children.

18 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 6.

Children of NOUN nodes are attached using 19 different relations: punct (19; 13% instances), nmod (17; 11% instances), nsubj (17; 11% instances), amod (14; 9% instances), det (14; 9% instances), nmod:poss (13; 9% instances), aux:q (10; 7% instances), cop (8; 5% instances), advmod (7; 5% instances), aux (6; 4% instances), acl (5; 3% instances), cc (4; 3% instances), nummod (4; 3% instances), conj (3; 2% instances), advcl (2; 1% instances), advmod:emph (2; 1% instances), orphan (2; 1% instances), case (1; 1% instances), compound (1; 1% instances)

Children of NOUN nodes belong to 12 different parts of speech: NOUN (29; 19% instances), AUX (24; 16% instances), PROPN (21; 14% instances), PUNCT (19; 13% instances), ADJ (14; 9% instances), DET (14; 9% instances), ADV (9; 6% instances), VERB (6; 4% instances), CCONJ (4; 3% instances), NUM (4; 3% instances), PRON (4; 3% instances), ADP (1; 1% instances)