
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish_English-BUTR: POS Tags: NOUN

There are 78 NOUN lemmas (29%), 83 NOUN types (27%) and 87 NOUN tokens (20%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: şey, hoca, ders, kaşar, şarkı, aile, akıl, akşam, ara, aspect

The 10 most frequent NOUN types: şey, kaşar, Aklıma, Bro, Canım, De-Google, Derse, Dünyanın, Guys, Hoca

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.064103 (the average of all parts of speech is 1.154982).
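The ratio above can be reproduced from the counts reported at the top of this page. The sketch below assumes the ratio is the number of distinct NOUN word forms (types) divided by the number of distinct NOUN lemmas; the variable names are illustrative only.

```python
# Counts taken from the summary on this page:
# 83 NOUN types (distinct word forms) and 78 NOUN lemmas.
noun_types = 83
noun_lemmas = 78

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")
```

Under that assumption the result agrees with the reported value to six decimal places.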

The highest number of forms (3) was observed with the lemma “hoca”: Hoca, Hoca’nın, hocayı.

The 2nd highest number of forms (2) was observed with the lemma “ders”: Derse, dersini.

The 3rd highest number of forms (2) was observed with the lemma “şarkı”: Şarkıyı, şarkısı.

NOUN occurs with 5 features: Number (74; 85% instances), Case (61; 70% instances), Person (59; 68% instances), Number[psor] (16; 18% instances), Person[psor] (16; 18% instances)

NOUN occurs with 15 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Sing, Person=1, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 25 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (28 tokens). Examples: şey, kaşar, Hoca, Kanka, aile, akşam, cümle, diziydi, gece, gravyer
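A feature combination such as the one above is a single string in the FEATS column of a CoNLL-U file, with pairs separated by `|` and each pair written as `Feature=Value`. A minimal sketch of how such a string could be split into individual feature-value pairs follows; the helper name `parse_feats` is hypothetical and not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats == "_":  # "_" marks an empty feature set in CoNLL-U
        return {}
    # Split on the first "=" only, so layered names like Number[psor] stay intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent NOUN feature combination reported on this page.
combo = parse_feats("Case=Nom|Number=Sing|Person=3")
print(combo)
```

Counting the keys of such mappings over all NOUN tokens would give per-feature totals like those listed above, and counting the whole strings would give the feature combinations.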

Relations

NOUN nodes are attached to their parents using 12 different relations: obl (19; 22% instances), compound (18; 21% instances), obj (14; 16% instances), nsubj (12; 14% instances), conj (6; 7% instances), discourse (5; 6% instances), nmod (5; 6% instances), root (4; 5% instances), amod (1; 1% instances), fixed (1; 1% instances), flat (1; 1% instances), parataxis (1; 1% instances)

Parents of NOUN nodes belong to 7 different parts of speech: VERB (52; 60% instances), NOUN (19; 22% instances), ADJ (8; 9% instances), [root, no parent] (4; 5% instances), PRON (2; 2% instances), ADP (1; 1% instances), PROPN (1; 1% instances)
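Counts of this kind can be derived from the DEPREL and HEAD columns of a CoNLL-U file. The following is a rough sketch rather than the script that generated this page: the sample sentence is an invented toy example, not taken from the treebank, and the function names and the `ROOT` label for tokens whose head is 0 are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy sentence in CoNLL-U format (NOT taken from the treebank).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# text = Hoca dersi anlattı .",
    "1\tHoca\thoca\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "2\tdersi\tders\tNOUN\t_\t_\t3\tobj\t_\t_",
    "3\tanlattı\tanlat\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
])


def read_sentences(text):
    """Yield each sentence as a list of column lists.

    Comment lines are skipped, as are multiword-token ranges (IDs like 1-2)
    and empty nodes (IDs like 1.1), because their IDs are not plain integers.
    """
    rows = []
    for line in text.splitlines():
        if not line.strip():
            if rows:
                yield rows
                rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows


def noun_attachments(text, upos="NOUN"):
    """Count incoming relations and parent UPOS tags for tokens with `upos`."""
    relations, parent_pos = Counter(), Counter()
    for rows in read_sentences(text):
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1
                # HEAD "0" has no entry, so root tokens are labelled "ROOT".
                parent_pos[pos_by_id.get(r[6], "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = noun_attachments(SAMPLE)
print(relations.most_common())
print(parent_pos.most_common())
```

On the toy sentence both nouns attach to the verb, one as `nsubj` and one as `obj`. Run over the full treebank file, the two counters would correspond to the relation and parent part-of-speech lists above.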

33 (38%) NOUN nodes are leaves.

32 (37%) NOUN nodes have one child.

11 (13%) NOUN nodes have two children.

11 (13%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.
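The child degree of a node is the number of tokens whose HEAD points to it. A minimal sketch of the computation is shown below, using an invented toy tree given as `(id, upos, head)` triples; the data and the function name `child_degrees` are hypothetical and only illustrate the idea.

```python
from collections import Counter

# Hypothetical toy tree as (id, upos, head) triples; head 0 marks the root.
# This is NOT data from the treebank.
TOKENS = [
    (1, "DET", 3),
    (2, "ADJ", 3),
    (3, "NOUN", 4),
    (4, "VERB", 0),
    (5, "NOUN", 4),
]


def child_degrees(tokens, upos="NOUN"):
    """Map each token id with the given UPOS to its number of children."""
    # Each head id is counted once per token that attaches to it.
    n_children = Counter(head for _, _, head in tokens)
    # A Counter returns 0 for ids that never appear as a head (leaves).
    return {tid: n_children[tid] for tid, pos, _ in tokens if pos == upos}


degrees = child_degrees(TOKENS)
leaves = sum(1 for d in degrees.values() if d == 0)
highest = max(degrees.values())
print(degrees, leaves, highest)
```

In the figures reported above, the four buckets (33 leaves, 32 with one child, 11 with two, 11 with three or more) add up to the 87 NOUN tokens.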

Children of NOUN nodes are attached using 17 different relations: det (23; 25% instances), amod (16; 17% instances), nmod (10; 11% instances), punct (10; 11% instances), compound (5; 5% instances), acl (4; 4% instances), case (4; 4% instances), obj (4; 4% instances), advmod (3; 3% instances), conj (3; 3% instances), nummod (3; 3% instances), obl (2; 2% instances), cc (1; 1% instances), discourse (1; 1% instances), mark (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)

Children of NOUN nodes belong to 12 different parts of speech: DET (23; 25% instances), NOUN (19; 21% instances), ADJ (16; 17% instances), PUNCT (10; 11% instances), VERB (7; 8% instances), ADP (4; 4% instances), PROPN (4; 4% instances), NUM (3; 3% instances), ADV (2; 2% instances), PRON (2; 2% instances), CCONJ (1; 1% instances), SCONJ (1; 1% instances)