home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish_English-BUTR: POS Tags: NOUN

There are 71 NOUN lemmas (29%), 76 NOUN types (27%) and 78 NOUN tokens (20%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: şey, hoca, ders, şarkı, aile, akıl, akşam, ara, aspect, ayak

The 10 most frequent NOUN types: şey, Aklıma, Bro, Canım, Derse, Dünyanın, Guys, Hoca, Hoca’nın, Kafamda

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.070423 (the average of all parts of speech is 1.152263).

The 1st highest number of forms (3) was observed with the lemma “hoca”: Hoca, Hoca’nın, hocayı.

The 2nd highest number of forms (2) was observed with the lemma “ders”: Derse, dersini.

The 3rd highest number of forms (2) was observed with the lemma “şarkı”: Şarkıyı, şarkısı.

NOUN occurs with 5 features: Number (65; 83% instances), Case (54; 69% instances), Person (52; 67% instances), Number[psor] (16; 21% instances), Person[psor] (16; 21% instances)

NOUN occurs with 15 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Sing, Person=1, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 24 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (23 tokens). Examples: şey, Hoca, Kanka, aile, akşam, cümle, diziydi, gece, görev, kavga

Relations

NOUN nodes are attached to their parents using 12 different relations: compound (18; 23% instances), obl (16; 21% instances), obj (14; 18% instances), nsubj (10; 13% instances), discourse (5; 6% instances), nmod (5; 6% instances), root (4; 5% instances), conj (2; 3% instances), amod (1; 1% instances), fixed (1; 1% instances), flat (1; 1% instances), parataxis (1; 1% instances)

Parents of NOUN nodes belong to 7 different parts of speech: VERB (48; 62% instances), NOUN (18; 23% instances), ADJ (4; 5% instances), (4; 5% instances), PRON (2; 3% instances), ADP (1; 1% instances), PROPN (1; 1% instances)

32 (41%) NOUN nodes are leaves.

26 (33%) NOUN nodes have one child.

10 (13%) NOUN nodes have two children.

10 (13%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.

Children of NOUN nodes are attached using 16 different relations: det (22; 28% instances), amod (10; 13% instances), nmod (10; 13% instances), punct (7; 9% instances), compound (6; 8% instances), acl (4; 5% instances), obj (4; 5% instances), advmod (3; 4% instances), case (3; 4% instances), nummod (3; 4% instances), conj (2; 3% instances), obl (2; 3% instances), cc (1; 1% instances), discourse (1; 1% instances), mark (1; 1% instances), nsubj (1; 1% instances)

Children of NOUN nodes belong to 12 different parts of speech: DET (22; 28% instances), NOUN (18; 23% instances), ADJ (10; 13% instances), PUNCT (7; 9% instances), VERB (7; 9% instances), ADP (4; 5% instances), NUM (3; 4% instances), PROPN (3; 4% instances), ADV (2; 3% instances), PRON (2; 3% instances), CCONJ (1; 1% instances), SCONJ (1; 1% instances)