Treebank Statistics: UD_Turkish_English-BUTR: POS Tags: NOUN
There are 71 NOUN lemmas (29%), 76 NOUN types (27%) and 78 NOUN tokens (20%).
Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: şey, hoca, ders, şarkı, aile, akıl, akşam, ara, aspect, ayak
The 10 most frequent NOUN types: şey, Aklıma, Bro, Canım, Derse, Dünyanın, Guys, Hoca, Hoca’nın, Kafamda
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN is 1.070423 (the average of all parts of speech is 1.152263).
The 1st highest number of forms (3) was observed with the lemma “hoca”: Hoca, Hoca’nın, hocayı.
The 2nd highest number of forms (2) was observed with the lemma “ders”: Derse, dersini.
The 3rd highest number of forms (2) was observed with the lemma “şarkı”: Şarkıyı, şarkısı.
NOUN occurs with 5 features: Number (65; 83% instances), Case (54; 69% instances), Person (52; 67% instances), Number[psor] (16; 21% instances), Person[psor] (16; 21% instances)
NOUN occurs with 15 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Sing, Person=1, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3
NOUN occurs with 24 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (23 tokens).
Examples: şey, Hoca, Kanka, aile, akşam, cümle, diziydi, gece, görev, kavga
Relations
NOUN nodes are attached to their parents using 12 different relations: compound (18; 23% instances), obl (16; 21% instances), obj (14; 18% instances), nsubj (10; 13% instances), discourse (5; 6% instances), nmod (5; 6% instances), root (4; 5% instances), conj (2; 3% instances), amod (1; 1% instances), fixed (1; 1% instances), flat (1; 1% instances), parataxis (1; 1% instances)
Parents of NOUN nodes belong to 7 different parts of speech: VERB (48; 62% instances), NOUN (18; 23% instances), ADJ (4; 5% instances), (4; 5% instances), PRON (2; 3% instances), ADP (1; 1% instances), PROPN (1; 1% instances)
32 (41%) NOUN nodes are leaves.
26 (33%) NOUN nodes have one child.
10 (13%) NOUN nodes have two children.
10 (13%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 4.
Children of NOUN nodes are attached using 16 different relations: det (22; 28% instances), amod (10; 13% instances), nmod (10; 13% instances), punct (7; 9% instances), compound (6; 8% instances), acl (4; 5% instances), obj (4; 5% instances), advmod (3; 4% instances), case (3; 4% instances), nummod (3; 4% instances), conj (2; 3% instances), obl (2; 3% instances), cc (1; 1% instances), discourse (1; 1% instances), mark (1; 1% instances), nsubj (1; 1% instances)
Children of NOUN nodes belong to 12 different parts of speech: DET (22; 28% instances), NOUN (18; 23% instances), ADJ (10; 13% instances), PUNCT (7; 9% instances), VERB (7; 9% instances), ADP (4; 5% instances), NUM (3; 4% instances), PROPN (3; 4% instances), ADV (2; 3% instances), PRON (2; 3% instances), CCONJ (1; 1% instances), SCONJ (1; 1% instances)