home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Kenet: POS Tags: X

There are 96 X lemmas (1%), 146 X types (0%) and 451 X tokens (0%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: ne, gibi, pırıl, biri, o, ben, bu, didik, kim, sen

The 10 most frequent X types: neler, gibiydi, nedir, pırıl, gibidir, oysa, biridir, biriydi, didik, tefek

The 10 most frequent ambiguous lemmas: ne (ADV 211, CCONJ 165, ADJ 156, PRON 123, X 61, NOUN 1), gibi (ADP 938, X 46, NOUN 1), biri (PRON 175, X 21), o (PRON 1099, DET 516, X 20), ben (PRON 862, NOUN 172, PROPN 11, X 11), bu (DET 1795, PRON 625, X 11), kim (PRON 119, X 9, PROPN 8, NOUN 1), sen (PRON 374, X 9), biz (PRON 421, NOUN 68, X 8), için (ADP 683, X 7)

The 10 most frequent ambiguous types: neler (X 24, NOUN 1), gibiydi (X 24, ADP 2), biridir (X 9, ADJ 2, NUM 2), biriydi (X 9, ADJ 1, NUM 1), benim (PRON 92, NOUN 27, X 6), bizim (PRON 76, NOUN 13, X 5), seninki (X 3, ADJ 1), kimseler (NOUN 6, X 3), biriyim (X 2, ADJ 1), hepimiz (PRON 4, X 2)

Morphology

The form / lemma ratio of X is 1.520833 (the average of all parts of speech is 2.835413).

The 1st highest number of forms (9) was observed with the lemma “ne”: Neymiş, nedense, nedir, neler, nelerdir, nesiydi, neydi, neyse, neysek.

The 2nd highest number of forms (7) was observed with the lemma “o”: Ondaki, odur, onlarmış, onunki, oydu, oysa, oyuz.

The 3rd highest number of forms (6) was observed with the lemma “kim”: Kiminiz, kimdi, kimler, kimlerdir, kimseler, kimsin.

X occurs with 1 features: Number (242; 54% instances)

X occurs with 2 feature-value pairs: Number=Plur, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is _ (209 tokens). Examples: pırıl, didik, tefek, fıldır, harıl, cıvıl, deşik, fıkır, tiril, çıtır

Relations

X nodes are attached to their parents using 18 different relations: compound (107; 24% instances), root (103; 23% instances), amod (40; 9% instances), obl (33; 7% instances), nmod (32; 7% instances), parataxis (28; 6% instances), case (21; 5% instances), obj (20; 4% instances), advcl (18; 4% instances), nsubj (18; 4% instances), conj (11; 2% instances), ccomp (10; 2% instances), discourse (5; 1% instances), acl (1; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances)

Parents of X nodes belong to 7 different parts of speech: VERB (161; 36% instances), (103; 23% instances), NOUN (84; 19% instances), X (81; 18% instances), ADJ (17; 4% instances), PRON (3; 1% instances), ADV (2; 0% instances)

162 (36%) X nodes are leaves.

141 (31%) X nodes have one child.

55 (12%) X nodes have two children.

93 (21%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 24 different relations: punct (136; 24% instances), compound (110; 19% instances), nsubj (93; 16% instances), obl (33; 6% instances), nmod (27; 5% instances), advmod (21; 4% instances), parataxis (18; 3% instances), acl (17; 3% instances), advcl (15; 3% instances), obj (13; 2% instances), cc (11; 2% instances), amod (10; 2% instances), ccomp (9; 2% instances), conj (9; 2% instances), csubj (9; 2% instances), case (8; 1% instances), xcomp (7; 1% instances), vocative (6; 1% instances), discourse (5; 1% instances), det (4; 1% instances), appos (3; 1% instances), mark (3; 1% instances), dep (1; 0% instances), iobj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: NOUN (158; 28% instances), PUNCT (136; 24% instances), X (81; 14% instances), VERB (72; 13% instances), ADJ (43; 8% instances), ADV (21; 4% instances), PRON (19; 3% instances), CCONJ (14; 2% instances), ADP (7; 1% instances), PROPN (6; 1% instances), INTJ (4; 1% instances), SCONJ (4; 1% instances), DET (3; 1% instances), NUM (1; 0% instances)