home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-PUD: POS Tags: X

There are 76 X lemmas (2%), 82 X types (1%) and 104 X tokens (1%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: the, _, of, North, Association, Mps, My, News, Really, Street

The 10 most frequent X types: the, of, North, Association, My, News, Really, Uber, You, ‘da

The 10 most frequent ambiguous lemmas: the (X 5, PROPN 1), _ (ADJ 133, NOUN 83, AUX 68, PUNCT 62, NUM 30, PROPN 26, VERB 19, ADV 15, ADP 7, PRON 5, X 5, SYM 4, DET 1), Uber (X 2, PROPN 1), America (PROPN 1, X 1), de (ADV 61, VERB 29, PROPN 5, ADJ 1, NOUN 1, X 1), her (DET 31, X 1), son (ADJ 28, NOUN 8, X 1)

The 10 most frequent ambiguous types: the (X 4, PROPN 1), Uber (X 2, PROPN 1), America (PROPN 1, X 1), Her (DET 5, X 1), Son (ADJ 6, X 1), de (ADV 61, PROPN 4, X 1)

Morphology

The form / lemma ratio of X is 1.078947 (the average of all parts of speech is 1.517471).

The 1st highest number of forms (5) was observed with the lemma “_”: Anyway”in, Heart”ı, Open’daki, in, lerin.

The 2nd highest number of forms (2) was observed with the lemma “Mps”: Mps, Mps’ye.

The 3rd highest number of forms (2) was observed with the lemma “Street”: Street, Street’te.

X occurs with 6 features: Case (13; 13% instances), Number (10; 10% instances), Foreign (4; 4% instances), Definite (1; 1% instances), Person (1; 1% instances), Polarity (1; 1% instances)

X occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Definite=Def, Foreign=Yes, Number=Plur, Number=Sing, Person=3, Polarity=Pos

X occurs with 13 feature combinations. The most frequent feature combination is _ (82 tokens). Examples: the, of, North, Association, My, Really, America, Associated, Breaking, Casa

Relations

X nodes are attached to their parents using 13 different relations: flat (61; 59% instances), nmod:poss (15; 14% instances), appos (5; 5% instances), compound (4; 4% instances), obl (4; 4% instances), conj (3; 3% instances), nsubj (3; 3% instances), obj (3; 3% instances), iobj (2; 2% instances), amod (1; 1% instances), case (1; 1% instances), nmod (1; 1% instances), orphan (1; 1% instances)

Parents of X nodes belong to 6 different parts of speech: X (43; 41% instances), NOUN (28; 27% instances), PROPN (19; 18% instances), VERB (11; 11% instances), ADJ (2; 2% instances), NUM (1; 1% instances)

66 (63%) X nodes are leaves.

17 (16%) X nodes have one child.

4 (4%) X nodes have two children.

17 (16%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 11 different relations: flat (58; 60% instances), punct (18; 19% instances), appos (4; 4% instances), compound (4; 4% instances), conj (4; 4% instances), acl (3; 3% instances), amod (1; 1% instances), case (1; 1% instances), cc (1; 1% instances), cop (1; 1% instances), nummod (1; 1% instances)

Children of X nodes belong to 9 different parts of speech: X (43; 45% instances), PROPN (18; 19% instances), PUNCT (18; 19% instances), NOUN (9; 9% instances), ADJ (3; 3% instances), NUM (2; 2% instances), ADP (1; 1% instances), AUX (1; 1% instances), CCONJ (1; 1% instances)