
This page pertains to UD version 2.

Treebank Statistics: UD_Indonesian-GSD: POS Tags: X

There are 25 X lemmas (0%), 25 X types (0%) and 39 X tokens (0%). Out of 16 observed tags, the rank of X is: 14 in number of lemmas, 14 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: jpg, an, www, anti, b, dkk, s, x, y, &nbsp

The 10 most frequent X types: jpg, an, www, anti, b, dkk, s, x, y, &nbsp

The 10 most frequent ambiguous lemmas: jpg (X 5, PROPN 4), an (NUM 36, DET 11, PROPN 10, X 3), anti (NOUN 7, ADJ 5, X 2, PROPN 1), b (NOUN 10, PROPN 7, X 2), dkk (X 2, PROPN 1), s (PROPN 7, ADP 2, X 2, NOUN 1), x (PROPN 13, X 2, NUM 1), y (PROPN 3, X 2), &nbsp (NOUN 1, PUNCT 1, X 1), d (NOUN 6, ADP 2, PROPN 1, X 1)

The 10 most frequent ambiguous types: an (NUM 36, DET 11, PROPN 9, X 3), anti (NOUN 7, ADJ 5, X 2), b (NOUN 2, X 2, PROPN 1), s (ADP 2, X 2, NOUN 1), x (PROPN 5, X 2), &nbsp (NOUN 1, PUNCT 1, X 1), d (ADP 2, NOUN 1, PROPN 1, X 1), di (ADP 2200, VERB 11, CCONJ 8, SCONJ 3, PROPN 2, X 1), ke (ADP 356, NUM 62, DET 9, VERB 1, X 1), org (NOUN 1, X 1)
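As a rough illustration of how such ambiguity lists could be derived, the sketch below groups tokens by lemma and keeps the lemmas that carry X together with at least one other tag. The function name `ambiguous_lemmas` and the toy token list are hypothetical; this is not the script that generated this page.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, upos="X"):
    """Return {lemma: {tag: count}} for lemmas seen with `upos` and another tag.

    `tokens` is an iterable of (lemma, upos) pairs, e.g. read from the
    LEMMA and UPOS columns of a CoNLL-U file (input shape is an assumption).
    """
    tags_by_lemma = defaultdict(Counter)
    for lemma, tag in tokens:
        tags_by_lemma[lemma][tag] += 1
    return {
        lemma: dict(tags)
        for lemma, tags in tags_by_lemma.items()
        if upos in tags and len(tags) > 1
    }

# Toy data only; the counts are not taken from the treebank.
toy_tokens = [("jpg", "X"), ("jpg", "PROPN"), ("www", "X"), ("di", "ADP"), ("di", "VERB")]
toy_ambiguous = ambiguous_lemmas(toy_tokens)
```

In the toy data only `jpg` qualifies: `www` has a single tag, and `di` is ambiguous but never tagged X.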

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.045328).

The 1st highest number of forms (1) was observed with the lemma “&nbsp”: &nbsp.

The 2nd highest number of forms (1) was observed with the lemma “an”: an.

The 3rd highest number of forms (1) was observed with the lemma “anti”: anti.
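A ratio of exactly 1 is what one would expect when every X lemma occurs with a single form, as the three entries above suggest. The sketch below shows one plausible way to compute such a ratio, assuming it is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas; that definition, the function name `form_lemma_ratio`, and the toy data are assumptions, not taken from the tool that produced this page.

```python
from collections import defaultdict

def form_lemma_ratio(tokens, upos):
    """Distinct (lemma, form) pairs per distinct lemma for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) triples (assumed shape).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    n_pairs = sum(len(forms) for forms in forms_by_lemma.values())
    return n_pairs / len(forms_by_lemma)

# Toy data only: each X lemma has one form, so the ratio is 1.0.
toy_tokens = [("jpg", "jpg", "X"), ("jpg", "jpg", "X"), ("an", "an", "X"), ("di", "di", "ADP")]
toy_ratio = form_lemma_ratio(toy_tokens, "X")
```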

X occurs with 2 features: Degree (2; 5% instances), Number (2; 5% instances)

X occurs with 2 feature-value pairs: Degree=Pos, Number=Sing

X occurs with 2 feature combinations. The most frequent feature combination is _ (37 tokens). Examples: jpg, an, www, b, dkk, s, x, y, &nbsp, Duhai

Relations

X nodes are attached to their parents using 6 different relations: dep (33; 85% instances), nsubj (2; 5% instances), acl (1; 3% instances), appos (1; 3% instances), conj (1; 3% instances), obl (1; 3% instances)
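The percentages can be checked against the raw counts: the six relation counts sum to 39, which matches the number of X tokens. A minimal sketch of that arithmetic follows, assuming percentages are rounded half up to whole numbers (the rounding rule is an assumption).

```python
from collections import Counter

# Counts as reported on this page for the relations attaching X nodes to their parents.
relation_counts = Counter({"dep": 33, "nsubj": 2, "acl": 1, "appos": 1, "conj": 1, "obl": 1})
total = sum(relation_counts.values())

def percent(count, total):
    """Whole-number percentage, rounding half up (assumed rounding rule)."""
    return int(count * 100 / total + 0.5)

shares = {rel: percent(n, total) for rel, n in relation_counts.items()}
```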

Parents of X nodes belong to 6 different parts of speech: X (10; 26% instances), NOUN (9; 23% instances), PROPN (7; 18% instances), VERB (6; 15% instances), NUM (5; 13% instances), PUNCT (2; 5% instances)

29 (74%) X nodes are leaves.

6 (15%) X nodes have one child.

1 (3%) X node has two children.

3 (8%) X nodes have three or more children.

The highest child degree of an X node is 14.
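The four child-count groups above (29 + 6 + 1 + 3) account for all 39 X tokens. As a sketch of how such a breakdown could be computed from the ID and HEAD columns of a CoNLL-U file, the hypothetical function `child_degree_buckets` below counts the children of each node and buckets the nodes that carry the tag of interest; the input shape and bucket names are assumptions.

```python
from collections import Counter

def child_degree_buckets(sentences, upos="X"):
    """Bucket nodes tagged `upos` by their number of children.

    Each sentence is a list of (id, head, upos) triples, with head 0 for the root.
    """
    buckets = Counter()
    for sentence in sentences:
        # Number of children per node id, derived from the HEAD values.
        n_children = Counter(head for _id, head, _tag in sentence)
        for node_id, _head, tag in sentence:
            if tag != upos:
                continue
            degree = n_children[node_id]
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets

# Toy sentence only: node 1 has four children, node 3 has one, node 6 is a leaf.
toy_sentence = [(1, 0, "X"), (2, 1, "PUNCT"), (3, 1, "X"), (4, 3, "SYM"), (5, 1, "NOUN"), (6, 1, "X")]
toy_buckets = child_degree_buckets([toy_sentence])
```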

Children of X nodes are attached using 7 different relations: punct (16; 52% instances), dep (10; 32% instances), appos (1; 3% instances), conj (1; 3% instances), flat (1; 3% instances), nmod (1; 3% instances), nummod (1; 3% instances)

Children of X nodes belong to 6 different parts of speech: PUNCT (12; 39% instances), X (10; 32% instances), SYM (4; 13% instances), NOUN (2; 6% instances), PROPN (2; 6% instances), NUM (1; 3% instances)