
This page pertains to UD version 2.

Treebank Statistics: UD_Bokota-ChibErgIS: POS Tags: X

There are 6 X lemmas (2%), 6 X types (1%) and 18 X tokens (1%). Out of 15 observed tags, the rank of X is: 10 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent X lemmas: y, o, pero, porque, que, ute

The 10 most frequent X types: y, o, pero, porque, que, ute

The 10 most frequent ambiguous lemmas: ute (ADJ 2, X 1)

The 10 most frequent ambiguous types: ute (ADJ 2, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.193029).
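As a minimal sketch of how such a ratio could be computed, the snippet below assumes the form / lemma ratio is the average number of distinct word forms per lemma. That definition, the function name `form_lemma_ratio`, and the input layout are assumptions made for illustration; this page does not state how the figure is produced. The sample input uses the six X lemmas listed above, each with a single form identical to the lemma.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms_by_lemma = defaultdict(set)
    for lemma, form in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# The six X lemmas from this page; each occurs with exactly one form.
x_pairs = [(w, w) for w in ["y", "o", "pero", "porque", "que", "ute"]]
ratio = form_lemma_ratio(x_pairs)
print(f"{ratio:.6f}")
```

Under this assumed definition, six forms over six lemmas gives the 1.000000 reported for X.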

The 1st highest number of forms (1) was observed with the lemma “o”: o.

The 2nd highest number of forms (1) was observed with the lemma “pero”: pero.

The 3rd highest number of forms (1) was observed with the lemma “porque”: porque.

X occurs with 1 feature: Foreign (16; 89% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (16 tokens). Examples: y, pero, porque, que. The remaining 2 tokens carry no features.

Relations

X nodes are attached to their parents using 6 different relations: cc (13; 72% instances), compound (1; 6% instances), conj (1; 6% instances), discourse (1; 6% instances), obl:mod (1; 6% instances), root (1; 6% instances)
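The percentages in the relation list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of X tokens, rounded to the nearest whole percent; the rounding rule and the helper name `with_percentages` are assumptions, while the counts are copied from the list above.

```python
# Relation counts for X nodes, copied from this page.
relation_counts = {
    "cc": 13,
    "compound": 1,
    "conj": 1,
    "discourse": 1,
    "obl:mod": 1,
    "root": 1,
}


def with_percentages(counts):
    """Map each label to (count, share in whole percent of the total)."""
    total = sum(counts.values())
    return {label: (n, round(100 * n / total)) for label, n in counts.items()}


shares = with_percentages(relation_counts)
for label, (n, pct) in shares.items():
    print(f"{label} ({n}; {pct}% instances)")
```

Because each share is rounded independently, the printed percentages need not sum to exactly 100.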

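Parents of X nodes belong to 3 different parts of speech: VERB (11; 61% instances), NOUN (6; 33% instances), and an empty tag (1; 6% instances). The empty tag corresponds to the one X node attached as root, which has no parent word.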

15 (83%) X nodes are leaves.

2 (11%) X nodes have one child.

0 (0%) X nodes have two children.

1 (6%) X node has three or more children.

The highest child degree of an X node is 3.
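The child-degree breakdown above (leaves, one child, two children, three or more) could be derived from the head column of a dependency tree. The following is a sketch only: the `(id, head, upos)` tuple layout, the function name `child_degree_buckets`, and the four-token sentence are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def child_degree_buckets(tokens, upos):
    """Count tokens tagged `upos` by their number of children.

    tokens: list of (id, head, upos) tuples for one sentence; head 0 is root.
    Returns a Counter keyed by 0, 1, 2 or "3+".
    """
    # A token's child count is how often its id appears as a head.
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, tag in tokens:
        if tag != upos:
            continue
        degree = n_children[tok_id]
        buckets[degree if degree < 3 else "3+"] += 1
    return buckets


# Hypothetical sentence: token 1 (X) is a leaf, token 3 (X) has one child.
toy_sentence = [(1, 2, "X"), (2, 0, "VERB"), (3, 2, "X"), (4, 3, "NOUN")]
buckets = child_degree_buckets(toy_sentence, "X")
print(dict(buckets))
```

Summing such counters over every sentence in a treebank would give totals of the kind reported above.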

Children of X nodes are attached using 4 different relations: reparandum (2; 40% instances), aux (1; 20% instances), ccomp (1; 20% instances), punct (1; 20% instances)

Children of X nodes belong to 4 different parts of speech: VERB (2; 40% instances), AUX (1; 20% instances), NOUN (1; 20% instances), PUNCT (1; 20% instances)