home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PUD: POS Tags: X

There are 1 X lemmas (4%), 7 X types (0%) and 9 X tokens (0%). Out of 14 observed tags, the rank of X is: 14 in number of lemmas, 13 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: Bretanha, Dama, Olá, americano, americanos, coreano, voz

The 10 most frequent ambiguous lemmas: _ (NOUN 4636, ADP 2571, PUNCT 2547, VERB 2512, DET 2070, ADJ 1554, PROPN 1352, PRON 910, ADV 841, CCONJ 578, NUM 471, AUX 328, SYM 34, X 9)

The 10 most frequent ambiguous types: americano (ADJ 1, NOUN 1, X 1), americanos (ADJ 4, X 1), voz (NOUN 4, X 1)

Morphology

The form / lemma ratio of X is 7.000000 (the average of all parts of speech is 228.814815).

The 1st highest number of forms (7) was observed with the lemma “_”: Bretanha, Dama, Olá, americano, americanos, coreano, voz.

X does not occur with any features.

Relations

X nodes are attached to their parents using 2 different relations: compound (8; 89% instances), discourse (1; 11% instances)

Parents of X nodes belong to 4 different parts of speech: NOUN (3; 33% instances), PROPN (3; 33% instances), ADJ (2; 22% instances), VERB (1; 11% instances)

8 (89%) X nodes are leaves.

1 (11%) X nodes have one child.

The highest child degree of a X node is 1.

Children of X nodes are attached using 1 different relations: punct (1; 100% instances)

Children of X nodes belong to 1 different parts of speech: PUNCT (1; 100% instances)