
This page pertains to UD version 2.

Treebank Statistics: UD_Guajajara-TuDeT: POS Tags: NUM

There are 5 NUM lemmas (1%), 5 NUM types (0%) and 9 NUM tokens (0%). Out of 15 observed tags, the rank of NUM is: 12 in number of lemmas, 13 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: ru, mokoz, mukuz, pitai, pɨta

The 10 most frequent NUM types: naʔiruz, mokoz, Mukuz, napɨtaʔikwaw, pitei

The 10 most frequent ambiguous lemmas: ru (NUM 4, ADP 2), mokoz (NUM 2, NOUN 1), pɨta (VERB 9, NUM 1)

The 10 most frequent ambiguous types: none listed.

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.933709).

The 1st highest number of forms (1) was observed with the lemma “mokoz”: mokoz.

The 2nd highest number of forms (1) was observed with the lemma “mukuz”: Mukuz.

The 3rd highest number of forms (1) was observed with the lemma “pitai”: pitei.
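The form / lemma ratio can be illustrated with a minimal sketch, assuming the ratio is the average number of distinct forms per lemma. The function name `form_lemma_ratio` is hypothetical, and the input is limited to the three form/lemma pairs listed above rather than the full treebank.

```python
from collections import defaultdict

# (form, lemma) pairs taken from the three examples listed above.
pairs = [("mokoz", "mokoz"), ("Mukuz", "mukuz"), ("pitei", "pitai")]


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


ratio = form_lemma_ratio(pairs)
print(f"{ratio:.6f}")  # prints 1.000000
```

Each lemma here has exactly one form, so the ratio is 1, matching the figure reported for NUM.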

NUM occurs with 3 features: Polarity (5; 56% instances), Rel (4; 44% instances), Degree (1; 11% instances)

NUM occurs with 3 feature-value pairs: Degree=Dim, Polarity=Neg, Rel=NCont

NUM occurs with 3 feature combinations. The most frequent feature combination is _ (4 tokens). Examples: mokoz, Mukuz, pitei

Relations

NUM nodes are attached to their parents using 3 different relations: nummod (6; 67% instances), nmod (2; 22% instances), root (1; 11% instances)
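The percentages follow from the raw counts. As a rough sketch, assuming each share is the count divided by the total and rounded to the nearest whole percent (the helper name `shares` is hypothetical):

```python
def shares(counts):
    """Map each label to its share of the total, as a rounded percentage."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Relation counts for NUM nodes, as reported above.
relation_counts = {"nummod": 6, "nmod": 2, "root": 1}
print(shares(relation_counts))  # prints {'nummod': 67, 'nmod': 22, 'root': 11}
```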

Parents of NUM nodes belong to 2 different parts of speech: NOUN (8; 89% instances), [no tag; the artificial root node] (1; 11% instances)

8 (89%) NUM nodes are leaves.

0 (0%) NUM nodes have one child.

0 (0%) NUM nodes have two children.

1 (11%) NUM node has three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 4 different relations: discourse (2; 40% instances), obl (1; 20% instances), obl:subj (1; 20% instances), punct (1; 20% instances)

Children of NUM nodes belong to 4 different parts of speech: NOUN (2; 40% instances), PART (1; 20% instances), PRON (1; 20% instances), PUNCT (1; 20% instances)
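The child statistics above are mutually consistent: only one NUM node is a non-leaf, so all children belong to it, and the child counts should add up to the highest child degree. A small sketch of that check, using only the figures reported on this page (the function name `consistency_checks` is hypothetical):

```python
# Figures reported on this page.
num_tokens = 9
leaves, one_child, two_children, three_plus = 8, 0, 0, 1
max_child_degree = 5
child_relations = {"discourse": 2, "obl": 1, "obl:subj": 1, "punct": 1}
child_pos = {"NOUN": 2, "PART": 1, "PRON": 1, "PUNCT": 1}


def consistency_checks():
    """Return named checks mapped to True or False."""
    return {
        # Every NUM token falls into exactly one child-count bucket.
        "buckets_cover_tokens": leaves + one_child + two_children + three_plus == num_tokens,
        # With a single non-leaf node, all children hang from that node.
        "relations_match_degree": sum(child_relations.values()) == max_child_degree,
        "pos_match_degree": sum(child_pos.values()) == max_child_degree,
    }


for name, ok in consistency_checks().items():
    print(name, ok)
```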