home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tupinamba-TuDeT: POS Tags: NUM

There are 8 NUM lemmas (1%), 9 NUM types (0%) and 17 NUM tokens (0%). Out of 14 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: mokõj, mosapɨr, _, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk

The 10 most frequent NUM types: mokõj, mosapɨr, Quatro, cento, mosapɨ, mosapɨrɨ, oito, ojepe, ojoirunɨk

The 10 most frequent ambiguous lemmas: _ (NOUN 87, VERB 34, PUNCT 12, ADP 9, PRON 9, PROPN 9, PART 6, ADV 5, NUM 2, DET 1, X 1), ojepe (NOUN 1, NUM 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.125000 (the average of all parts of speech is 1.577170).

The 1st highest number of forms (2) was observed with the lemma “_”: Quatro, cento.

The 2nd highest number of forms (1) was observed with the lemma “mokõj”: mokõj.

The 3rd highest number of forms (1) was observed with the lemma “mosapɨ”: mosapɨ.

NUM occurs with 1 features: NumType (13; 76% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (13 tokens). Examples: mokõj, mosapɨr, mosapɨ, mosapɨrɨ, ojepe

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (12; 71% instances), root (2; 12% instances), appos (1; 6% instances), nmod (1; 6% instances), nsubj (1; 6% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (10; 59% instances), NUM (3; 18% instances), (2; 12% instances), PROPN (1; 6% instances), VERB (1; 6% instances)

13 (76%) NUM nodes are leaves.

0 (0%) NUM nodes have one child.

1 (6%) NUM nodes have two children.

3 (18%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 7 different relations: advmod (4; 29% instances), punct (4; 29% instances), nmod (2; 14% instances), appos (1; 7% instances), dep (1; 7% instances), nummod (1; 7% instances), obl (1; 7% instances)

Children of NUM nodes belong to 5 different parts of speech: ADV (4; 29% instances), PUNCT (4; 29% instances), NUM (3; 21% instances), NOUN (2; 14% instances), PART (1; 7% instances)