home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kiche-IU: POS Tags: NUM

There are 26 NUM lemmas (2%), 27 NUM types (1%) and 65 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: kebʼ, jun, oxibʼ, jobʼ, nabʼe, kajibʼ, ukabʼ, lajuj, uwuq, 2012

The 10 most frequent NUM types: kebʼ, jun, oxibʼ, jobʼ, nabʼe, kajibʼ, lajuj, ukabʼ, uwuq, 2012

The 10 most frequent ambiguous lemmas: jun (DET 86, NUM 11), nabʼe (NUM 4, ADV 2)

The 10 most frequent ambiguous types: jun (DET 79, NUM 11), nabʼe (NUM 3, ADV 1)

Morphology

The form / lemma ratio of NUM is 1.038462 (the average of all parts of speech is 1.617880).

The 1st highest number of forms (2) was observed with the lemma “ukabʼ”: Ukʼabʼ, ukabʼ.

The 2nd highest number of forms (1) was observed with the lemma “2012”: 2012.

The 3rd highest number of forms (1) was observed with the lemma “2019”: 2019.

NUM occurs with 1 features: NumType (11; 17% instances)

NUM occurs with 1 feature-value pairs: NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (54 tokens). Examples: kebʼ, jun, oxibʼ, jobʼ, kajibʼ, lajuj, 2012, 2019, 30.000, 300

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (38; 58% instances), nsubj (7; 11% instances), amod (5; 8% instances), conj (4; 6% instances), root (4; 6% instances), appos (3; 5% instances), dep (1; 2% instances), nmod (1; 2% instances), obj (1; 2% instances), obl (1; 2% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (49; 75% instances), NUM (8; 12% instances), (4; 6% instances), VERB (3; 5% instances), PART (1; 2% instances)

41 (63%) NUM nodes are leaves.

11 (17%) NUM nodes have one child.

10 (15%) NUM nodes have two children.

3 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 15 different relations: punct (10; 24% instances), det (5; 12% instances), nmod (5; 12% instances), amod (4; 10% instances), conj (4; 10% instances), nummod (3; 7% instances), nsubj (2; 5% instances), parataxis (2; 5% instances), acl (1; 2% instances), advmod (1; 2% instances), case (1; 2% instances), cc (1; 2% instances), dep (1; 2% instances), dep:agr (1; 2% instances), flat (1; 2% instances)

Children of NUM nodes belong to 11 different parts of speech: PUNCT (10; 24% instances), NUM (8; 19% instances), NOUN (7; 17% instances), ADJ (5; 12% instances), DET (5; 12% instances), VERB (2; 5% instances), ADP (1; 2% instances), ADV (1; 2% instances), CCONJ (1; 2% instances), PRON (1; 2% instances), PROPN (1; 2% instances)