home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bokota-ChibErgIS: POS Tags: NUM

There are 6 NUM lemmas (2%), 6 NUM types (1%) and 16 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: gda, ite, apa, dos, gwa, tres

The 10 most frequent NUM types: gda, ite, apa, dos, gwa, tres

The 10 most frequent ambiguous lemmas: gwa (ADV 6, NUM 1)

The 10 most frequent ambiguous types: gwa (ADV 6, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.193029).

The 1st highest number of forms (1) was observed with the lemma “apa”: apa.

The 2nd highest number of forms (1) was observed with the lemma “dos”: dos.

The 3rd highest number of forms (1) was observed with the lemma “gda”: gda.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 4 different relations: clf (5; 31% instances), nmod (4; 25% instances), obl:mod (4; 25% instances), nummod (3; 19% instances)

Parents of NUM nodes belong to 4 different parts of speech: NOUN (6; 38% instances), NUM (5; 31% instances), VERB (4; 25% instances), PROPN (1; 6% instances)

8 (50%) NUM nodes are leaves.

8 (50%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 3 different relations: clf (5; 63% instances), case (2; 25% instances), obj (1; 13% instances)

Children of NUM nodes belong to 3 different parts of speech: NUM (5; 63% instances), ADP (2; 25% instances), NOUN (1; 13% instances)