
This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: NUM

There are 20 NUM lemmas (1%), 29 NUM types (1%) and 68 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 14 in number of lemmas, 14 in number of types and 16 in number of tokens.
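
As a rough illustration of how such counts can be obtained, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. The function name `pos_counts` and the toy rows are hypothetical, and the assumption that forms are compared exactly as written (no case folding) is not confirmed by this page.

```python
def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in a CoNLL-U string."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared as written
            tokens += 1
    return len(lemmas), len(types), tokens


# Toy rows invented for illustration; they are not sentences from the treebank.
# Only the first four CoNLL-U columns (ID, FORM, LEMMA, UPOS) are filled in.
rows = [
    ("1", "lim", "lim", "NUM"),
    ("2", "limês", "lim", "NUM"),
    ("3", "mâːy", "mâːy", "NUM"),
    ("4", "word", "word", "NOUN"),
]
sample = "# text = toy example\n" + "\n".join("\t".join(r) for r in rows)

print(pos_counts(sample, "NUM"))  # (2, 3, 3)
```

Run over the whole treebank, a count of this kind would be expected to give the figures 20, 29 and 68 reported above, provided the counting conventions match.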

The 10 most frequent NUM lemmas: mâːy, nàmbóŋ, wupsə, lim, mbə́ɬəŋ, nandam, nàmbón, ɗàrí, dubu, tókndam

The 10 most frequent NUM types: mbə́ɬəŋ, mâːy, wupsə, lim, nàmbóŋ, nàmbón, ɗàrí, nandam, watsə́may, dubu

The most frequent ambiguous lemmas (only 3 occur): wupsə (NUM 8, DET 1), mbə́rgə̀ptəŋ (NOUN 6, NUM 2), kòːkàrí (NOUN 2, NUM 1, X 1)

The most frequent ambiguous types (only 4 occur): mbə́rgə̀ptəŋ (NOUN 6, NUM 2), kòːkàrí (NOUN 2, NUM 1, X 1), nàmbóɲi (ADJ 2, NUM 1), wumí (NUM 1, VERB 1)

Morphology

The form / lemma ratio of NUM is 1.450000 (the average of all parts of speech is 1.611418).
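
The ratio appears to be the number of distinct word forms divided by the number of distinct lemmas; with the 29 types and 20 lemmas reported on this page, that reading reproduces the stated value. A minimal check, where `form_lemma_ratio` is a hypothetical helper name:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms per distinct lemma (assumed definition)."""
    return n_types / n_lemmas


# 29 NUM types and 20 NUM lemmas, as reported on this page.
print(f"{form_lemma_ratio(29, 20):.6f}")  # 1.450000
```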

The 1st highest number of forms (4) was observed with the lemma “nàmbóŋ”: nàmbóŋ, nàmbóŋə́y, nàmbóɲi, nàmbóɲíː.

The 2nd highest number of forms (2) was observed with the lemma “dubu”: dubu, dubú.

The 3rd highest number of forms (2) was observed with the lemma “lim”: lim, limês.
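
The ranking above can be reproduced by grouping distinct forms under their lemma and sorting by group size. The sketch below uses only the form and lemma pairs listed on this page. The function name `rank_lemmas_by_forms` is hypothetical, and breaking ties alphabetically is an assumption that happens to agree with the order shown (dubu before lim).

```python
from collections import defaultdict


def rank_lemmas_by_forms(pairs):
    """Rank lemmas by how many distinct forms they have, largest first."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    # Assumption: ties are broken alphabetically by lemma.
    return sorted(
        ((lemma, len(fs)) for lemma, fs in forms.items()),
        key=lambda item: (-item[1], item[0]),
    )


# (form, lemma) pairs taken from the three lemmas listed above.
pairs = [
    ("nàmbóŋ", "nàmbóŋ"), ("nàmbóŋə́y", "nàmbóŋ"),
    ("nàmbóɲi", "nàmbóŋ"), ("nàmbóɲíː", "nàmbóŋ"),
    ("dubu", "dubu"), ("dubú", "dubu"),
    ("lim", "lim"), ("limês", "lim"),
]
ranking = rank_lemmas_by_forms(pairs)
for lemma, n_forms in ranking:
    print(lemma, n_forms)
```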

NUM occurs with 2 features: Definite (4; 6% instances), Deixis (1; 1% instances)

NUM occurs with 3 feature-value pairs: Definite=Def, Definite=Ind, Deixis=Remt

NUM occurs with 3 feature combinations. The most frequent feature combination is _ (64 tokens). Examples: mbə́ɬəŋ, mâːy, wupsə, lim, nàmbóŋ, nàmbón, ɗàrí, nandam, watsə́may, dubu

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (32; 47% instances), xcomp (6; 9% instances), flat (5; 7% instances), conj (4; 6% instances), obj (4; 6% instances), obl (4; 6% instances), compound:redup (3; 4% instances), root (3; 4% instances), dislocated (2; 3% instances), nmod (2; 3% instances), nsubj (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)
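
The percentages in this list are each relation's share of the 68 NUM tokens, rounded to a whole number. The following sketch recomputes them from the counts given above; the variable names are illustrative only.

```python
# Relation counts for NUM nodes, copied from the list above.
relations = {
    "nummod": 32, "xcomp": 6, "flat": 5, "conj": 4, "obj": 4, "obl": 4,
    "compound:redup": 3, "root": 3, "dislocated": 2, "nmod": 2,
    "nsubj": 1, "parataxis": 1, "vocative": 1,
}

total = sum(relations.values())  # should equal the number of NUM tokens
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The counts sum to 68, matching the token total, so every NUM token is covered by exactly one incoming relation.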

Parents of NUM nodes belong to 7 different parts of speech: NOUN (25; 37% instances), VERB (21; 31% instances), NUM (14; 21% instances), PART (3; 4% instances), no tag, presumably the artificial root, matching the 3 NUM nodes attached as root (3; 4% instances), PRON (1; 1% instances), X (1; 1% instances)

40 (59%) NUM nodes are leaves.

18 (26%) NUM nodes have one child.

7 (10%) NUM nodes have two children.

3 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.
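
The child-degree figures above sum to the 68 NUM tokens (40 + 18 + 7 + 3). One way such a distribution could be derived from the HEAD column of a sentence is sketched below. The function name `degree_buckets` and the toy head list are hypothetical, and filtering to nodes of one part of speech is left out for brevity.

```python
from collections import Counter


def degree_buckets(heads):
    """Bucket nodes by number of children.

    heads[i] is the 1-based head of token i + 1; 0 marks the root.
    Returns (buckets, highest child degree).
    """
    n_children = Counter(h for h in heads if h != 0)
    buckets = {"leaf": 0, "one": 0, "two": 0, "three_or_more": 0}
    max_degree = 0
    for node in range(1, len(heads) + 1):
        degree = n_children[node]  # Counter returns 0 for childless nodes
        max_degree = max(max_degree, degree)
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one"] += 1
        elif degree == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets, max_degree


# Toy tree invented for illustration: token 2 is the root,
# tokens 1 and 3 depend on it, and token 4 depends on token 3.
buckets, max_degree = degree_buckets([2, 0, 2, 3])
print(buckets, max_degree)
```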

Children of NUM nodes are attached using 13 different relations: case (8; 19% instances), punct (8; 19% instances), conj (6; 14% instances), flat (5; 12% instances), cc (4; 9% instances), compound:redup (3; 7% instances), discourse (2; 5% instances), nummod (2; 5% instances), acl (1; 2% instances), acl:relcl (1; 2% instances), advmod (1; 2% instances), dislocated (1; 2% instances), nmod (1; 2% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (14; 33% instances), ADP (9; 21% instances), PUNCT (8; 19% instances), X (3; 7% instances), CCONJ (2; 5% instances), VERB (2; 5% instances), INTJ (1; 2% instances), NOUN (1; 2% instances), PART (1; 2% instances), PRON (1; 2% instances), SCONJ (1; 2% instances)