
This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: NUM

There are 8 NUM lemmas (1%), 10 NUM types (1%) and 19 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 15 in number of lemmas, 15 in number of types and 16 in number of tokens.

All 8 NUM lemmas, most frequent first: lim, nàmbón, ɗàrí, dubu, nandam, nàmbóŋ, hàmsə́n, wupsə

The 10 most frequent NUM types: nàmbón, lim, ɗàrí, dubu, nandam, hàmsə́n, limês, nàmbóŋə́y, nàmbóɲíː, wupsə

The 10 most frequent ambiguous lemmas: (none listed)

The 10 most frequent ambiguous types: (none listed)

Morphology

The form / lemma ratio of NUM is 1.250000 (the average of all parts of speech is 1.640000).
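The ratio can be reproduced from the lemma and type lists on this page. The following is a minimal sketch, not the script that generated these statistics. The names `forms_by_lemma` and `form_lemma_ratio` are hypothetical. The page states the forms only for lim, nàmbóŋ and dubu; the remaining pairings are an assumption, made by matching each lemma to the identical entry in the type list.

```python
# Forms observed per NUM lemma.
# Stated on this page: lim, nàmbóŋ, dubu.
# Assumed (lemma identical to the listed type): the other five entries.
forms_by_lemma = {
    "lim": {"lim", "limês"},
    "nàmbóŋ": {"nàmbóŋə́y", "nàmbóɲíː"},
    "dubu": {"dubu"},
    "nàmbón": {"nàmbón"},
    "ɗàrí": {"ɗàrí"},
    "nandam": {"nandam"},
    "hàmsə́n": {"hàmsə́n"},
    "wupsə": {"wupsə"},
}


def form_lemma_ratio(forms_by_lemma):
    """Return the average number of distinct forms per lemma."""
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


ratio = form_lemma_ratio(forms_by_lemma)
print(f"{ratio:.6f}")  # 10 forms / 8 lemmas -> 1.250000
```

With 10 forms spread over 8 lemmas, the result matches the 1.250000 reported above.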

The 1st highest number of forms (2) was observed with the lemma “lim”: lim, limês.

The 2nd highest number of forms (2) was observed with the lemma “nàmbóŋ”: nàmbóŋə́y, nàmbóɲíː.

The 3rd highest number of forms (1) was observed with the lemma “dubu”: dubu.

NUM occurs with 1 feature: Aspect (2; 11% instances)

NUM occurs with 1 feature-value pair: Aspect=Res

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (17 tokens). Examples: nàmbón, lim, ɗàrí, dubu, nandam, hàmsə́n, limês, wupsə

Relations

NUM nodes are attached to their parents using 7 different relations: flat (5; 26% instances), nummod (5; 26% instances), xcomp (4; 21% instances), obj (2; 11% instances), conj (1; 5% instances), dislocated (1; 5% instances), obl (1; 5% instances)
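The percentages in this list are each relation's count divided by the total number of NUM nodes, rounded to a whole percent. The sketch below recomputes them from the counts given above. The names `relation_counts` and `percentages` are hypothetical, and rounding to the nearest integer is an assumption about how the page was generated.

```python
# Counts of incoming relations for NUM nodes, copied from this page.
relation_counts = {
    "flat": 5,
    "nummod": 5,
    "xcomp": 4,
    "obj": 2,
    "conj": 1,
    "dislocated": 1,
    "obl": 1,
}


def percentages(counts):
    """Map each label to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


total_num_nodes = sum(relation_counts.values())
relation_share = percentages(relation_counts)

print(total_num_nodes)  # 19, the NUM token count reported on this page
print(relation_share)
```

The counts sum to 19, the number of NUM tokens. Because each share is rounded independently, the rounded percentages sum to 99 rather than 100.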

Parents of NUM nodes belong to 4 different parts of speech: VERB (8; 42% instances), NUM (6; 32% instances), NOUN (4; 21% instances), PRON (1; 5% instances)

9 (47%) NUM nodes are leaves.

8 (42%) NUM nodes have one child.

2 (11%) NUM nodes have two children.

The highest child degree of a NUM node is 2.
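The child-degree figures can be cross-checked against the child relation counts reported on this page. The sketch below is only a consistency check on the published numbers, with hypothetical variable names. It confirms that the degree distribution accounts for all 19 NUM nodes and implies 12 children in total, the same number as the child attachments listed by relation.

```python
# Number of NUM nodes having a given number of children (from this page).
degree_counts = {0: 9, 1: 8, 2: 2}

# Child relation counts for NUM nodes (from this page).
child_relation_counts = {
    "flat": 5,
    "nmod": 2,
    "punct": 2,
    "cc": 1,
    "conj": 1,
    "discourse": 1,
}

total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())
max_degree = max(degree for degree, n in degree_counts.items() if n > 0)

print(total_nodes)     # 19 NUM nodes
print(total_children)  # 8 * 1 + 2 * 2 = 12 children
print(max_degree)      # 2
print(total_children == sum(child_relation_counts.values()))  # True
```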

Children of NUM nodes are attached using 6 different relations: flat (5; 42% instances), nmod (2; 17% instances), punct (2; 17% instances), cc (1; 8% instances), conj (1; 8% instances), discourse (1; 8% instances)

Children of NUM nodes belong to 5 different parts of speech: NUM (6; 50% instances), PUNCT (2; 17% instances), X (2; 17% instances), CCONJ (1; 8% instances), PART (1; 8% instances)