This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: POS Tags: NUM

There are 207 NUM lemmas (4%), 207 NUM types (3%) and 353 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.
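Counts of this kind can be reproduced from a treebank's CoNLL-U file. The sketch below is a minimal illustration, not the script that generated this page. It assumes that "types" means distinct word forms (case-sensitive) and that multiword-token ranges and empty nodes are skipped. The function name `upos_counts` and the sample sentence are invented for the example.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips multiword-token ranges ("3-4") and empty nodes ("5.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: a "type" is a distinct form
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag])
            for tag in tokens}

# Invented sample data, not taken from UD_German-PUD.
SAMPLE = "\n".join([
    "# text = zwei Hunde und zwei Katzen",
    "1\tzwei\tzwei\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_",
    "2\tHunde\tHund\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tund\tund\tCCONJ\t_\t_\t5\tcc\t_\t_",
    "4\tzwei\tzwei\tNUM\t_\tNumType=Card\t5\tnummod\t_\t_",
    "5\tKatzen\tKatze\tNOUN\t_\t_\t2\tconj\t_\t_",
])

counts = upos_counts(SAMPLE)
print(counts["NUM"])   # (1, 1, 2): one lemma, one type, two tokens
```

Ranking the tags by each of the three numbers would then give the "rank of NUM" figures reported above.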

The 10 most frequent NUM lemmas: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

The 10 most frequent NUM types: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

The 10 most frequent ambiguous lemmas: zwei (NUM 22, DET 1, NOUN 1), 3 (NUM 7, ADJ 1), 1 (NUM 5, ADJ 1, NOUN 1), 10 (NUM 5, NOUN 1), sieben (NUM 3, NOUN 1), - (PUNCT 178, NUM 2, CCONJ 1), 16 (NUM 2, ADJ 1), 31 (NOUN 2, NUM 2), 6 (NUM 2, ADJ 1), 21 (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: 3 (NUM 7, ADJ 1), 1 (NUM 5, NOUN 1), - (PUNCT 178, NUM 2, CCONJ 1), 16 (NUM 2, ADJ 1), 31 (NOUN 2, NUM 2), 21. (ADJ 1, NUM 1), 4 (ADJ 1, NUM 1), III (NOUN 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.195641).
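One plausible reading of this ratio is the number of distinct forms divided by the number of distinct lemmas for the part of speech; for NUM that would be 207 / 207 = 1.0, which agrees with the figure above. The sketch below works under that assumption. The function name `form_lemma_ratio` and the example pairs are hypothetical.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for one part of speech.

    Assumed definition: distinct forms divided by distinct lemmas.
    """
    forms = set()
    lemmas = set()
    for form, lemma in pairs:
        forms.add(form)
        lemmas.add(lemma)
    return len(forms) / len(lemmas) if lemmas else 0.0

# Invented examples: numerals that never inflect, and nouns that do.
num_pairs = [("zwei", "zwei"), ("drei", "drei"), ("zwei", "zwei")]
noun_pairs = [("Hunde", "Hund"), ("Hund", "Hund"), ("Katzen", "Katze")]

print(f"{form_lemma_ratio(num_pairs):.6f}")   # 1.000000
print(f"{form_lemma_ratio(noun_pairs):.6f}")  # 1.500000
```

A ratio of exactly 1.0 means that no lemma of the class appears in more than one form, which is what one would expect for digits and uninflected cardinal numerals.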

No NUM lemma has more than one form. The first three lemmas listed, each with a single form, are “-” (-), “1” (1) and “1,165” (1,165).

NUM occurs with 6 features: NumType (352; over 99% of instances), Case (1; under 1% of instances), Degree (1; under 1% of instances), Gender (1; under 1% of instances), Number (1; under 1% of instances), Person (1; under 1% of instances)

NUM occurs with 6 feature-value pairs: Case=Acc, Degree=Pos, Gender=Masc, NumType=Card, Number=Plur, Person=3

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (352 tokens). Examples: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

Relations

NUM nodes are attached to their parents using 9 different relations: nummod (226; 64% of instances), obl:tmod (69; 20% of instances), obl (17; 5% of instances), nmod (13; 4% of instances), conj (12; 3% of instances), compound (10; 3% of instances), nsubj (4; 1% of instances), nsubj:pass (1; under 1% of instances), obj (1; under 1% of instances)
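The shares in this listing appear to be each count divided by the 353 NUM tokens, rounded to the nearest whole percent. The short check below recomputes them from the counts given above. The rounding rule is an assumption, and the variable names are illustrative.

```python
# Relation counts for NUM nodes, copied from the listing above.
counts = {
    "nummod": 226, "obl:tmod": 69, "obl": 17, "nmod": 13, "conj": 12,
    "compound": 10, "nsubj": 4, "nsubj:pass": 1, "obj": 1,
}

total = sum(counts.values())  # equals the number of NUM tokens

# Assumed rule: percentage rounded to the nearest integer.
shares = {rel: round(100 * n / total) for rel, n in counts.items()}

for rel, n in counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to 353, matching the token total reported for NUM. A single occurrence out of 353 is roughly 0.3%, which rounds down to zero under this rule.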

Parents of NUM nodes belong to 7 different parts of speech: NOUN (242; 69% of instances), VERB (66; 19% of instances), SYM (22; 6% of instances), NUM (16; 5% of instances), ADJ (4; 1% of instances), PROPN (2; 1% of instances), DET (1; under 1% of instances)

253 (72%) NUM nodes are leaves.

73 (21%) NUM nodes have one child.

17 (5%) NUM nodes have two children.

10 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
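The child-degree figures above can be derived by counting, for every NUM node, how many other nodes in the same sentence name it as their head. The sketch below illustrates this on an invented two-sentence input. It assumes each sentence has already been reduced to `(id, upos, head)` tuples, and the function name `child_degree_buckets` is hypothetical.

```python
from collections import Counter

def child_degree_buckets(sentences, upos="NUM"):
    """Bucket nodes with the given UPOS tag by number of children.

    sentences: list of sentences, each a list of (id, upos, head) tuples.
    Returns a Counter keyed by 0, 1, 2 and 3, where 3 means "three or more".
    """
    buckets = Counter()
    for sent in sentences:
        # How often each node ID is used as a head within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for node_id, tag, _ in sent:
            if tag == upos:
                buckets[min(n_children[node_id], 3)] += 1
    return buckets

# Invented sample: "etwa zwei Hunde" and "drei Katzen".
sentences = [
    [(1, "ADV", 2), (2, "NUM", 3), (3, "NOUN", 0)],
    [(1, "NUM", 2), (2, "NOUN", 0)],
]
buckets = child_degree_buckets(sentences)
print(buckets[0], buckets[1])  # 1 1: one leaf, one node with one child

# The figures reported above are consistent with whole-percent rounding.
reported = {0: 253, 1: 73, 2: 17, 3: 10}
total = sum(reported.values())
percents = {k: round(100 * v / total) for k, v in reported.items()}
```

With the reported counts, `total` is 353 and `percents` comes out as 72, 21, 5 and 3, matching the percentages listed above.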

Children of NUM nodes are attached using 13 different relations: advmod (38; 26% of instances), case (31; 22% of instances), punct (26; 18% of instances), cc (11; 8% of instances), conj (11; 8% of instances), nmod (11; 8% of instances), compound (5; 3% of instances), det (3; 2% of instances), cop (2; 1% of instances), nsubj (2; 1% of instances), obl:tmod (2; 1% of instances), acl:relcl (1; 1% of instances), cc:preconj (1; 1% of instances)

Children of NUM nodes belong to 11 different parts of speech: ADV (38; 26% of instances), ADP (31; 22% of instances), PUNCT (26; 18% of instances), NUM (16; 11% of instances), CCONJ (12; 8% of instances), PROPN (8; 6% of instances), NOUN (6; 4% of instances), DET (3; 2% of instances), AUX (2; 1% of instances), ADJ (1; 1% of instances), VERB (1; 1% of instances)