
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM

There are 583 NUM lemmas (4%), 596 NUM types (4%) and 3400 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
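Counts of this kind can be reproduced from a CoNLL-U file by collecting the distinct lemmas, the distinct word forms (types) and the number of tokens for one UPOS tag. The following is a minimal sketch, not the script that generated this page. The `SAMPLE` fragment is invented for illustration, and treating types as case-sensitive is an assumption.

```python
# Hypothetical CoNLL-U fragment (not taken from UD_Dutch-LassySmall).
SAMPLE = """\
# text = Twee boeken en twee pennen
1\tTwee\ttwee\tNUM\t_\t_\t2\tnummod\t_\t_
2\tboeken\tboek\tNOUN\t_\t_\t0\troot\t_\t_
3\ten\ten\tCCONJ\t_\t_\t5\tcc\t_\t_
4\ttwee\ttwee\tNUM\t_\t_\t5\tnummod\t_\t_
5\tpennen\tpen\tNOUN\t_\t_\t2\tconj\t_\t_
"""

def read_tokens(text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]

def tag_counts(text, tag):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    forms, lemmas, tokens = set(), set(), 0
    for form, lemma, upos in read_tokens(text):
        if upos == tag:
            forms.add(form)    # assumption: types are case-sensitive
            lemmas.add(lemma)
            tokens += 1
    return len(lemmas), len(forms), tokens

print(tag_counts(SAMPLE, "NUM"))  # (1, 2, 2)
```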

The 10 most frequent NUM lemmas: één, twee, 2004, 2006, 2005, 1, 2003, drie, 2, 2002

The 10 most frequent NUM types: twee, 2004, 2006, 2005, 1, 2003, één, drie, 2, een

The 10 most frequent ambiguous lemmas: één (ADJ 225, NUM 128, PROPN 6), twee (NUM 101, ADJ 45), 1 (NUM 84, ADJ 5, PROPN 1), drie (NUM 62, ADJ 21), 2 (NUM 55, ADJ 6, PROPN 1, SYM 1), 3 (NUM 37, ADJ 8), 2000 (NUM 36, PROPN 1), 5 (NUM 36, ADJ 1), 4 (NUM 35, ADJ 2), vijf (NUM 28, ADJ 4)

The 10 most frequent ambiguous types: 1 (NUM 84, PROPN 1), één (NUM 72, PROPN 6), 2 (NUM 55, PROPN 1, SYM 1), een (DET 1598, NUM 45), 2000 (NUM 36, PROPN 1), 10 (NUM 27, PROPN 1), 20 (NUM 25, SYM 1), 7 (NUM 24, SYM 1), vier (NUM 21, VERB 1), 8 (NUM 23, SYM 1)

Morphology

The form / lemma ratio of NUM is 1.022298 (the average of all parts of speech is 1.168496).
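The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given above. The short check below assumes that this is how the figure was derived; the variable names are illustrative only.

```python
# Figures reported on this page for NUM.
num_types = 596
num_lemmas = 583

# Assumption: the form / lemma ratio is distinct types divided by distinct lemmas.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.022298
```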

The highest number of forms (4) was observed with the lemma “één”: Eén, een, eentje, één.

The 2nd highest number of forms (2) was observed with the lemma “150”: 125-150, 150.

The 3rd highest number of forms (2) was observed with the lemma “1975”: 1955-1975, 1975.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 20 different relations: nummod (1062; 31% instances), obl (650; 19% instances), root (536; 16% instances), nmod (320; 9% instances), flat (250; 7% instances), parataxis (175; 5% instances), appos (131; 4% instances), conj (118; 3% instances), acl (62; 2% instances), fixed (30; 1% instances), nsubj (22; 1% instances), advcl (11; 0% instances), orphan (7; 0% instances), obj (6; 0% instances), acl:relcl (5; 0% instances), det (5; 0% instances), amod (4; 0% instances), xcomp (3; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances)
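The percentages in this list appear to be each relation's count divided by the total number of NUM tokens (3400), rounded to a whole number. A minimal sketch of that arithmetic for the three most frequent relations follows; the exact rounding rule used by the generator is an assumption.

```python
# Counts copied from the list above (three most frequent relations only).
parent_relations = {"nummod": 1062, "obl": 650, "root": 536}
total_num_tokens = 3400  # NUM token count reported for this treebank

def share(count, total):
    """Percentage of total, rounded to the nearest integer (assumed rounding)."""
    return round(100 * count / total)

shares = {rel: share(n, total_num_tokens) for rel, n in parent_relations.items()}
print(shares)  # {'nummod': 31, 'obl': 19, 'root': 16}
```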

Parents of NUM nodes belong to 13 different parts of speech: NOUN (1191; 35% instances), VERB (672; 20% instances), no parent, i.e. the NUM node is the root (536; 16% instances), PROPN (391; 12% instances), NUM (347; 10% instances), SYM (100; 3% instances), DET (67; 2% instances), ADJ (56; 2% instances), X (17; 1% instances), ADP (9; 0% instances), PRON (7; 0% instances), ADV (6; 0% instances), INTJ (1; 0% instances)

1212 (36%) NUM nodes are leaves.

1241 (37%) NUM nodes have one child.

400 (12%) NUM nodes have two children.

547 (16%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.
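The child-degree figures above can be derived from the HEAD column of a CoNLL-U file: count how many tokens point to each node, then tabulate those counts for nodes with the tag of interest. This is a sketch for a single sentence, using an invented fragment rather than treebank data; a full implementation would reset the counters at each sentence boundary.

```python
from collections import Counter

# Hypothetical one-sentence CoNLL-U fragment (not taken from the treebank).
SAMPLE = (
    "1\ttwee\ttwee\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\tboeken\tboek\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "3\tin\tin\tADP\t_\t_\t4\tcase\t_\t_\n"
    "4\t2004\t2004\tNUM\t_\t_\t2\tnmod\t_\t_\n"
)

def child_degrees(sentence, tag):
    """Map child count -> number of nodes with that tag having that many children."""
    children = Counter()  # head id -> number of dependents
    upos = {}             # token id -> UPOS tag
    for line in sentence.splitlines():
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip comments, multiword ranges and empty nodes
        upos[int(cols[0])] = cols[3]
        children[int(cols[6])] += 1
    # Counter returns 0 for nodes that never appear as a head (leaves).
    return Counter(children[i] for i, t in upos.items() if t == tag)

degrees = child_degrees(SAMPLE, "NUM")
print(degrees[0], degrees[1], max(degrees))  # 1 1 1
```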

Children of NUM nodes are attached using 23 different relations: punct (1273; 33% instances), case (953; 24% instances), parataxis (572; 15% instances), flat (512; 13% instances), nmod (159; 4% instances), conj (102; 3% instances), cc (63; 2% instances), amod (55; 1% instances), cop (49; 1% instances), nsubj (44; 1% instances), fixed (32; 1% instances), mark (14; 0% instances), advmod (13; 0% instances), appos (11; 0% instances), det (10; 0% instances), obl (10; 0% instances), acl:relcl (9; 0% instances), acl (3; 0% instances), advcl (2; 0% instances), cc:preconj (2; 0% instances), orphan (2; 0% instances), nmod:poss (1; 0% instances), nummod (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: PUNCT (1273; 33% instances), ADP (957; 25% instances), PROPN (742; 19% instances), NUM (347; 9% instances), NOUN (264; 7% instances), CCONJ (71; 2% instances), AUX (49; 1% instances), ADV (39; 1% instances), PRON (39; 1% instances), ADJ (26; 1% instances), DET (22; 1% instances), VERB (22; 1% instances), X (18; 0% instances), SCONJ (12; 0% instances), SYM (11; 0% instances)