home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM

There are 580 NUM lemmas (4%), 602 NUM types (4%) and 3407 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 9 in number of tokens.

The 10 most frequent NUM lemmas: één, twee, 2004, 2006, 2005, 1, 2003, drie, 2, 2002

The 10 most frequent NUM types: twee, 2004, 2006, 2005, 1, 2003, één, drie, 2, een

The 10 most frequent ambiguous lemmas: één (ADJ 228, NUM 128, PROPN 6), twee (NUM 101, ADJ 45), 1 (NUM 84, ADJ 5, PROPN 1), drie (NUM 62, ADJ 21), 2 (NUM 55, ADJ 6, PROPN 1, X 1), 3 (NUM 37, ADJ 8), 2000 (NUM 36, PROPN 1), 5 (NUM 36, ADJ 1), 4 (NUM 35, ADJ 2), vijf (NUM 28, ADJ 4)

The 10 most frequent ambiguous types: 1 (NUM 84, PROPN 1), één (NUM 72, PROPN 6), 2 (NUM 55, PROPN 1, X 1), een (DET 1598, NUM 45), 2000 (NUM 36, PROPN 1), 10 (NUM 27, PROPN 1), 20 (NUM 25, X 1), 7 (NUM 24, SYM 1), vier (NUM 21, VERB 1), 8 (NUM 23, SYM 1)

Morphology

The form / lemma ratio of NUM is 1.037931 (the average of all parts of speech is 1.174887).

The 1st highest number of forms (4) was observed with the lemma “één”: Eén, een, eentje, één.

The 2nd highest number of forms (3) was observed with the lemma “1992”: 1992, 1992-…, 1992-2004.

The 3rd highest number of forms (3) was observed with the lemma “1995”: 1988-1995, 1995, 1995-….

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 20 different relations: nummod (1025; 30% instances), obl (647; 19% instances), root (538; 16% instances), nmod (321; 9% instances), flat:name (255; 7% instances), parataxis (175; 5% instances), appos (130; 4% instances), conj (119; 3% instances), acl (62; 2% instances), det (40; 1% instances), fixed (29; 1% instances), nsubj (22; 1% instances), advcl (11; 0% instances), orphan (11; 0% instances), acl:relcl (6; 0% instances), obj (6; 0% instances), amod (5; 0% instances), nsubj:pass (3; 0% instances), ccomp (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 14 different parts of speech: NOUN (1240; 36% instances), VERB (636; 19% instances), (538; 16% instances), PROPN (392; 12% instances), NUM (347; 10% instances), SYM (83; 2% instances), DET (68; 2% instances), ADJ (67; 2% instances), X (11; 0% instances), ADP (10; 0% instances), PRON (7; 0% instances), ADV (6; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)

1219 (36%) NUM nodes are leaves.

1239 (36%) NUM nodes have one child.

398 (12%) NUM nodes have two children.

551 (16%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 23 different relations: punct (1275; 33% instances), case (952; 24% instances), parataxis (573; 15% instances), flat:name (510; 13% instances), nmod (153; 4% instances), conj (104; 3% instances), cc (67; 2% instances), amod (62; 2% instances), cop (51; 1% instances), nsubj (46; 1% instances), fixed (32; 1% instances), advmod (14; 0% instances), mark (14; 0% instances), appos (11; 0% instances), det (11; 0% instances), obl (10; 0% instances), acl:relcl (9; 0% instances), orphan (4; 0% instances), acl (3; 0% instances), advcl (2; 0% instances), aux (2; 0% instances), nmod:poss (1; 0% instances), nummod (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: PUNCT (1275; 33% instances), ADP (968; 25% instances), PROPN (747; 19% instances), NUM (347; 9% instances), NOUN (266; 7% instances), CCONJ (64; 2% instances), AUX (53; 1% instances), PRON (48; 1% instances), ADV (45; 1% instances), ADJ (28; 1% instances), VERB (22; 1% instances), DET (19; 0% instances), SYM (15; 0% instances), SCONJ (7; 0% instances), X (3; 0% instances)