home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM

There are 1026 NUM lemmas (4%), 1052 NUM types (3%) and 7702 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: één, twee, 1, drie, 2, 2004, 2005, vier, 3, 2003

The 10 most frequent NUM types: twee, één, 1, een, drie, 2, 2004, 2005, vier, 3

The 10 most frequent ambiguous lemmas: één (ADJ 576, NUM 371, PROPN 6), twee (NUM 317, ADJ 129), 1 (NUM 159, ADJ 13, PROPN 3, X 1), drie (NUM 151, ADJ 51), 2 (NUM 108, ADJ 14, X 6, SYM 2, PROPN 1), vier (NUM 94, ADJ 27), 3 (NUM 93, ADJ 11), 2003 (NUM 89, X 1), 10 (NUM 85, ADJ 5, PROPN 1, X 1), 5 (NUM 77, ADJ 1, PROPN 1, X 1)

The 10 most frequent ambiguous types: één (NUM 195, PROPN 6), 1 (NUM 159, PROPN 3, X 1), een (DET 5510, NUM 126, CCONJ 1), 2 (NUM 108, X 6, SYM 2, PROPN 1), vier (NUM 84, VERB 1), 2003 (NUM 89, X 1), 10 (NUM 85, PROPN 1, X 1), 5 (NUM 77, PROPN 1, X 1), 7 (NUM 77, SYM 1), 4 (NUM 76, X 5)

Morphology

The form / lemma ratio of NUM is 1.025341 (the average of all parts of speech is 1.223407).

The 1st highest number of forms (5) was observed with the lemma “één”: Eén, een, eentje, en, één.

The 2nd highest number of forms (3) was observed with the lemma “1975”: (1975), 1955-1975, 1975.

The 3rd highest number of forms (2) was observed with the lemma “150”: 125-150, 150.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 22 different relations: nummod (2502; 32% instances), obl (1808; 23% instances), nmod (733; 10% instances), root (653; 8% instances), flat (651; 8% instances), appos (346; 4% instances), conj (328; 4% instances), parataxis (229; 3% instances), fixed (89; 1% instances), nsubj (86; 1% instances), acl (65; 1% instances), advcl (39; 1% instances), obj (37; 0% instances), obl:arg (32; 0% instances), det (26; 0% instances), xcomp (19; 0% instances), orphan (18; 0% instances), nsubj:pass (15; 0% instances), amod (12; 0% instances), acl:relcl (11; 0% instances), ccomp (2; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 14 different parts of speech: NOUN (2836; 37% instances), VERB (1974; 26% instances), NUM (999; 13% instances), PROPN (711; 9% instances), (653; 8% instances), SYM (221; 3% instances), ADJ (112; 1% instances), DET (76; 1% instances), X (68; 1% instances), ADV (20; 0% instances), ADP (16; 0% instances), PRON (14; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)

3177 (41%) NUM nodes are leaves.

2440 (32%) NUM nodes have one child.

924 (12%) NUM nodes have two children.

1161 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.

Children of NUM nodes are attached using 25 different relations: case (2532; 30% instances), punct (1866; 22% instances), flat (1536; 18% instances), parataxis (663; 8% instances), nmod (379; 5% instances), conj (328; 4% instances), amod (218; 3% instances), fixed (215; 3% instances), cc (164; 2% instances), cop (94; 1% instances), nsubj (94; 1% instances), det (47; 1% instances), advmod (34; 0% instances), mark (32; 0% instances), appos (22; 0% instances), obl (19; 0% instances), acl:relcl (18; 0% instances), acl (14; 0% instances), advcl (11; 0% instances), nmod:poss (9; 0% instances), orphan (4; 0% instances), cc:preconj (3; 0% instances), aux (1; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (2537; 31% instances), PUNCT (1866; 22% instances), PROPN (1382; 17% instances), NUM (999; 12% instances), NOUN (541; 7% instances), CCONJ (220; 3% instances), ADV (153; 2% instances), PRON (109; 1% instances), AUX (95; 1% instances), ADJ (87; 1% instances), DET (79; 1% instances), SYM (72; 1% instances), X (70; 1% instances), VERB (69; 1% instances), SCONJ (26; 0% instances)