home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-Alpino: POS Tags: NUM

There are 744 NUM lemmas (3%), 751 NUM types (3%) and 3633 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: twee, drie, hoeveel, vier, een, 1, één, vijf, tien, zes

The 10 most frequent NUM types: twee, drie, hoeveel, een, vier, 1, vijf, tien, zes, 3

The 10 most frequent ambiguous lemmas: twee (NUM 265, ADJ 113, NOUN 1, PROPN 1), drie (NUM 188, ADJ 47), vier (NUM 111, ADJ 21), een (DET 4430, NUM 92, PRON 15), 1 (NUM 83, PROPN 7, ADJ 2), één (ADJ 265, NUM 82, PROPN 1), vijf (NUM 70, ADJ 18, NOUN 1), zes (NUM 60, ADJ 4, NOUN 2), 3 (NUM 50, ADJ 1), 2 (NUM 49, PROPN 4, ADJ 1, SYM 1)

The 10 most frequent ambiguous types: twee (NUM 244, PROPN 1), een (DET 4145, NUM 115, CCONJ 1), 1 (NUM 83, PROPN 7), zes (NUM 57, NOUN 1), 2 (NUM 49, PROPN 4, SYM 1), één (NUM 41, PROPN 1), acht (NUM 30, VERB 12, NOUN 1), 8 (NUM 26, SYM 1), 20 (NUM 18, PROPN 1), 7 (NUM 18, PROPN 1)

Morphology

The form / lemma ratio of NUM is 1.009409 (the average of all parts of speech is 1.214322).

The 1st highest number of forms (4) was observed with the lemma “één”: Eén, een, eentje, één.

The 2nd highest number of forms (3) was observed with the lemma “drie”: drie, drietjes, drieën.

The 3rd highest number of forms (2) was observed with the lemma “duizend”: duizend, duizenden.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 23 different relations: nummod (1761; 48% instances), obl (590; 16% instances), nmod (236; 6% instances), fixed (223; 6% instances), det (167; 5% instances), conj (134; 4% instances), appos (115; 3% instances), flat:name (99; 3% instances), nsubj (69; 2% instances), obj (56; 2% instances), root (52; 1% instances), parataxis (42; 1% instances), amod (21; 1% instances), nsubj:pass (17; 0% instances), advcl (15; 0% instances), orphan (11; 0% instances), advmod (8; 0% instances), acl:relcl (6; 0% instances), acl (3; 0% instances), ccomp (3; 0% instances), iobj (2; 0% instances), xcomp (2; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 14 different parts of speech: NOUN (2187; 60% instances), VERB (697; 19% instances), NUM (257; 7% instances), PROPN (211; 6% instances), ADJ (107; 3% instances), (52; 1% instances), ADV (32; 1% instances), X (30; 1% instances), SYM (26; 1% instances), PRON (16; 0% instances), DET (10; 0% instances), ADP (6; 0% instances), CCONJ (1; 0% instances), SCONJ (1; 0% instances)

2109 (58%) NUM nodes are leaves.

861 (24%) NUM nodes have one child.

423 (12%) NUM nodes have two children.

240 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 9.

Children of NUM nodes are attached using 25 different relations: case (766; 30% instances), punct (415; 16% instances), flat:name (233; 9% instances), nmod (226; 9% instances), fixed (225; 9% instances), amod (165; 6% instances), conj (147; 6% instances), cc (117; 5% instances), det (81; 3% instances), cop (37; 1% instances), nsubj (34; 1% instances), advcl (23; 1% instances), parataxis (23; 1% instances), mark (18; 1% instances), advmod (13; 1% instances), acl:relcl (11; 0% instances), acl (9; 0% instances), obl (8; 0% instances), nummod (6; 0% instances), orphan (5; 0% instances), nmod:poss (4; 0% instances), appos (2; 0% instances), aux (2; 0% instances), csubj (2; 0% instances), expl (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (808; 31% instances), PUNCT (415; 16% instances), NOUN (277; 11% instances), NUM (257; 10% instances), PROPN (205; 8% instances), DET (95; 4% instances), CCONJ (92; 4% instances), ADJ (86; 3% instances), PRON (77; 3% instances), ADV (67; 3% instances), SYM (64; 2% instances), VERB (51; 2% instances), AUX (39; 2% instances), X (28; 1% instances), SCONJ (12; 0% instances)