Treebank Statistics: UD_Dutch-Alpino: POS Tags: NUM
There are 706 NUM lemmas (3%), 719 NUM types (3%) and 3646 NUM tokens (2%).
Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
The 10 most frequent NUM lemmas: twee, drie, één, hoeveel, vier, 1, een, vijf, tien, zes
The 10 most frequent NUM types: twee, een, drie, hoeveel, vier, 1, vijf, tien, zes, 2
The 10 most frequent ambiguous lemmas: twee (NUM 268, ADJ 113, PROPN 1), drie (NUM 190, ADJ 47), één (ADJ 296, NUM 189, PROPN 1), vier (NUM 111, ADJ 20, NOUN 1), 1 (NUM 82, PROPN 8, ADJ 2), een (DET 4337, NUM 78, PRON 15), vijf (NUM 70, ADJ 18, NOUN 1), tien (NUM 68, NOUN 2), zes (NUM 60, ADJ 4, NOUN 2), 2 (NUM 50, PROPN 4, ADJ 1)
The 10 most frequent ambiguous types: twee (NUM 244, PROPN 1), een (DET 4062, NUM 198, CCONJ 1), 1 (NUM 82, PROPN 8), zes (NUM 57, NOUN 1), 2 (NUM 50, PROPN 4), 3 (NUM 49, PROPN 1), één (NUM 41, PROPN 1), acht (NUM 31, VERB 11, NOUN 1), zoveel (NUM 28, ADV 7), 10 (NUM 26, PROPN 1)
- twee
  - NUM 244: Van meer dan twee grastoernooien op rij wordt hij te moe .
  - PROPN 1: Kenmerkend is , dat de VPRO minder bezwaren had dan de TROS , misschien hierom , omdat de VPRO toch al nooit gewend was te freewheelen , terwijl de TROS gemakshalve een maandagavond op Nederland twee nog weleens als een weggevertje beschouwde .
- een
- 1
- zes
- 2
- 3
- één
- acht
- zoveel
- 10
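The ambiguous-type list above can be approximated by counting, for each word form, how often it occurs with each UPOS tag and keeping the forms that carry NUM plus at least one other tag. The sketch below is a minimal illustration, not the script that produced this page: the `tokens` data and the function name `ambiguous_types` are hypothetical, and it assumes that forms are compared case-sensitively and ranked by their NUM count, which is consistent with the figures shown above.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) pairs for illustration; not taken from the treebank.
tokens = [
    ("twee", "NUM"), ("twee", "NUM"), ("twee", "PROPN"),
    ("een", "DET"), ("een", "DET"), ("een", "NUM"),
    ("vier", "NUM"),
]

def ambiguous_types(pairs, tag="NUM"):
    """Return forms seen with `tag` and at least one other tag, most frequent first."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1  # case-sensitive: "Een" and "een" stay distinct
    ambiguous = [(f, c) for f, c in by_form.items() if tag in c and len(c) > 1]
    # Assumption: rank by the frequency of `tag` for that form, descending.
    return sorted(ambiguous, key=lambda item: -item[1][tag])

for form, counts in ambiguous_types(tokens):
    summary = ", ".join(f"{upos} {n}" for upos, n in counts.most_common())
    print(f"{form} ({summary})")
```

In this toy data "vier" is left out because it only ever appears as NUM.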
Morphology
The form / lemma ratio of NUM is 1.018414 (the average of all parts of speech is 1.221985).
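The NUM ratio is consistent with dividing the number of NUM types (719) by the number of NUM lemmas (706) reported at the top of this page. The following check is a small sketch under that assumption; the function name `form_lemma_ratio` is chosen here for illustration.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas

# Counts reported for NUM on this page.
ratio = form_lemma_ratio(n_types=719, n_lemmas=706)
print(f"{ratio:.6f}")  # 1.018414
```

A value this close to 1 means that NUM lemmas are almost never inflected in this treebank, in contrast to the average over all parts of speech.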
The highest number of forms (4) was observed with the lemma “drie”: drie, drieen, drietjes, drieën.
The same number of forms (4) was observed with the lemma “één”: Eén, een, eentje, één.
The next highest number of forms (3) was observed with the lemma “twee”: twee, tweeen, tweetjes.
NUM occurs with 1 feature: ExtPos (321; 9% instances)
NUM occurs with 3 feature-value pairs: ExtPos=ADP, ExtPos=PRON, ExtPos=PROPN
NUM occurs with 4 feature combinations.
The most frequent feature combination is _ (3325 tokens).
Examples: twee, een, drie, hoeveel, vier, vijf, tien, zes, 1969, één
Relations
NUM nodes are attached to their parents using 23 different relations: nummod (1978; 54% instances), obl (552; 15% instances), nmod (242; 7% instances), fixed (237; 7% instances), conj (129; 4% instances), appos (115; 3% instances), nsubj (72; 2% instances), flat (70; 2% instances), obj (57; 2% instances), parataxis (41; 1% instances), root (38; 1% instances), det (32; 1% instances), advcl (21; 1% instances), obl:arg (21; 1% instances), nsubj:pass (15; 0% instances), acl:relcl (6; 0% instances), orphan (6; 0% instances), acl (3; 0% instances), ccomp (3; 0% instances), xcomp (3; 0% instances), amod (2; 0% instances), iobj (2; 0% instances), obl:agent (1; 0% instances)
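The percentages in the relation list are each count divided by the total number of NUM tokens (3646), rounded to a whole percent. The sketch below copies the counts from the list above and checks that they add up to that total; the names `relation_counts` and `percent` are illustrative only.

```python
NUM_TOKENS = 3646  # total NUM tokens reported on this page

# Incoming relation counts, copied from the list above.
relation_counts = {
    "nummod": 1978, "obl": 552, "nmod": 242, "fixed": 237, "conj": 129,
    "appos": 115, "nsubj": 72, "flat": 70, "obj": 57, "parataxis": 41,
    "root": 38, "det": 32, "advcl": 21, "obl:arg": 21, "nsubj:pass": 15,
    "acl:relcl": 6, "orphan": 6, "acl": 3, "ccomp": 3, "xcomp": 3,
    "amod": 2, "iobj": 2, "obl:agent": 1,
}

def percent(count: int, total: int) -> int:
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)

# Every NUM node has exactly one incoming relation, so the counts sum to the total.
assert sum(relation_counts.values()) == NUM_TOKENS

for relation, count in relation_counts.items():
    print(f"{relation} ({count}; {percent(count, NUM_TOKENS)}% instances)")
```

Relations with fewer than about 18 instances round down to 0%, which is why the tail of the list shows 0% for nonzero counts.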
Parents of NUM nodes belong to 13 different parts of speech: NOUN (2065; 57% instances), VERB (689; 19% instances), NUM (253; 7% instances), PROPN (187; 5% instances), SYM (164; 4% instances), ADJ (91; 2% instances), X (82; 2% instances), [no parent: root] (38; 1% instances), ADV (29; 1% instances), PRON (27; 1% instances), DET (16; 0% instances), ADP (4; 0% instances), CCONJ (1; 0% instances)
2152 (59%) NUM nodes are leaves.
838 (23%) NUM nodes have one child.
417 (11%) NUM nodes have two children.
239 (7%) NUM nodes have three or more children.
The highest child degree of a NUM node is 9.
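The child-degree figures above (leaves, one child, two children, and so on) can be derived from the HEAD column of a CoNLL-U file. The sketch below is one possible way to do this, not the tool that generated this page: the `SAMPLE` fragment is a hypothetical two-sentence example rather than treebank data, and the function name `child_degrees` is made up for illustration.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment (10 tab-separated columns per token line).
SAMPLE = """\
1\ttwee\ttwee\tNUM\t_\t_\t2\tnummod\t_\t_
2\tboeken\tboek\tNOUN\t_\t_\t0\troot\t_\t_

1\tin\tin\tADP\t_\t_\t2\tcase\t_\t_
2\t1969\t1969\tNUM\t_\t_\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""

def child_degrees(conllu_text: str, upos: str = "NUM") -> Counter:
    """Map number of children -> number of `upos` nodes with that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if not cols[0].isdigit():
                continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
            rows.append(cols)
        # Column 7 (index 6) is HEAD: count how often each ID is someone's head.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:  # column 4 (index 3) is UPOS
                degrees[n_children[cols[0]]] += 1
    return degrees

print(dict(child_degrees(SAMPLE)))
```

In the sample, "twee" is a leaf and "1969" has two children (a preposition and a punctuation mark). On this page, the four reported groups (2152, 838, 417 and 239 nodes) add up to the 3646 NUM tokens.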
Children of NUM nodes are attached using 26 different relations: case (751; 30% instances), punct (375; 15% instances), fixed (259; 10% instances), flat (237; 9% instances), nmod (237; 9% instances), conj (142; 6% instances), cc (110; 4% instances), advmod (84; 3% instances), det (76; 3% instances), amod (75; 3% instances), cop (30; 1% instances), nsubj (28; 1% instances), nummod (19; 1% instances), parataxis (18; 1% instances), mark (17; 1% instances), advcl (16; 1% instances), acl (10; 0% instances), acl:relcl (10; 0% instances), nmod:poss (7; 0% instances), obl (5; 0% instances), orphan (5; 0% instances), cc:preconj (4; 0% instances), appos (2; 0% instances), aux (2; 0% instances), csubj (2; 0% instances), expl (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: ADP (770; 31% instances), PUNCT (375; 15% instances), NOUN (269; 11% instances), NUM (253; 10% instances), PROPN (206; 8% instances), CCONJ (134; 5% instances), DET (118; 5% instances), ADJ (86; 3% instances), ADV (78; 3% instances), PRON (67; 3% instances), SYM (50; 2% instances), VERB (43; 2% instances), AUX (32; 1% instances), X (27; 1% instances), SCONJ (14; 1% instances)