
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM

There are 1026 NUM lemmas (4%), 1052 NUM types (3%) and 7702 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.
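Counts like these can be reproduced from a CoNLL-U file by collecting the distinct lemmas, the distinct word forms (types) and the total tokens for one UPOS tag. The sketch below is only illustrative: the five-word sample is a made-up toy sentence, not taken from the treebank, the function name `count_upos` is hypothetical, and it assumes that types are compared case-sensitively.

```python
# Toy CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
# Invented for illustration; unused columns are left as "_".
rows = [
    ("1", "Twee", "twee", "NUM", "_", "_", "_", "_", "_", "_"),
    ("2", "en", "en", "CCONJ", "_", "_", "_", "_", "_", "_"),
    ("3", "twee", "twee", "NUM", "_", "_", "_", "_", "_", "_"),
    ("4", "is", "zijn", "AUX", "_", "_", "_", "_", "_", "_"),
    ("5", "vier", "vier", "NUM", "_", "_", "_", "_", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows)


def count_upos(conllu_text, upos="NUM"):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # assumption: case-sensitive word forms
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


print(count_upos(sample))
```

On the toy sample, "Twee" and "twee" count as two types but share one lemma, which is the kind of difference that separates the lemma and type counts reported here.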

The 10 most frequent NUM lemmas: één, twee, 1, drie, 2, 2004, 2005, vier, 3, 2003

The 10 most frequent NUM types: twee, één, 1, een, drie, 2, 2004, 2005, vier, 3

The 10 most frequent ambiguous lemmas: één (ADJ 575, NUM 371, PROPN 6), twee (NUM 317, ADJ 129), 1 (NUM 159, ADJ 13, PROPN 3, X 1), drie (NUM 151, ADJ 51), 2 (NUM 108, ADJ 14, X 6, SYM 2, PROPN 1), vier (NUM 94, ADJ 27), 3 (NUM 93, ADJ 11), 2003 (NUM 89, X 1), 10 (NUM 85, ADJ 5, PROPN 1, X 1), 5 (NUM 77, ADJ 1, PROPN 1, X 1)

The 10 most frequent ambiguous types: één (NUM 195, PROPN 6), 1 (NUM 159, PROPN 3, X 1), een (DET 5510, NUM 126, CCONJ 1), 2 (NUM 108, X 6, SYM 2, PROPN 1), vier (NUM 84, VERB 1), 2003 (NUM 89, X 1), 10 (NUM 85, PROPN 1, X 1), 5 (NUM 77, PROPN 1, X 1), 7 (NUM 77, SYM 1), 4 (NUM 76, X 5)

Morphology

The form / lemma ratio of NUM is 1.025341 (the average of all parts of speech is 1.223065).
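The reported ratio is consistent with dividing the number of distinct NUM types by the number of distinct NUM lemmas. A small check, assuming that definition and using the counts given on this page:

```python
# Counts taken from the NUM summary on this page.
num_types = 1052   # distinct NUM word forms
num_lemmas = 1026  # distinct NUM lemmas

# Assumption: the form / lemma ratio is types divided by lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")
```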

The 1st highest number of forms (5) was observed with the lemma “één”: Eén, een, eentje, en, één.

The 2nd highest number of forms (3) was observed with the lemma “1975”: (1975), 1955-1975, 1975.

The 3rd highest number of forms (2) was observed with the lemma “150”: 125-150, 150.

NUM occurs with 1 feature: ExtPos (609; 8% instances)

NUM occurs with 4 feature-value pairs: ExtPos=ADJ, ExtPos=ADP, ExtPos=PRON, ExtPos=PROPN

NUM occurs with 5 feature combinations. The most frequent feature combination is _ (7093 tokens). Examples: twee, één, een, drie, 1, 2004, 2005, vier, 2003, 2006
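The feature figures fit together arithmetically: the tokens carrying ExtPos plus the tokens with no features add up to the total NUM token count. A quick check, assuming that ExtPos is the only feature that ever appears on NUM (as the single-feature statement suggests):

```python
# Figures reported on this page.
tokens_with_extpos = 609       # NUM tokens carrying an ExtPos value
tokens_without_features = 7093  # NUM tokens whose feature column is "_"

# Assumption: every NUM token falls into exactly one of the two groups.
total = tokens_with_extpos + tokens_without_features
extpos_share = round(100 * tokens_with_extpos / total)

print(total, extpos_share)
```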

Relations

NUM nodes are attached to their parents using 22 different relations: nummod (2506; 33% instances), obl (1804; 23% instances), nmod (736; 10% instances), flat (679; 9% instances), root (653; 8% instances), appos (343; 4% instances), conj (322; 4% instances), parataxis (226; 3% instances), nsubj (86; 1% instances), fixed (72; 1% instances), acl (65; 1% instances), advcl (37; 0% instances), obj (37; 0% instances), obl:arg (34; 0% instances), det (25; 0% instances), orphan (20; 0% instances), xcomp (20; 0% instances), nsubj:pass (15; 0% instances), acl:relcl (11; 0% instances), amod (8; 0% instances), ccomp (2; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 14 different categories, counting the untagged artificial root as one: NOUN (2844; 37% instances), VERB (1967; 26% instances), NUM (1001; 13% instances), PROPN (705; 9% instances), [root, no tag] (653; 8% instances), SYM (228; 3% instances), ADJ (112; 1% instances), DET (74; 1% instances), X (66; 1% instances), ADV (20; 0% instances), ADP (16; 0% instances), PRON (14; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)

3185 (41%) NUM nodes are leaves.

2455 (32%) NUM nodes have one child.

900 (12%) NUM nodes have two children.

1162 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.
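The four child-degree buckets above partition all NUM tokens, so their counts should sum to the token total and the rounded percentages should match. A short verification using only the figures on this page (the dictionary keys are just labels chosen for this sketch):

```python
total_num_tokens = 7702  # NUM tokens reported on this page

# Number of NUM nodes by number of children.
degree_counts = {"0": 3185, "1": 2455, "2": 900, "3+": 1162}

# The buckets should cover every NUM node exactly once.
covered = sum(degree_counts.values())

# Percentage of NUM nodes in each bucket, rounded to whole percent.
percentages = {
    degree: round(100 * count / total_num_tokens)
    for degree, count in degree_counts.items()
}

print(covered, percentages)
```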

Children of NUM nodes are attached using 25 different relations: case (2532; 31% instances), punct (1851; 22% instances), flat (1544; 19% instances), parataxis (642; 8% instances), nmod (380; 5% instances), conj (328; 4% instances), fixed (207; 3% instances), amod (170; 2% instances), cc (165; 2% instances), cop (94; 1% instances), nsubj (94; 1% instances), advmod (79; 1% instances), det (46; 1% instances), mark (32; 0% instances), appos (22; 0% instances), obl (19; 0% instances), acl:relcl (17; 0% instances), acl (14; 0% instances), advcl (11; 0% instances), nmod:poss (9; 0% instances), orphan (4; 0% instances), cc:preconj (3; 0% instances), nummod (3; 0% instances), aux (1; 0% instances), csubj (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (2537; 31% instances), PUNCT (1851; 22% instances), PROPN (1374; 17% instances), NUM (1001; 12% instances), NOUN (526; 6% instances), CCONJ (220; 3% instances), ADV (166; 2% instances), PRON (107; 1% instances), AUX (95; 1% instances), ADJ (88; 1% instances), SYM (73; 1% instances), X (70; 1% instances), VERB (68; 1% instances), DET (66; 1% instances), SCONJ (26; 0% instances)