home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: NUM

There are 688 NUM lemmas (3%), 693 NUM types (2%) and 3962 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: to, tre, én, fire, eneste, 2, fem, ti, 20, seks

The 10 most frequent NUM types: to, tre, fire, eneste, ett, 2, fem, ti, 20, seks

The 10 most frequent ambiguous lemmas: to (NUM 356, X 11), tre (NUM 174, NOUN 10, VERB 7), 2 (NUM 84, PROPN 1), 3 (NUM 51, PROPN 1), hundre (NUM 15, NOUN 13), null (NUM 11, NOUN 1), tusen (NOUN 25, NUM 11), 32 (NUM 7, X 1), 34 (NUM 5, X 1), fire-fem (NUM 3, DET 1)

The 10 most frequent ambiguous types: to (NUM 331, X 11), tre (NUM 155, VERB 3, NOUN 2), ett (NUM 86, DET 1, X 1), 2 (NUM 83, PROPN 1), 3 (NUM 51, PROPN 1), hundre (NUM 15, NOUN 13), null (NUM 11, NOUN 1), tusen (NOUN 22, NUM 8), 32 (NUM 7, X 1), 34 (NUM 5, X 1)

Morphology

The form / lemma ratio of NUM is 1.007267 (the average of all parts of speech is 1.381903).

The 1st highest number of forms (3) was observed with the lemma “én”: ett, Èn, én.

The 2nd highest number of forms (2) was observed with the lemma “2”: 2, 2s.

The 3rd highest number of forms (2) was observed with the lemma “2011”: 2011, 2011s.

NUM occurs with 5 features: NumType (3962; 100% instances), Number (3759; 95% instances), Gender (164; 4% instances), Definite (137; 3% instances), Case (3; 0% instances)

NUM occurs with 8 feature-value pairs: Case=Gen, Definite=Def, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 9 feature combinations. The most frequent feature combination is Number=Plur|NumType=Card (3519 tokens). Examples: to, tre, fire, 2, fem, ti, 20, seks, 3, 50

Relations

NUM nodes are attached to their parents using 18 different relations: nummod (2299; 58% instances), nmod (604; 15% instances), obl (464; 12% instances), conj (155; 4% instances), flat:name (86; 2% instances), nsubj (77; 2% instances), compound (64; 2% instances), root (54; 1% instances), obj (47; 1% instances), appos (35; 1% instances), xcomp (34; 1% instances), advcl (12; 0% instances), acl (8; 0% instances), nsubj:pass (8; 0% instances), parataxis (8; 0% instances), acl:relcl (4; 0% instances), orphan (2; 0% instances), csubj (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (2739; 69% instances), VERB (569; 14% instances), PROPN (255; 6% instances), NUM (208; 5% instances), ADJ (109; 3% instances), (54; 1% instances), DET (9; 0% instances), ADP (7; 0% instances), PRON (7; 0% instances), ADV (5; 0% instances)

2332 (59%) NUM nodes are leaves.

1114 (28%) NUM nodes have one child.

374 (9%) NUM nodes have two children.

142 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.

Children of NUM nodes are attached using 25 different relations: case (778; 32% instances), punct (488; 20% instances), advmod (262; 11% instances), obl (218; 9% instances), conj (156; 6% instances), nmod (134; 5% instances), det (72; 3% instances), cc (68; 3% instances), cop (58; 2% instances), compound (53; 2% instances), nsubj (53; 2% instances), acl:relcl (29; 1% instances), mark (25; 1% instances), orphan (13; 1% instances), appos (9; 0% instances), amod (8; 0% instances), acl (6; 0% instances), advcl (6; 0% instances), expl (4; 0% instances), flat:name (4; 0% instances), acl:cleft (2; 0% instances), aux (2; 0% instances), xcomp (2; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: ADP (794; 32% instances), PUNCT (488; 20% instances), NOUN (306; 12% instances), NUM (208; 8% instances), ADV (177; 7% instances), ADJ (108; 4% instances), DET (79; 3% instances), CCONJ (68; 3% instances), AUX (60; 2% instances), PROPN (49; 2% instances), PRON (48; 2% instances), VERB (41; 2% instances), SCONJ (24; 1% instances), PART (2; 0% instances)