home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: NUM

There are 790 NUM lemmas (3%), 797 NUM types (2%) and 4032 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: to, tre, fire, ti, fem, 20, 1, seks, 2005, 2006

The 10 most frequent NUM types: to, tre, fire, ti, fem, 20, 1, seks, 2005, 2006

The 10 most frequent ambiguous lemmas: to (NUM 387, X 7), tre (NUM 149, VERB 8, NOUN 4), fire (NUM 99, VERB 1, X 1), 2 (NUM 50, X 1), 50 (NUM 48, X 1), eine (NUM 43, ADJ 1), 30 (NUM 40, X 1), 40 (NUM 29, X 2), hundre (NUM 24, NOUN 18), 8 (NUM 15, PROPN 2)

The 10 most frequent ambiguous types: to (NUM 357, X 7), tre (NUM 135, NOUN 3), fire (NUM 92, VERB 1, X 1), 2 (NUM 50, X 1), 50 (NUM 48, X 1), 30 (NUM 40, X 1), åtte (NUM 34, VERB 1), 40 (NUM 29, X 2), hundre (NUM 24, NOUN 18), 8 (NUM 15, PROPN 2)

Morphology

The form / lemma ratio of NUM is 1.008861 (the average of all parts of speech is 1.352830).

The 1st highest number of forms (3) was observed with the lemma “annankvar”: annakvar, annakvart, annankvar.

The 2nd highest number of forms (3) was observed with the lemma “éin”: èin, éi, éin.

The 3rd highest number of forms (2) was observed with the lemma “en”: ei, en.

NUM occurs with 5 features: NumType (4032; 100% instances), Number (3757; 93% instances), Definite (94; 2% instances), Gender (57; 1% instances), PronType (1; 0% instances)

NUM occurs with 8 feature-value pairs: Definite=Def, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, PronType=Dem

NUM occurs with 8 feature combinations. The most frequent feature combination is Number=Plur|NumType=Card (3657 tokens). Examples: to, tre, fire, ti, fem, 20, seks, 2005, 2006, 2

Relations

NUM nodes are attached to their parents using 19 different relations: nummod (2330; 58% instances), nmod (552; 14% instances), obl (515; 13% instances), conj (208; 5% instances), nsubj (87; 2% instances), flat:name (86; 2% instances), parataxis (60; 1% instances), root (50; 1% instances), obj (47; 1% instances), compound (34; 1% instances), appos (14; 0% instances), orphan (14; 0% instances), xcomp (13; 0% instances), advcl (8; 0% instances), nsubj:pass (5; 0% instances), ccomp (3; 0% instances), flat:foreign (3; 0% instances), acl (2; 0% instances), csubj (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (2819; 70% instances), VERB (573; 14% instances), NUM (273; 7% instances), PROPN (160; 4% instances), ADJ (124; 3% instances), (50; 1% instances), DET (14; 0% instances), PRON (7; 0% instances), ADV (6; 0% instances), ADP (3; 0% instances), X (3; 0% instances)

2318 (57%) NUM nodes are leaves.

1222 (30%) NUM nodes have one child.

295 (7%) NUM nodes have two children.

197 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 9.

Children of NUM nodes are attached using 25 different relations: case (799; 32% instances), punct (550; 22% instances), obl (304; 12% instances), conj (198; 8% instances), advmod (178; 7% instances), nmod (113; 4% instances), cc (73; 3% instances), det (73; 3% instances), cop (45; 2% instances), nsubj (38; 2% instances), acl:relcl (26; 1% instances), compound (23; 1% instances), mark (20; 1% instances), flat:name (15; 1% instances), appos (14; 1% instances), orphan (13; 1% instances), acl (8; 0% instances), advcl (8; 0% instances), parataxis (7; 0% instances), amod (5; 0% instances), expl (5; 0% instances), aux (3; 0% instances), csubj (2; 0% instances), nummod (2; 0% instances), xcomp (2; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: ADP (810; 32% instances), PUNCT (550; 22% instances), NOUN (354; 14% instances), NUM (273; 11% instances), ADV (101; 4% instances), ADJ (94; 4% instances), DET (78; 3% instances), CCONJ (74; 3% instances), AUX (48; 2% instances), PRON (47; 2% instances), VERB (40; 2% instances), PROPN (29; 1% instances), SCONJ (19; 1% instances), PART (7; 0% instances)