This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: NUM

There are 789 NUM lemmas (3%), 795 NUM types (2%) and 4050 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
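Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy sentence are invented for illustration, and treating types as case-sensitive word forms is an assumption.

```python
from collections import Counter

def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            types[form] += 1    # assumption: types are case-sensitive forms
            lemmas[lemma] += 1
    return len(lemmas), len(types), sum(types.values())

# Invented toy data, not taken from the treebank.
TOY_ROWS = [
    "1 to to NUM _ NumType=Card 2 nummod _ _",
    "2 bilar bil NOUN _ _ 0 root _ _",
    "3 og og CCONJ _ _ 5 cc _ _",
    "4 To to NUM _ NumType=Card 5 nummod _ _",
    "5 bøker bok NOUN _ _ 2 conj _ _",
]
TOY = "\n".join("\t".join(row.split()) for row in TOY_ROWS)

print(upos_counts(TOY, "NUM"))
```

On the toy data the two NUM tokens share one lemma but have two distinct forms, which is the same kind of gap as between the 789 lemmas and 795 types reported above.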

The 10 most frequent NUM lemmas: to, tre, fire, ti, fem, 20, 1, seks, 2005, 2006

The 10 most frequent NUM types: to, tre, fire, ti, fem, 20, 1, seks, 2005, 2006

The 10 most frequent ambiguous lemmas: to (NUM 387, X 7), tre (NUM 149, VERB 8, NOUN 4), fire (NUM 99, VERB 1, X 1), 2 (NUM 50, X 1), 50 (NUM 48, X 1), eine (NUM 43, ADJ 1), 30 (NUM 40, X 1), 40 (NUM 29, X 2), hundre (NUM 24, NOUN 18), noko (PRON 283, NUM 24)

The 10 most frequent ambiguous types: to (NUM 357, X 7), tre (NUM 135, NOUN 3), fire (NUM 92, VERB 1, X 1), 2 (NUM 50, X 1), 50 (NUM 48, X 1), 30 (NUM 40, X 1), åtte (NUM 34, VERB 1), 40 (NUM 29, X 2), hundre (NUM 24, NOUN 18), noko (PRON 274, DET 183, NUM 23)
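An ambiguous lemma or type here is one that occurs with NUM and with at least one other tag. The sketch below shows one way to find such items, assuming the input is a list of (lemma or form, UPOS) pairs. The helper name `ambiguous` and the toy pairs are invented, and ordering by NUM count is an assumption that merely matches the order of the lists above.

```python
from collections import Counter, defaultdict

def ambiguous(pairs, upos):
    """List items seen with `upos` and with at least one other tag."""
    tags_by_item = defaultdict(Counter)
    for item, tag in pairs:
        tags_by_item[item][tag] += 1
    hits = [(item, tags) for item, tags in tags_by_item.items()
            if upos in tags and len(tags) > 1]
    # Most frequent under the tag of interest first.
    return sorted(hits, key=lambda hit: -hit[1][upos])

# Invented toy pairs; the counts do not come from the treebank.
TOY_PAIRS = [
    ("tre", "NUM"), ("tre", "NOUN"),
    ("to", "NUM"), ("to", "NUM"), ("to", "X"),
    ("fem", "NUM"), ("noko", "PRON"),
]

for item, tags in ambiguous(TOY_PAIRS, "NUM"):
    print(item, dict(tags))
```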

Morphology

The form / lemma ratio of NUM is 1.007605 (the average of all parts of speech is 1.346455).
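The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given above (795 and 789). A short check, assuming that definition (the function name `form_lemma_ratio` is invented here):

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms per distinct lemma."""
    return n_types / n_lemmas

# 795 NUM types and 789 NUM lemmas, as reported on this page.
print(f"{form_lemma_ratio(795, 789):.6f}")
```

A value this close to 1 means that almost every NUM lemma appears in a single form, whereas the average over all parts of speech is about 1.35 forms per lemma.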

The highest number of forms (3) was observed with the lemma “annankvar”: annakvar, annakvart, annankvar.

The second highest number of forms (also 3) was observed with the lemma “éin”: èin, éi, éin.

The third highest number of forms (2) was observed with the lemma “halvannan”: halvanna, halvtanna.

NUM occurs with 4 features: NumType (4050; 100% instances), Number (3657; 90% instances), Gender (75; 2% instances), PronType (1; 0% instances)

NUM occurs with 6 feature-value pairs: Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, PronType=Dem

NUM occurs with 6 feature combinations. The most frequent feature combination is Number=Plur|NumType=Card (3657 tokens). Examples: to, tre, fire, ti, fem, 20, seks, 2005, 2006, 2
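The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of CoNLL-U. A minimal sketch, assuming FEATS strings in the standard `Name=Value|Name=Value` form with `_` for no features; the function name `feature_stats` and the toy input are invented for illustration.

```python
from collections import Counter

def feature_stats(feats_values):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1          # the full FEATS string is one combination
        if feats == "_":
            continue                # token carries no features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos

# Invented toy input: FEATS values of three NUM tokens.
TOY_FEATS = [
    "Number=Plur|NumType=Card",
    "Number=Plur|NumType=Card",
    "Gender=Fem|NumType=Card",
]

features, pairs, combos = feature_stats(TOY_FEATS)
print(features.most_common())
print(combos.most_common(1))
```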

Relations

NUM nodes are attached to their parents using 18 different relations: nummod (2344; 58% instances), nmod (554; 14% instances), obl (528; 13% instances), conj (206; 5% instances), nsubj (84; 2% instances), flat:name (81; 2% instances), parataxis (59; 1% instances), obj (52; 1% instances), root (51; 1% instances), compound (34; 1% instances), appos (19; 0% instances), xcomp (16; 0% instances), nsubj:pass (6; 0% instances), flat (5; 0% instances), nsubj:outer (4; 0% instances), ccomp (3; 0% instances), flat:foreign (3; 0% instances), csubj (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech (counting the artificial root, which has no tag, as one): NOUN (2780; 69% instances), VERB (491; 12% instances), NUM (273; 7% instances), PROPN (214; 5% instances), ADJ (209; 5% instances), no parent, i.e. attached to the root (51; 1% instances), DET (14; 0% instances), ADV (7; 0% instances), PRON (6; 0% instances), X (3; 0% instances), ADP (2; 0% instances)

2336 (58%) NUM nodes are leaves.

1225 (30%) NUM nodes have one child.

295 (7%) NUM nodes have two children.

194 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 9.
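The four child-degree counts above add up to the 4050 NUM tokens (2336 + 1225 + 295 + 194). The sketch below shows how such a histogram could be computed, assuming each sentence is given as a list of (ID, UPOS, HEAD) triples; the function name `child_degree_histogram` and the toy sentence are invented for illustration.

```python
from collections import Counter

def child_degree_histogram(sentences, upos):
    """Map number of children -> number of `upos` nodes with that many."""
    histogram = Counter()
    for sentence in sentences:
        # How many tokens name each ID as their HEAD.
        n_children = Counter(head for _, _, head in sentence)
        for token_id, tag, _ in sentence:
            if tag == upos:
                histogram[n_children[token_id]] += 1
    return histogram

# Invented toy sentence as (ID, UPOS, HEAD); HEAD 0 is the root.
TOY_SENTENCE = [
    (1, "ADP", 2),
    (2, "NUM", 3),    # one child: the ADP
    (3, "VERB", 0),
    (4, "NUM", 3),    # leaf
    (5, "PUNCT", 3),
]

hist = child_degree_histogram([TOY_SENTENCE], "NUM")
print(sorted(hist.items()))
print("highest child degree:", max(hist))
```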

Children of NUM nodes are attached using 24 different relations: case (806; 32% instances), punct (518; 21% instances), obl (296; 12% instances), conj (198; 8% instances), advmod (193; 8% instances), nmod (116; 5% instances), cc (98; 4% instances), det (73; 3% instances), cop (46; 2% instances), nsubj (39; 2% instances), acl:relcl (29; 1% instances), compound (23; 1% instances), flat (15; 1% instances), appos (14; 1% instances), mark (13; 1% instances), advcl (7; 0% instances), parataxis (7; 0% instances), amod (5; 0% instances), expl (5; 0% instances), xcomp (5; 0% instances), aux (3; 0% instances), csubj (2; 0% instances), nummod (2; 0% instances), acl (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (806; 32% instances), PUNCT (518; 21% instances), NOUN (347; 14% instances), NUM (273; 11% instances), ADV (111; 4% instances), ADJ (96; 4% instances), DET (78; 3% instances), CCONJ (73; 3% instances), AUX (49; 2% instances), PRON (49; 2% instances), VERB (35; 1% instances), PROPN (33; 1% instances), SYM (27; 1% instances), SCONJ (12; 0% instances), PART (7; 0% instances)