home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: POS Tags: NUM

There are 139 NUM lemmas (1%), 148 NUM types (1%) and 534 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: två, tre, ett, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent NUM types: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent ambiguous lemmas: två (NUM 92, ADJ 4), ett (NUM 31, PRON 1), fem (NUM 21, ADJ 1), sex (NUM 20, ADJ 1), 1 (NUM 13, ADJ 1), en (DET 2835, PRON 84, NUM 11), sju (NUM 11, ADJ 1), 3 (NUM 5, ADJ 1), elva (NUM 5, NOUN 1), 4 (NUM 3, NOUN 1)

The 10 most frequent ambiguous types: en (DET 1855, PRON 63, NUM 27), 1 (NUM 13, ADJ 2), ett (DET 844, PRON 14, NUM 8), 3 (NUM 5, ADJ 1), 12 (NUM 3, ADJ 2), 4 (NUM 3, ADJ 1), 30 (NUM 2, ADJ 1), U (NUM 2, PROPN 1), 14 (ADJ 1, NUM 1), 22 (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.064748 (the average of all parts of speech is 1.415109).

The 1st highest number of forms (3) was observed with the lemma “25”: 25, tjugofem, tjugufem.

The 2nd highest number of forms (3) was observed with the lemma “60”: 60, sexti, sextio.

The 3rd highest number of forms (2) was observed with the lemma “1999.07.01”: 1999.07.01, _1999.07.01.

NUM occurs with 5 features: Number (10; 2% instances), Gender (9; 2% instances), Definite (8; 1% instances), NumType (3; 1% instances), Case (2; 0% instances)

NUM occurs with 6 feature-value pairs: Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, NumType=Card, Number=Sing

NUM occurs with 7 feature combinations. The most frequent feature combination is _ (521 tokens). Examples: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (354; 66% instances), obl (55; 10% instances), conj (28; 5% instances), discourse (24; 4% instances), appos (15; 3% instances), nsubj (15; 3% instances), root (15; 3% instances), obj (10; 2% instances), nmod (6; 1% instances), xcomp (5; 1% instances), advcl (2; 0% instances), ccomp (2; 0% instances), nsubj:pass (2; 0% instances), parataxis (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (353; 66% instances), VERB (91; 17% instances), NUM (31; 6% instances), PROPN (25; 5% instances), (15; 3% instances), ADJ (7; 1% instances), PRON (5; 1% instances), ADV (3; 1% instances), SYM (2; 0% instances), AUX (1; 0% instances), X (1; 0% instances)

330 (62%) NUM nodes are leaves.

127 (24%) NUM nodes have one child.

42 (8%) NUM nodes have two children.

35 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 17 different relations: nmod (85; 24% instances), punct (62; 18% instances), conj (46; 13% instances), case (42; 12% instances), advmod (40; 11% instances), cc (20; 6% instances), cop (16; 5% instances), nsubj (14; 4% instances), det (8; 2% instances), mark (4; 1% instances), acl:relcl (3; 1% instances), amod (3; 1% instances), nummod (3; 1% instances), parataxis (3; 1% instances), appos (2; 1% instances), obl (2; 1% instances), advcl (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NOUN (90; 25% instances), PUNCT (62; 18% instances), ADP (41; 12% instances), ADV (41; 12% instances), NUM (31; 9% instances), CCONJ (20; 6% instances), AUX (16; 5% instances), PRON (14; 4% instances), ADJ (12; 3% instances), DET (9; 3% instances), VERB (6; 2% instances), SCONJ (5; 1% instances), PART (3; 1% instances), PROPN (3; 1% instances), X (1; 0% instances)