
This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: POS Tags: NUM

There are 136 NUM lemmas (1%), 145 NUM types (1%) and 496 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: två, tre, ett, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent NUM types: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent ambiguous lemmas: två (NUM 84, ADJ 44), tre (NUM 38, ADJ 11), ett (NUM 30, PRON 1), fem (NUM 21, ADJ 1), sex (NUM 20, ADJ 2), fyra (NUM 14, ADJ 6), 1 (NUM 13, ADJ 2), 3 (NUM 5, ADJ 1), elva (NUM 5, ADJ 2), 60 (NUM 4, NOUN 1)

The 10 most frequent ambiguous types: en (DET 1652, PRON 53, NUM 23), 1 (NUM 13, ADJ 2), 3 (NUM 5, ADJ 1), ett (DET 743, PRON 11, NUM 3), 12 (NUM 3, ADJ 2), 4 (NUM 3, ADJ 1), 30 (NUM 2, ADJ 1), U (NUM 2, PROPN 1), sexti (NUM 2, NOUN 1), 14 (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.066176 (the average of all parts of speech is 1.416904).
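The reported ratio is consistent with dividing the number of NUM types (145) by the number of NUM lemmas (136), both taken from the counts at the top of this page. The sketch below only checks that arithmetic; the function name `form_lemma_ratio` is hypothetical and not part of any UD tool.

```python
# Counts copied from this page; the helper name is an illustrative assumption.
NUM_TYPES = 145   # distinct NUM word forms
NUM_LEMMAS = 136  # distinct NUM lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(NUM_TYPES, NUM_LEMMAS)
print(f"{ratio:.6f}")  # 1.066176
```

A ratio close to 1 means that most NUM lemmas appear in a single form, which fits the low feature counts reported for this tag.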

The 1st highest number of forms (3) was observed with the lemma “25”: 25, tjugofem, tjugufem.

The 2nd highest number of forms (3) was observed with the lemma “60”: 60, sexti, sextio.

The 3rd highest number of forms (2) was observed with the lemma “1999.07.01”: 1999.07.01, _1999.07.01.

NUM occurs with 4 features: Case (2; 0% instances), NumType (2; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)

NUM occurs with 4 feature-value pairs: Case=Nom, Gender=Neut, NumType=Card, Number=Sing

NUM occurs with 4 feature combinations. The most frequent feature combination is _ (492 tokens). Examples: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (325; 66% instances), obl (58; 12% instances), conj (28; 6% instances), discourse (24; 5% instances), appos (15; 3% instances), obj (12; 2% instances), nsubj (11; 2% instances), root (11; 2% instances), xcomp (5; 1% instances), nmod (3; 1% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)
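The percentages in the relation list above can be reproduced from the raw counts. The following is a minimal sketch, assuming each percentage is the count divided by the total number of NUM tokens and rounded to the nearest integer; the names `relation_counts` and `percent` are illustrative only.

```python
# Relation counts copied from the list above.
relation_counts = {
    "nummod": 325, "obl": 58, "conj": 28, "discourse": 24, "appos": 15,
    "obj": 12, "nsubj": 11, "root": 11, "xcomp": 5, "nmod": 3,
    "nsubj:pass": 2, "ccomp": 1, "parataxis": 1,
}

total = sum(relation_counts.values())  # should equal the 496 NUM tokens


def percent(count: int, total: int) -> int:
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)


for relation, count in relation_counts.items():
    print(f"{relation} ({count}; {percent(count, total)}% instances)")
```

The counts sum to 496, matching the NUM token total, so every NUM node is covered by exactly one incoming relation.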

Parents of NUM nodes belong to 10 different parts of speech: NOUN (322; 65% instances), VERB (94; 19% instances), NUM (30; 6% instances), PROPN (24; 5% instances), no parent, i.e. the NUM node is the root (11; 2% instances), ADJ (7; 1% instances), PRON (3; 1% instances), ADV (2; 0% instances), SYM (2; 0% instances), X (1; 0% instances)

307 (62%) NUM nodes are leaves.

120 (24%) NUM nodes have one child.

40 (8%) NUM nodes have two children.

29 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.
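The child-degree figures above (leaves, one child, two children, and so on) could be computed from a CoNLL-U file roughly as follows. This is a sketch under stated assumptions, not the script that generated this page: the function `num_child_degrees` is hypothetical, and the two toy sentences are invented for illustration rather than taken from UD_Swedish-LinES.

```python
from collections import Counter


def num_child_degrees(conllu_text: str) -> Counter:
    """Map child degree -> number of NUM nodes with that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic tokens only (skip multiword ranges and empty nodes).
        rows = [r for r in rows if r[0].isdigit()]
        # Column 7 (index 6) is HEAD; count how often each ID is a head.
        child_count = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == "NUM":  # column 4 (index 3) is UPOS
                degrees[child_count[r[0]]] += 1
    return degrees


def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line with unused columns set to '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_",
                      str(head), deprel, "_", "_"])


# Invented toy data: "två" is a leaf, "tre" has one child ("dem").
sentence_1 = [row(1, "Hon", "hon", "PRON", 2, "nsubj"),
              row(2, "köpte", "köpa", "VERB", 0, "root"),
              row(3, "två", "två", "NUM", 4, "nummod"),
              row(4, "böcker", "bok", "NOUN", 2, "obj")]
sentence_2 = [row(1, "tre", "tre", "NUM", 0, "root"),
              row(2, "av", "av", "ADP", 3, "case"),
              row(3, "dem", "de", "PRON", 1, "nmod")]
sample = "\n".join(sentence_1) + "\n\n" + "\n".join(sentence_2)

degrees = num_child_degrees(sample)
print(dict(degrees))
```

On the toy input, one NUM node is a leaf and one has a single child; on the real treebank the same tally would be expected to give the 307 / 120 / 40 / 29 split reported above.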

Children of NUM nodes are attached using 16 different relations: nmod (82; 26% instances), punct (58; 18% instances), case (41; 13% instances), conj (39; 12% instances), advmod (36; 11% instances), cc (18; 6% instances), cop (10; 3% instances), det (8; 3% instances), nsubj (8; 3% instances), nummod (3; 1% instances), parataxis (3; 1% instances), amod (2; 1% instances), appos (2; 1% instances), mark (2; 1% instances), acl:relcl (1; 0% instances), obl (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NOUN (86; 27% instances), PUNCT (58; 18% instances), ADP (40; 13% instances), ADV (37; 12% instances), NUM (30; 10% instances), CCONJ (18; 6% instances), AUX (10; 3% instances), DET (8; 3% instances), PRON (8; 3% instances), ADJ (5; 2% instances), VERB (5; 2% instances), PROPN (3; 1% instances), SCONJ (3; 1% instances), PART (2; 1% instances), X (1; 0% instances)