home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: POS Tags: NUM

There are 136 NUM lemmas (1%), 145 NUM types (1%) and 505 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: två, tre, ett, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent NUM types: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

The 10 most frequent ambiguous lemmas: två (NUM 84, ADJ 1), ett (NUM 30, PRON 1), 1 (NUM 13, ADJ 1), en (DET 2522, PRON 71, NUM 11), 3 (NUM 5, ADJ 1), 4 (NUM 3, NOUN 1), 22 (NUM 2, ADJ 1), 14 (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: en (DET 1649, PRON 53, NUM 27), 1 (NUM 13, ADJ 2), ett (DET 739, PRON 11, NUM 7), 3 (NUM 5, ADJ 1), 12 (NUM 3, ADJ 2), 4 (NUM 3, ADJ 1), 30 (NUM 2, ADJ 1), U (NUM 2, PROPN 1), 14 (ADJ 1, NUM 1), 22 (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.066176 (the average of all parts of speech is 1.414579).

The 1st highest number of forms (3) was observed with the lemma “25”: 25, tjugofem, tjugufem.

The 2nd highest number of forms (3) was observed with the lemma “60”: 60, sexti, sextio.

The 3rd highest number of forms (2) was observed with the lemma “1999.07.01”: 1999.07.01, _1999.07.01.

NUM occurs with 5 features: Number (9; 2% instances), Definite (8; 2% instances), Gender (8; 2% instances), Case (2; 0% instances), NumType (2; 0% instances)

NUM occurs with 6 feature-value pairs: Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, NumType=Card, Number=Sing

NUM occurs with 7 feature combinations. The most frequent feature combination is _ (493 tokens). Examples: två, tre, en, fem, sex, fyra, tio, 1, 2, 2000

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (333; 66% instances), obl (58; 11% instances), conj (28; 6% instances), discourse (24; 5% instances), appos (15; 3% instances), obj (12; 2% instances), nsubj (11; 2% instances), root (11; 2% instances), xcomp (5; 1% instances), nmod (4; 1% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (331; 66% instances), VERB (94; 19% instances), NUM (30; 6% instances), PROPN (24; 5% instances), (11; 2% instances), ADJ (7; 1% instances), PRON (3; 1% instances), ADV (2; 0% instances), SYM (2; 0% instances), X (1; 0% instances)

308 (61%) NUM nodes are leaves.

128 (25%) NUM nodes have one child.

40 (8%) NUM nodes have two children.

29 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 16 different relations: nmod (83; 26% instances), punct (58; 18% instances), conj (45; 14% instances), case (41; 13% instances), advmod (37; 11% instances), cc (19; 6% instances), cop (10; 3% instances), det (8; 2% instances), nsubj (8; 2% instances), nummod (3; 1% instances), parataxis (3; 1% instances), amod (2; 1% instances), appos (2; 1% instances), mark (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NOUN (88; 27% instances), PUNCT (58; 18% instances), ADP (40; 12% instances), ADV (38; 12% instances), NUM (30; 9% instances), CCONJ (19; 6% instances), ADJ (11; 3% instances), AUX (10; 3% instances), PRON (9; 3% instances), DET (8; 2% instances), VERB (4; 1% instances), PROPN (3; 1% instances), SCONJ (3; 1% instances), PART (2; 1% instances), X (1; 0% instances)