home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-PUD: POS Tags: NUM

There are 210 NUM lemmas (4%), 211 NUM types (3%) and 402 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: två, tre, 1, fyra, 3, sex, 10, tio, 000, 2

The 10 most frequent NUM types: två, tre, 1, fyra, sex, 10, tio, 000, 2014, 2015

The 10 most frequent ambiguous lemmas: en (DET 453, PRON 15, NUM 2), 45 (ADJ 1, NUM 1), ett (PRON 6, NUM 1)

The 10 most frequent ambiguous types: 3 (NUM 4, ADJ 1), I (ADP 42, NUM 4), en (DET 294, PRON 14, NUM 2), 4 (ADJ 1, NUM 1), ett (DET 129, PRON 4, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.004762 (the average of all parts of speech is 1.239021).

The 1st highest number of forms (2) was observed with the lemma “1”: 1, I.

The 2nd highest number of forms (2) was observed with the lemma “2”: 2, II.

The 3rd highest number of forms (2) was observed with the lemma “3”: 3, III.

NUM occurs with 5 features: Case (387; 96% instances), NumType (6; 1% instances), Gender (3; 1% instances), Number (3; 1% instances), Definite (2; 0% instances)

NUM occurs with 6 feature-value pairs: Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, NumType=Card, Number=Sing

NUM occurs with 6 feature combinations. The most frequent feature combination is Case=Nom (385 tokens). Examples: två, tre, fyra, 1, sex, 10, tio, 000, 2014, 2015

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (275; 68% instances), obl (75; 19% instances), nmod (16; 4% instances), flat:name (7; 2% instances), nsubj (7; 2% instances), conj (6; 1% instances), advcl (4; 1% instances), appos (4; 1% instances), obj (2; 0% instances), orphan (2; 0% instances), parataxis (2; 0% instances), flat (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (272; 68% instances), VERB (81; 20% instances), PROPN (19; 5% instances), NUM (18; 4% instances), ADJ (6; 1% instances), PRON (3; 1% instances), ADV (2; 0% instances), DET (1; 0% instances)

294 (73%) NUM nodes are leaves.

63 (16%) NUM nodes have one child.

25 (6%) NUM nodes have two children.

20 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 15 different relations: advmod (46; 25% instances), case (41; 22% instances), nmod (30; 16% instances), punct (24; 13% instances), nummod (12; 7% instances), cc (7; 4% instances), conj (4; 2% instances), cop (4; 2% instances), mark (4; 2% instances), nsubj (4; 2% instances), det (3; 2% instances), acl:relcl (1; 1% instances), amod (1; 1% instances), appos (1; 1% instances), obl (1; 1% instances)

Children of NUM nodes belong to 14 different parts of speech: ADV (43; 23% instances), ADP (40; 22% instances), NOUN (31; 17% instances), PUNCT (24; 13% instances), NUM (18; 10% instances), CCONJ (7; 4% instances), SCONJ (5; 3% instances), ADJ (4; 2% instances), AUX (4; 2% instances), DET (3; 2% instances), PART (1; 1% instances), PROPN (1; 1% instances), SYM (1; 1% instances), VERB (1; 1% instances)