Treebank Statistics: UD_Swedish-SweLL: POS Tags: NUM
There are 29 NUM lemmas (2%), 31 NUM types (1%) and 55 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: två, 18, en, 1, tre, fyra, 2, 25, 4, 50
The 10 most frequent NUM types: två, 18, 1, tre, ett, fyra, 2, 25, 4, 50
The 10 most frequent ambiguous lemmas: en (DET 206, NUM 5, PRON 4)
The 10 most frequent ambiguous types: ett (DET 63, NUM 3, PRON 2), en (DET 126, NUM 2, PRON 2)
Morphology
The form / lemma ratio of NUM is 1.068966 (the average of all parts of speech is 1.401542).
The 1st highest number of forms (2) was observed with the lemma “en”: en, ett.
The 2nd highest number of forms (2) was observed with the lemma “två”: tva, två.
The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
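The reported form / lemma ratio appears consistent with dividing the number of NUM types by the number of NUM lemmas. The short check below uses only the counts given above; the variable names are illustrative, and the assumption that every lemma other than "en" and "två" has exactly one form follows from the third-highest form count being 1.

```python
# Counts copied from the statistics above.
num_lemmas = 29
num_types = 31

# Form / lemma ratio, printed with the same six decimals as the report.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")

# Cross-check: two lemmas ("en", "två") have 2 forms each, and the
# remaining lemmas are assumed to have exactly 1 form each.
forms_total = 2 + 2 + (num_lemmas - 2) * 1
print(forms_total)
```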
NUM occurs with 6 features: NumType (54; 98% instances), Case (53; 96% instances), Definite (5; 9% instances), Gender (5; 9% instances), Number (5; 9% instances), Typo (4; 7% instances)
NUM occurs with 7 feature-value pairs: Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, NumType=Card, Number=Sing, Typo=Yes
NUM occurs with 5 feature combinations.
The most frequent feature combination is Case=Nom|NumType=Card (45 tokens).
Examples: två, 18, 1, tre, fyra, 2, 25, 4, 1-12, 10
Relations
NUM nodes are attached to their parents using 7 different relations: nummod (43; 78% instances), discourse (3; 5% instances), obl (3; 5% instances), nmod (2; 4% instances), root (2; 4% instances), conj (1; 2% instances), dep (1; 2% instances)
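As a sanity check, the percentages can be recomputed from the raw counts. This is a minimal sketch that assumes each share is the relation count divided by the total number of NUM tokens, rounded to the nearest whole percent; the dictionary simply restates the counts listed above.

```python
# Relation counts for NUM nodes, copied from the line above.
relation_counts = {
    "nummod": 43,
    "discourse": 3,
    "obl": 3,
    "nmod": 2,
    "root": 2,
    "conj": 1,
    "dep": 1,
}

# The counts should add up to the number of NUM tokens.
total = sum(relation_counts.values())

# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```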
Parents of NUM nodes belong to 5 different parts of speech: NOUN (46; 84% instances), VERB (5; 9% instances), no part of speech, i.e. the artificial root (2; 4% instances), NUM (1; 2% instances), PROPN (1; 2% instances)
38 (69%) NUM nodes are leaves.
15 (27%) NUM nodes have one child.
0 (0%) NUM nodes have two children.
2 (4%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
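The child-degree figures above could be derived from a CoNLL-U file roughly as follows. This is a sketch under stated assumptions, not the script that produced this page: the function name `child_degrees` is hypothetical, and the two sample sentences are invented for illustration and are not taken from the treebank.

```python
from collections import Counter

# Invented CoNLL-U sample (not from UD_Swedish-SweLL).
SAMPLE = """\
# text = Jag har ungefär två katter .
1\tJag\tjag\tPRON\t_\t_\t2\tnsubj\t_\t_
2\thar\tha\tVERB\t_\t_\t0\troot\t_\t_
3\tungefär\tungefär\tADV\t_\t_\t4\tadvmod\t_\t_
4\ttvå\ttvå\tNUM\t_\t_\t5\tnummod\t_\t_
5\tkatter\tkatt\tNOUN\t_\t_\t2\tobj\t_\t_
6\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

# text = Hon är 18 .
1\tHon\thon\tPRON\t_\t_\t3\tnsubj\t_\t_
2\tär\tvara\tAUX\t_\t_\t3\tcop\t_\t_
3\t18\t18\tNUM\t_\t_\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""


def child_degrees(conllu: str, upos: str = "NUM") -> list[int]:
    """Return the number of children of every node tagged `upos`."""
    degrees: list[int] = []
    sentence: list[tuple[str, str, str]] = []  # (id, upos, head)

    def flush() -> None:
        # Count how many tokens point at each head id within the sentence.
        children = Counter(head for _, _, head in sentence)
        for token_id, pos, _ in sentence:
            if pos == upos:
                degrees.append(children[token_id])
        sentence.clear()

    # The extra empty string flushes the last sentence.
    for line in conllu.splitlines() + [""]:
        line = line.strip()
        if line.startswith("#"):
            continue  # sentence-level comment
        if not line:
            flush()   # a blank line ends a sentence
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees


degrees = child_degrees(SAMPLE)
print(degrees)
print("leaves:", sum(d == 0 for d in degrees), "max degree:", max(degrees))
```

On the full treebank, the same list of degrees would yield the leaf count, the one-child count, and the highest child degree reported above.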
Children of NUM nodes are attached using 11 different relations: advmod (9; 33% instances), punct (4; 15% instances), case (2; 7% instances), conj (2; 7% instances), cop (2; 7% instances), nmod (2; 7% instances), obl (2; 7% instances), advcl (1; 4% instances), cc (1; 4% instances), expl (1; 4% instances), nsubj (1; 4% instances)
Children of NUM nodes belong to 9 different parts of speech: ADV (9; 33% instances), NOUN (5; 19% instances), PUNCT (4; 15% instances), ADP (2; 7% instances), AUX (2; 7% instances), VERB (2; 7% instances), CCONJ (1; 4% instances), NUM (1; 4% instances), PRON (1; 4% instances)