Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM
There are 53 NUM
lemmas (1%), 83 NUM
types (1%) and 499 NUM
tokens (2%).
Out of 16 observed tags, the rank of NUM
is: 8 in number of lemmas, 8 in number of types and 15 in number of tokens.
The 10 most frequent NUM
lemmas: en, dva, trije, štirje, tisoč, dvajset, pet, petnajst, deset, sto
The 10 most frequent NUM
types: eno, dva, en, ena, tri, tisoč, dvajset, dve, pet, enega
The 10 most frequent ambiguous lemmas: pet (NUM 17, ADJ 2)
The 10 most frequent ambiguous types: ene (NUM 10, ADV 6), osemdesetih (ADJ 2, NUM 1)
- ene
- osemdesetih
Morphology
The form / lemma ratio of NUM
is 1.566038 (the average of all parts of speech is 1.573353).
The 1st highest number of forms (9) was observed with the lemma “en”: en, ena, ene, enega, enemu, eni, enih, enim, eno.
The 2nd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.
The 3rd highest number of forms (4) was observed with the lemma “štirje”: štiri, štirih, štirim, štirje.
NUM
occurs with 5 features: Case (499; 100% instances), NumForm (499; 100% instances), NumType (499; 100% instances), Number (499; 100% instances), Gender (270; 54% instances)
NUM
occurs with 15 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumForm=Word
, NumType=Card
, NumType=Sets
, Number=Dual
, Number=Plur
, Number=Sing
NUM
occurs with 39 feature combinations.
The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card
(122 tokens).
Examples: dvajset, sto, tisoč, petnajst, deset, pet, šestdeset, osemdeset, petdeset, petsto
Relations
NUM
nodes are attached to their parents using 16 different relations: nummod (287; 58% instances), flat (51; 10% instances), conj (30; 6% instances), root (26; 5% instances), nsubj (25; 5% instances), obl (25; 5% instances), obj (20; 4% instances), parataxis (9; 2% instances), reparandum (8; 2% instances), nmod (5; 1% instances), fixed (4; 1% instances), advmod (2; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), dislocated (2; 0% instances), acl (1; 0% instances)
Parents of NUM
nodes belong to 12 different parts of speech: NOUN (271; 54% instances), NUM (87; 17% instances), VERB (81; 16% instances), (26; 5% instances), ADJ (12; 2% instances), DET (7; 1% instances), PRON (5; 1% instances), PROPN (4; 1% instances), ADV (2; 0% instances), AUX (2; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)
346 (69%) NUM
nodes are leaves.
86 (17%) NUM
nodes have one child.
37 (7%) NUM
nodes have two children.
30 (6%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 8.
Children of NUM
nodes are attached using 23 different relations: flat (54; 19% instances), advmod (41; 15% instances), conj (40; 14% instances), case (32; 12% instances), punct (12; 4% instances), cc (11; 4% instances), cop (11; 4% instances), discourse (11; 4% instances), reparandum (10; 4% instances), nsubj (9; 3% instances), discourse:filler (7; 3% instances), nmod (7; 3% instances), parataxis (7; 3% instances), acl (4; 1% instances), amod (4; 1% instances), det (4; 1% instances), fixed (4; 1% instances), mark (3; 1% instances), nummod (2; 1% instances), parataxis:discourse (2; 1% instances), aux (1; 0% instances), cc:preconj (1; 0% instances), orphan (1; 0% instances)
Children of NUM
nodes belong to 16 different parts of speech: NUM (87; 31% instances), ADP (37; 13% instances), DET (26; 9% instances), ADV (20; 7% instances), PART (20; 7% instances), NOUN (16; 6% instances), AUX (12; 4% instances), PUNCT (12; 4% instances), VERB (11; 4% instances), INTJ (10; 4% instances), ADJ (8; 3% instances), CCONJ (7; 3% instances), X (5; 2% instances), PROPN (3; 1% instances), SCONJ (3; 1% instances), PRON (1; 0% instances)