Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM
There are 77 NUM
lemmas (1%), 123 NUM
types (1%) and 1187 NUM
tokens (2%).
Out of 15 observed tags, the rank of NUM
is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: en, dva, trije, pet, tisoč, dvajset, štirje, deset, trideset, sedem
The 10 most frequent NUM
types: en, dva, eno, ena, tri, tisoč, pet, dve, dvajset, enega
The 10 most frequent ambiguous lemmas: pet (NUM 53, ADJ 2), dvajseti (ADJ 4, NUM 1)
The 10 most frequent ambiguous types: ene (NUM 22, ADV 13), sto (NUM 21, X 1), tridesetih (NUM 3, ADJ 1), een (INTJ 2, NUM 1), osemdesetih (ADJ 2, NUM 1)
- ene
- sto
- NUM 21: on ima približno sto ljudi v veleposlaništvih po Evropi
- X 1: aja zdaj sem jaz na kratko že opisal zgodbo tega filma e pravzaprav v vsaki v vsakem filmu je ena zgodba mogoče opišem zgodbo e Kraljeva vrnitev kjer na prestol že več sto leti ni sedel pravi kralj e na prestolu vladajo sicer ljudje kot nekaj namestniki kralja kralj ki je po krvni liniji kralj pa se boji tega e svojega nasledstva ker je njegov prednik Ilzidur e bil e pravzaprav podvržen e slabosti e gospodarja prstana se pravi tega prstana
- tridesetih
- een
- osemdesetih
Morphology
The form / lemma ratio of NUM
is 1.597403 (the average of all parts of speech is 1.748943).
The 1st highest number of forms (11) was observed with the lemma “en”: een, en, ena, ene, enega, enem, enemu, eni, enih, enim, eno.
The 2nd highest number of forms (4) was observed with the lemma “dva”: dva, dve, dveh, dvema.
The 3rd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.
NUM
occurs with 5 features: Case (1187; 100% instances), NumType (1187; 100% instances), Number (1187; 100% instances), NumForm (1186; 100% instances), Gender (635; 53% instances)
NUM
occurs with 16 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumForm=Word
, NumType=Card
, NumType=Ord
, NumType=Sets
, Number=Dual
, Number=Plur
, Number=Sing
NUM
occurs with 54 feature combinations.
The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card
(317 tokens).
Examples: pet, dvajset, tisoč, trideset, deset, petnajst, sto, petdeset, sedem, tristo
Relations
NUM
nodes are attached to their parents using 20 different relations: nummod (713; 60% instances), flat (144; 12% instances), obl (73; 6% instances), conj (64; 5% instances), root (48; 4% instances), nsubj (47; 4% instances), obj (29; 2% instances), parataxis (19; 2% instances), reparandum (14; 1% instances), appos (8; 1% instances), nmod (7; 1% instances), fixed (4; 0% instances), orphan (4; 0% instances), amod (3; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), discourse:filler (1; 0% instances), dislocated (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM
nodes belong to 12 different parts of speech: NOUN (710; 60% instances), NUM (197; 17% instances), VERB (157; 13% instances), (48; 4% instances), ADJ (33; 3% instances), PROPN (13; 1% instances), DET (10; 1% instances), PRON (8; 1% instances), X (6; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)
833 (70%) NUM
nodes are leaves.
206 (17%) NUM
nodes have one child.
93 (8%) NUM
nodes have two children.
55 (5%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 7.
Children of NUM
nodes are attached using 25 different relations: flat (142; 23% instances), advmod (122; 20% instances), conj (72; 12% instances), case (63; 10% instances), cc (26; 4% instances), nmod (23; 4% instances), discourse (21; 3% instances), cop (20; 3% instances), nsubj (15; 2% instances), parataxis (15; 2% instances), reparandum (14; 2% instances), amod (12; 2% instances), det (11; 2% instances), orphan (11; 2% instances), discourse:filler (9; 1% instances), acl (7; 1% instances), mark (6; 1% instances), fixed (5; 1% instances), aux (3; 0% instances), nummod (3; 0% instances), appos (2; 0% instances), parataxis:discourse (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: NUM (197; 32% instances), ADP (70; 12% instances), PART (69; 11% instances), ADV (56; 9% instances), DET (50; 8% instances), NOUN (38; 6% instances), ADJ (24; 4% instances), AUX (23; 4% instances), VERB (21; 3% instances), CCONJ (20; 3% instances), INTJ (13; 2% instances), X (12; 2% instances), SCONJ (6; 1% instances), PRON (4; 1% instances), PROPN (4; 1% instances)