Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM
There are 78 NUM
lemmas (1%), 123 NUM
types (1%) and 1048 NUM
tokens (1%).
Out of 16 observed tags, the rank of NUM
is: 8 in number of lemmas, 8 in number of types and 15 in number of tokens.
The 10 most frequent NUM
lemmas: en, dva, trije, pet, tisoč, dvajset, štirje, deset, trideset, sedem
The 10 most frequent NUM
types: dva, ena, en, tri, tisoč, pet, eno, dve, dvajset, trideset
The 10 most frequent ambiguous lemmas: en (NUM 225, DET 140), pet (NUM 53, ADJ 2), drug (ADJ 191, NUM 1), dvajseti (ADJ 4, NUM 1)
The 10 most frequent ambiguous types: ena (NUM 61, DET 22), en (NUM 54, DET 44), eno (NUM 45, DET 40), eni (NUM 21, DET 5), sto (NUM 21, X 1), enega (NUM 18, DET 9), ene (ADV 13, NUM 13, DET 9), enem (NUM 6, DET 3), enim (NUM 4, DET 2), tridesetih (NUM 3, ADJ 1)
- ena
- en
- eno
- eni
- sto
- NUM 21: on ima približno sto ljudi v veleposlaništvih po Evropi .
- X 1: aja , zdaj sem jaz na kratko že opisal zgodbo tega filma , e , pravzaprav v vsaki , v vsakem filmu je ena zgodba , mogoče opišem zgodbo , e , Kraljeva vrnitev , kjer na prestol že več sto leti ni sedel pravi kralj , e , na prestolu vladajo sicer ljudje kot neki namestniki kralja , kralj , ki je po krvni liniji kralj , pa se boji tega , e , svojega nasledstva , ker je njegov prednik Ilzidur , e , bil , e , pravzaprav podvržen , e , slabosti , e , gospodarja prstana , se pravi tega prstana .
- enega
- ene
- enem
- NUM 6: ja , ker jaz se spomnim , da sem pa šel na enem izmed teh pohodov na Kamniško sedlo iz Kamniške Bistrice gor .
- DET 3: no , skozi zgodbo spremljamo dekle po imenu Mima , ki je nastopala kot pevka in plesalka v enem triu , kjer je bilo zelo važno vzdrževati neko tako javno podobo neke take popolne , nedolžne deklice .
- enim
- NUM 4: in smo pred enim mescem dobili pa psičko vipet- , eem , vipetko , mmm , ki ji je srednji otrok dal ime Luna .
- DET 2: s temi tisoč petsto markami sem se peljal z enim kolegom , [name:personal] [name:surname] , iz Maribora v Nemčijo , on je kupil dejansko kamion , jaz sem pa iskal enega Opel , Opel karavan , brez šip .
- tridesetih
- NUM 3: e , v tridesetih letih od sprejetja Konvencije o otrokovih pravicah je le-ta , e , pomembno vplivala in pomagala položaju otrok po svetu .
- ADJ 1: je pa tudi posebna moda , pač glih ta moda iz dvajsetih , tridesetih let prejšnjega stoletja , pa te polkadot pač tele pike po oblekah ali pa resice .
Morphology
The form / lemma ratio of NUM
is 1.576923 (the average of all parts of speech is 1.751794).
The 1st highest number of forms (10) was observed with the lemma “en”: een, en, ena, ene, enega, enem, eni, enih, enim, eno.
The 2nd highest number of forms (4) was observed with the lemma “dva”: dva, dve, dveh, dvema.
The 3rd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.
NUM
occurs with 5 features: Case (1048; 100% instances), NumType (1048; 100% instances), Number (1048; 100% instances), NumForm (1047; 100% instances), Gender (496; 47% instances)
NUM
occurs with 16 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumForm=Word
, NumType=Card
, NumType=Ord
, NumType=Sets
, Number=Dual
, Number=Plur
, Number=Sing
NUM
occurs with 53 feature combinations.
The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card
(317 tokens).
Examples: pet, dvajset, tisoč, trideset, deset, petnajst, sto, petdeset, sedem, tristo
Relations
NUM
nodes are attached to their parents using 19 different relations: nummod (572; 55% instances), flat (144; 14% instances), obl (72; 7% instances), conj (65; 6% instances), nsubj (48; 5% instances), root (48; 5% instances), obj (28; 3% instances), parataxis (22; 2% instances), reparandum (14; 1% instances), nmod (7; 1% instances), appos (6; 1% instances), orphan (5; 0% instances), amod (4; 0% instances), fixed (4; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), discourse:filler (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM
nodes belong to 12 different parts of speech: NOUN (585; 56% instances), NUM (195; 19% instances), VERB (157; 15% instances), (48; 5% instances), ADJ (27; 3% instances), DET (10; 1% instances), PRON (8; 1% instances), PROPN (8; 1% instances), X (6; 1% instances), ADV (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)
630 (60%) NUM
nodes are leaves.
241 (23%) NUM
nodes have one child.
105 (10%) NUM
nodes have two children.
72 (7%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 8.
Children of NUM
nodes are attached using 27 different relations: flat (142; 19% instances), punct (142; 19% instances), advmod (123; 16% instances), conj (73; 10% instances), case (63; 8% instances), cc (26; 3% instances), discourse (22; 3% instances), nmod (22; 3% instances), cop (20; 3% instances), det (16; 2% instances), parataxis (16; 2% instances), nsubj (15; 2% instances), reparandum (13; 2% instances), amod (12; 2% instances), orphan (9; 1% instances), discourse:filler (8; 1% instances), acl (7; 1% instances), mark (6; 1% instances), fixed (5; 1% instances), aux (3; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), cc:preconj (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:restart (1; 0% instances)
Children of NUM
nodes belong to 16 different parts of speech: NUM (195; 26% instances), PUNCT (142; 19% instances), PART (75; 10% instances), ADP (70; 9% instances), ADV (53; 7% instances), DET (52; 7% instances), NOUN (37; 5% instances), ADJ (24; 3% instances), AUX (23; 3% instances), VERB (21; 3% instances), CCONJ (20; 3% instances), X (13; 2% instances), INTJ (12; 2% instances), SCONJ (6; 1% instances), PRON (4; 1% instances), PROPN (3; 0% instances)