home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM

There are 53 NUM lemmas (1%), 83 NUM types (1%) and 499 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 16 in number of tokens.

The 10 most frequent NUM lemmas: en, dva, trije, štirje, tisoč, dvajset, pet, petnajst, deset, sto

The 10 most frequent NUM types: eno, dva, en, ena, tri, tisoč, dvajset, dve, pet, enega

The 10 most frequent ambiguous lemmas: pet (NUM 17, ADJ 2)

The 10 most frequent ambiguous types: pet (NUM 14, X 1), ene (NUM 10, ADV 6), osemdesetih (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.566038 (the average of all parts of speech is 1.570645).

The 1st highest number of forms (9) was observed with the lemma “en”: en, ena, ene, enega, enemu, eni, enih, enim, eno.

The 2nd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.

The 3rd highest number of forms (4) was observed with the lemma “štirje”: štiri, štirih, štirim, štirje.

NUM occurs with 5 features: Case (499; 100% instances), NumForm (499; 100% instances), NumType (499; 100% instances), Number (499; 100% instances), Gender (270; 54% instances)

NUM occurs with 15 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Word, NumType=Card, NumType=Sets, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 39 feature combinations. The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card (122 tokens). Examples: dvajset, sto, tisoč, petnajst, deset, pet, šestdeset, osemdeset, petdeset, petsto

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (287; 58% instances), flat (51; 10% instances), conj (30; 6% instances), root (26; 5% instances), nsubj (25; 5% instances), obl (25; 5% instances), obj (20; 4% instances), parataxis (9; 2% instances), reparandum (8; 2% instances), nmod (5; 1% instances), fixed (4; 1% instances), advmod (2; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), dislocated (2; 0% instances), acl (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (271; 54% instances), NUM (87; 17% instances), VERB (81; 16% instances), (26; 5% instances), ADJ (12; 2% instances), DET (7; 1% instances), PRON (5; 1% instances), PROPN (4; 1% instances), ADV (2; 0% instances), AUX (2; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

346 (69%) NUM nodes are leaves.

86 (17%) NUM nodes have one child.

37 (7%) NUM nodes have two children.

30 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 23 different relations: flat (54; 19% instances), advmod (41; 15% instances), conj (40; 14% instances), case (32; 12% instances), punct (12; 4% instances), cc (11; 4% instances), cop (11; 4% instances), discourse (11; 4% instances), reparandum (10; 4% instances), nsubj (9; 3% instances), discourse:filler (7; 3% instances), nmod (7; 3% instances), parataxis (7; 3% instances), acl (4; 1% instances), amod (4; 1% instances), det (4; 1% instances), fixed (4; 1% instances), mark (3; 1% instances), nummod (2; 1% instances), parataxis:discourse (2; 1% instances), aux (1; 0% instances), cc:preconj (1; 0% instances), orphan (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: NUM (87; 31% instances), ADP (37; 13% instances), DET (26; 9% instances), ADV (20; 7% instances), PART (20; 7% instances), NOUN (16; 6% instances), AUX (12; 4% instances), PUNCT (12; 4% instances), VERB (11; 4% instances), INTJ (10; 4% instances), ADJ (8; 3% instances), CCONJ (7; 3% instances), X (5; 2% instances), PROPN (3; 1% instances), SCONJ (3; 1% instances), PRON (1; 0% instances)