home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM

There are 77 NUM lemmas (1%), 123 NUM types (1%) and 1187 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: en, dva, trije, pet, tisoč, dvajset, štirje, deset, trideset, sedem

The 10 most frequent NUM types: en, dva, eno, ena, tri, tisoč, pet, dve, dvajset, enega

The 10 most frequent ambiguous lemmas: pet (NUM 53, ADJ 2), dvajseti (ADJ 4, NUM 1)

The 10 most frequent ambiguous types: ene (NUM 22, ADV 13), sto (NUM 21, X 1), tridesetih (NUM 3, ADJ 1), een (INTJ 2, NUM 1), osemdesetih (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.597403 (the average of all parts of speech is 1.748943).

The 1st highest number of forms (11) was observed with the lemma “en”: een, en, ena, ene, enega, enem, enemu, eni, enih, enim, eno.

The 2nd highest number of forms (4) was observed with the lemma “dva”: dva, dve, dveh, dvema.

The 3rd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.

NUM occurs with 5 features: Case (1187; 100% instances), NumType (1187; 100% instances), Number (1187; 100% instances), NumForm (1186; 100% instances), Gender (635; 53% instances)

NUM occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Word, NumType=Card, NumType=Ord, NumType=Sets, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 54 feature combinations. The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card (317 tokens). Examples: pet, dvajset, tisoč, trideset, deset, petnajst, sto, petdeset, sedem, tristo

Relations

NUM nodes are attached to their parents using 20 different relations: nummod (713; 60% instances), flat (144; 12% instances), obl (73; 6% instances), conj (64; 5% instances), root (48; 4% instances), nsubj (47; 4% instances), obj (29; 2% instances), parataxis (19; 2% instances), reparandum (14; 1% instances), appos (8; 1% instances), nmod (7; 1% instances), fixed (4; 0% instances), orphan (4; 0% instances), amod (3; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), discourse:filler (1; 0% instances), dislocated (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (710; 60% instances), NUM (197; 17% instances), VERB (157; 13% instances), (48; 4% instances), ADJ (33; 3% instances), PROPN (13; 1% instances), DET (10; 1% instances), PRON (8; 1% instances), X (6; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)

833 (70%) NUM nodes are leaves.

206 (17%) NUM nodes have one child.

93 (8%) NUM nodes have two children.

55 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 25 different relations: flat (142; 23% instances), advmod (122; 20% instances), conj (72; 12% instances), case (63; 10% instances), cc (26; 4% instances), nmod (23; 4% instances), discourse (21; 3% instances), cop (20; 3% instances), nsubj (15; 2% instances), parataxis (15; 2% instances), reparandum (14; 2% instances), amod (12; 2% instances), det (11; 2% instances), orphan (11; 2% instances), discourse:filler (9; 1% instances), acl (7; 1% instances), mark (6; 1% instances), fixed (5; 1% instances), aux (3; 0% instances), nummod (3; 0% instances), appos (2; 0% instances), parataxis:discourse (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NUM (197; 32% instances), ADP (70; 12% instances), PART (69; 11% instances), ADV (56; 9% instances), DET (50; 8% instances), NOUN (38; 6% instances), ADJ (24; 4% instances), AUX (23; 4% instances), VERB (21; 3% instances), CCONJ (20; 3% instances), INTJ (13; 2% instances), X (12; 2% instances), SCONJ (6; 1% instances), PRON (4; 1% instances), PROPN (4; 1% instances)