Statistics of NUM in UD


This page pertains to UD version 2.


Treebank Statistics: UD_Slovenian-SST: POS Tags: `NUM`

There are 77 NUM lemmas (1%), 123 NUM types (1%) and 1187 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: en, dva, trije, pet, tisoč, dvajset, štirje, deset, trideset, sedem

The 10 most frequent NUM types: en, dva, eno, ena, tri, tisoč, pet, dve, dvajset, enega

The most frequent ambiguous lemmas (only 2 are listed): pet (NUM 53, ADJ 2), dvajseti (ADJ 4, NUM 1)

The most frequent ambiguous types (only 5 are listed): ene (NUM 22, ADV 13), sto (NUM 21, X 1), tridesetih (NUM 3, ADJ 1), een (INTJ 2, NUM 1), osemdesetih (ADJ 2, NUM 1)

ene
- NUM 22: ja teh je bilo približno ko se je ura premaknila tisto ko ene ure ni bilo
- ADV 13: je ene štir- ene š- štirideset
sto
- NUM 21: on ima približno sto ljudi v veleposlaništvih po Evropi
- X 1: aja zdaj sem jaz na kratko že opisal zgodbo tega filma e pravzaprav v vsaki v vsakem filmu je ena zgodba mogoče opišem zgodbo e Kraljeva vrnitev kjer na prestol že več sto leti ni sedel pravi kralj e na prestolu vladajo sicer ljudje kot nekaj namestniki kralja kralj ki je po krvni liniji kralj pa se boji tega e svojega nasledstva ker je njegov prednik Ilzidur e bil e pravzaprav podvržen e slabosti e gospodarja prstana se pravi tega prstana
tridesetih
- NUM 3: e v tridesetih letih od sprejetja Konvencije o otrokovih pravicah je le-ta e pomembno vplivala in pomagala položaju otrok po svetu
- ADJ 1: je pa tudi posebna moda pač glih ta moda iz dvajsetih tridesetih let prejšnjega stoletja pa te polkadot pač tele pike po oblekah ali pa resice
een
- INTJ 2: eem een ker je glagol v osebni obliki ne
- NUM 1: ga zaboli ti- een tisti
osemdesetih
- ADJ 2: tole so Mister Mister Broken wings bili so to časi osemdesetih let ko se je pač tale pesem držala pa še vedno se
- NUM 1: v nadaljevanju pesem iz osemdesetih Belinda Karlyle Heaven is a place on earth pa Star čebelji pregovor tudi sledi

Morphology

The form / lemma ratio of NUM is 1.597403 (the average of all parts of speech is 1.748943).
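The reported ratio is consistent with dividing the number of NUM types (123) by the number of NUM lemmas (77) given at the top of this page. A minimal sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical and is not part of any UD tool.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Distinct word forms (types) divided by distinct lemmas."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas

# Counts taken from this page: 123 NUM types, 77 NUM lemmas.
num_ratio = form_lemma_ratio(123, 77)
print(f"{num_ratio:.6f}")  # 1.597403
```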

The 1st highest number of forms (11) was observed with the lemma “en”: een, en, ena, ene, enega, enem, enemu, eni, enih, enim, eno.

The 2nd highest number of forms (4) was observed with the lemma “dva”: dva, dve, dveh, dvema.

The 3rd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.

NUM occurs with 5 features: Case (1187; 100% instances), NumType (1187; 100% instances), Number (1187; 100% instances), NumForm (1186; 100% instances), Gender (635; 53% instances)

NUM occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Word, NumType=Card, NumType=Ord, NumType=Sets, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 54 feature combinations. The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card (317 tokens). Examples: pet, dvajset, tisoč, trideset, deset, petnajst, sto, petdeset, sedem, tristo
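A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as stored in the FEATS column of a CoNLL-U file. The following sketch splits such a string into a dictionary; `parse_feats` is a hypothetical helper written for illustration, not the code that generated these statistics.

```python
def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if feats == "_":  # CoNLL-U uses "_" for an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

most_frequent = parse_feats("Case=Acc|Number=Plur|NumForm=Word|NumType=Card")
print(most_frequent["NumType"])  # Card
```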

Relations

NUM nodes are attached to their parents using 20 different relations: nummod (713; 60% instances), flat (144; 12% instances), obl (73; 6% instances), conj (64; 5% instances), root (48; 4% instances), nsubj (47; 4% instances), obj (29; 2% instances), parataxis (19; 2% instances), reparandum (14; 1% instances), appos (8; 1% instances), nmod (7; 1% instances), fixed (4; 0% instances), orphan (4; 0% instances), amod (3; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), discourse:filler (1; 0% instances), dislocated (1; 0% instances), xcomp (1; 0% instances)
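Counts like these can be obtained by tallying the DEPREL column for every token whose UPOS is NUM. The sketch below does this for a CoNLL-U string. The sample sentence borrows words from an example on this page, but its annotation is hand-written for illustration and is not copied from the treebank; `num_relation_counts` is a hypothetical name.

```python
from collections import Counter

# Illustrative fragment only: the heads and relations below are assumptions.
SAMPLE = "\n".join([
    "# text = on ima približno sto ljudi",
    "1\ton\ton\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tima\timeti\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tpribližno\tpribližno\tADV\t_\t_\t4\tadvmod\t_\t_",
    "4\tsto\tsto\tNUM\t_\t_\t5\tnummod\t_\t_",
    "5\tljudi\tčlovek\tNOUN\t_\t_\t2\tobj\t_\t_",
])

def num_relation_counts(conllu_text: str) -> Counter:
    """Count the relations by which NUM tokens attach to their parents."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == "NUM":  # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts

print(num_relation_counts(SAMPLE))
```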

Parents of NUM nodes belong to 12 different parts of speech (counting the case where the NUM node has no parent): NOUN (710; 60% instances), NUM (197; 17% instances), VERB (157; 13% instances), no parent, i.e. the NUM node is the root (48; 4% instances), ADJ (33; 3% instances), PROPN (13; 1% instances), DET (10; 1% instances), PRON (8; 1% instances), X (6; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)

833 (70%) NUM nodes are leaves.

206 (17%) NUM nodes have one child.

93 (8%) NUM nodes have two children.

55 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.
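The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes it for NUM nodes in a toy sentence and then checks that the counts reported above add up to the 1187 NUM tokens and round to the stated percentages. The toy analysis and the name `num_child_degrees` are assumptions made for illustration.

```python
from collections import Counter

def num_child_degrees(rows):
    """rows: (id, upos, head) triples of one sentence; child counts of NUM nodes."""
    children = Counter(head for _id, _upos, head in rows)
    return [children[node_id] for node_id, upos, _head in rows if upos == "NUM"]

# Hand-annotated toy sentence (hypothetical): the NUM token 4 has one child, token 3.
toy = [(1, "PRON", 2), (2, "VERB", 0), (3, "ADV", 4), (4, "NUM", 5), (5, "NOUN", 2)]
toy_degrees = num_child_degrees(toy)

# Counts reported on this page, keyed by number of children.
reported = {"0": 833, "1": 206, "2": 93, "3+": 55}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(toy_degrees, total, shares)
```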

Children of NUM nodes are attached using 25 different relations: flat (142; 23% instances), advmod (122; 20% instances), conj (72; 12% instances), case (63; 10% instances), cc (26; 4% instances), nmod (23; 4% instances), discourse (21; 3% instances), cop (20; 3% instances), nsubj (15; 2% instances), parataxis (15; 2% instances), reparandum (14; 2% instances), amod (12; 2% instances), det (11; 2% instances), orphan (11; 2% instances), discourse:filler (9; 1% instances), acl (7; 1% instances), mark (6; 1% instances), fixed (5; 1% instances), aux (3; 0% instances), nummod (3; 0% instances), appos (2; 0% instances), parataxis:discourse (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NUM (197; 32% instances), ADP (70; 12% instances), PART (69; 11% instances), ADV (56; 9% instances), DET (50; 8% instances), NOUN (38; 6% instances), ADJ (24; 4% instances), AUX (23; 4% instances), VERB (21; 3% instances), CCONJ (20; 3% instances), INTJ (13; 2% instances), X (12; 2% instances), SCONJ (6; 1% instances), PRON (4; 1% instances), PROPN (4; 1% instances)
