Statistics of X in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Slovenian-SST: POS Tags: `X`

There are 429 X lemmas (5%), 441 X types (3%) and 906 X tokens (1%). Out of 15 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: s-, z-, n-, j-, k-, p-, m-, t-, b-, po-

The 10 most frequent X types: s-, z-, n-, j-, k-, p-, m-, t-, b-, po-

The 10 most frequent ambiguous lemmas: on (PRON 753, X 5), ti (PRON 366, X 4, INTJ 1), ka (SCONJ 40, X 2, ADV 1), para (X 2, NOUN 1), što (PRON 1, X 1), a (PART 143, ADV 137, INTJ 21, CCONJ 9, NOUN 8, X 1), d (NOUN 1, X 1), da (SCONJ 1527, PART 18, X 1), i (NOUN 1, PART 1, X 1), in (CCONJ 1473, ADV 1, X 1)

The 10 most frequent ambiguous types: on (PRON 49, X 5), ti (PRON 182, DET 43, X 4, INTJ 1), ka (SCONJ 40, X 3, ADV 1), je (AUX 1895, VERB 654, PRON 11, INTJ 4, X 2), para (X 2, NOUN 1), što (PRON 1, X 1), a (PART 143, ADV 137, INTJ 21, CCONJ 9, NOUN 8, X 1), bi (AUX 399, VERB 29, X 1), d (NOUN 1, X 1), da (SCONJ 1526, VERB 42, PART 18, X 1)

on
- PRON 49: ne ker on tukaj ni uporabljal gradbeni material on si tega ni
- X 5: ja pa Rock on net
ti
- PRON 182: ti si napisal te preobleke kdo pa še stoji na odru znana imena
- DET 43: eee kaj več v bistvu tukaj res eee ti starši ne morejo narediti
- X 4: in se smejem kot matasta tipo ki že costi ti ga bi bujeri ser
- INTJ 1: pa če noče zaspati ko je še mala pa ji špilaš pred posteljico veš tako delaš ti di di pa še igraš
ka
- SCONJ 40: ka so stopnjo više kakor ti
- X 3: ta človek se tako pisal ka to fi
- ADV 1: ka to je zato ker morajo te avtorske pravice redno mislim pa- pač plačevati pa to
je
- AUX 1895: v principu zelo mi je bila všeč nadaljevanka Igra prestolov
- VERB 654: e kateri izletniški kraj vam je všeč in zakaj
- PRON 11: seveda je niste
- INTJ 4: je ene štir- ene š- štirideset
- X 2: glavna znamenitost je Arboretuma Volčji Potok so prav gotovo otoki cvetic ki jih lahko najdemo vsepovsod po parku
para
- X 2: ki je bila tam nekaj pri [name:personal] ali nekaj takega ma para da je ona
- NOUN 1: aha okej eee oboje oboje bi vzel po dva para
što
- PRON 1: pa to za vajudva tudi pa što tako koga po- eee me pozna pa to
- X 1: in jaz odhajam samo mimo ne gledam što je ampak sem samo šla naprej zaenkrat skrči Olga Joža Ha ja od kod pa ti
a
- PART 143: em Zdaj mene zanima a ste vsaj vi ti štirje pravniki to zahtevo prebrali
- ADV 137: in to je tisto kar otrok tudi rabi a ne
- INTJ 21: a
- CCONJ 9: a oba se izražava knjižno
- NOUN 8: da je to mislim a
- X 1: v nadaljevanju pesem iz osemdesetih Belinda Karlyle Heaven is a place on earth pa Star čebelji pregovor tudi sledi
bi
- AUX 399: e z e omenil sem več različnih eem eee bi rekel celo področje
- VERB 29: ker eee je pol kontra efekt ne namesto da bi ne pa ne če razumete
- X 1: in se smejem kot matasta tipo ki že costi ti ga bi bujeri ser
d
- NOUN 1: brez drugega dramaturga tako da pozorni še posebej bodite na [name:personal] ker bo drugi del v d dramaturga v smislu da te stvari kar bomo eee poskušali v likih ustvarjati bomo t- skozi govor in bo [name:personal] mogla malo večkrat posr- pomagati na tej poti da to dejansko dobimo
- X 1: tako da bolj novic ne zasle- nisem nekaj zasledil razen da bomo mogli se mislim d do naslednjega meseca tam do dvajsetega ali sedemindvajsetega eem bo še zastonj eee testiranje potem bo plačljivo se mi zdi štirinajst evrov na oseminštirideset ur kar zame kot ke sem zaposlen v hotelu mi glih ni do tega da bi vsakih oseminštirideset ur plačeval štirinajst evrov samo zato da lahko delam
da
- SCONJ 1526: Zdi se mi da se vsi Slovenci najdemo v tej zgodbi
- VERB 42: v bistvu se vegani temu izogibajo kolikor se le da recimo
- PART 18: da ali ne
- X 1: genau wo ist den das ach so da

Morphology

The form / lemma ratio of X is 1.027972 (the average of all parts of speech is 1.748943).

The 1st highest number of forms (14) was observed with the lemma “_”: ci, dej, di, je, ka, ki, kompliciraš, leti, pa, smo, ste, sto, vedno, če.

The 2nd highest number of forms (1) was observed with the lemma “Abicanti”: Abicanti.

The 3rd highest number of forms (1) was observed with the lemma “B-”: B-.

X occurs with 3 features: Foreign (161; 18% instances), Typo (5; 1% instances), Abbr (3; 0% instances)

X occurs with 3 feature-value pairs: Abbr=Yes, Foreign=Yes, Typo=Yes

X occurs with 4 feature combinations. The most frequent feature combination is _ (737 tokens). Examples: s-, n-, z-, j-, k-, p-, m-, t-, b-, po-

Relations

X nodes are attached to their parents using 26 different relations: reparandum (532; 59% instances), orphan (135; 15% instances), flat:foreign (82; 9% instances), root (29; 3% instances), nmod (18; 2% instances), obj (13; 1% instances), conj (12; 1% instances), goeswith (11; 1% instances), flat (9; 1% instances), fixed (7; 1% instances), parataxis (7; 1% instances), advmod (6; 1% instances), amod (6; 1% instances), dep (6; 1% instances), appos (5; 1% instances), obl (5; 1% instances), nsubj (4; 0% instances), case (3; 0% instances), mark (3; 0% instances), parataxis:restart (3; 0% instances), advcl (2; 0% instances), cc (2; 0% instances), discourse (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 16 different parts of speech: VERB (223; 25% instances), NOUN (180; 20% instances), X (106; 12% instances), ADJ (71; 8% instances), ADV (56; 6% instances), DET (50; 6% instances), PROPN (36; 4% instances), PRON (35; 4% instances), PART (33; 4% instances), (29; 3% instances), AUX (24; 3% instances), CCONJ (18; 2% instances), SCONJ (18; 2% instances), ADP (13; 1% instances), NUM (12; 1% instances), INTJ (2; 0% instances)

749 (83%) X nodes are leaves.

84 (9%) X nodes have one child.

28 (3%) X nodes have two children.

45 (5%) X nodes have three or more children.

The highest child degree of a X node is 8.

Children of X nodes are attached using 29 different relations: flat:foreign (87; 26% instances), case (39; 12% instances), advmod (29; 9% instances), cc (16; 5% instances), det (12; 4% instances), discourse (12; 4% instances), flat (12; 4% instances), nsubj (12; 4% instances), parataxis (11; 3% instances), mark (10; 3% instances), reparandum (10; 3% instances), cop (9; 3% instances), obj (9; 3% instances), orphan (8; 2% instances), aux (7; 2% instances), discourse:filler (7; 2% instances), fixed (7; 2% instances), acl (5; 2% instances), conj (5; 2% instances), goeswith (5; 2% instances), nmod (4; 1% instances), parataxis:discourse (3; 1% instances), amod (2; 1% instances), appos (2; 1% instances), expl (2; 1% instances), nummod (2; 1% instances), obl (1; 0% instances), parataxis:restart (1; 0% instances), vocative (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (106; 32% instances), ADP (40; 12% instances), DET (26; 8% instances), PART (22; 7% instances), CCONJ (17; 5% instances), VERB (17; 5% instances), ADV (16; 5% instances), AUX (16; 5% instances), NOUN (16; 5% instances), PRON (15; 5% instances), SCONJ (11; 3% instances), PROPN (10; 3% instances), INTJ (8; 2% instances), NUM (6; 2% instances), ADJ (4; 1% instances)

Treebank Statistics: UD_Slovenian-SST: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_Slovenian-SST: POS Tags: `X`