This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SSJ: POS Tags: X

There are 170 X lemmas (1%), 171 X types (1%) and 348 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: dr., oz., t., d., sv., de, P., of, i., the

The 10 most frequent X types: dr., oz., t., d., sv., de, P., of, i., the

The 10 most frequent ambiguous lemmas: V. (X 3, NUM 1), da (SCONJ 1776, PART 9, X 2), les (NOUN 9, X 2), a (CCONJ 96, ADV 2, X 1), del (NOUN 120, X 1), do (ADP 354, X 1), in (CCONJ 3253, ADV 4, X 1), life (NOUN 1, X 1), on (PRON 1560, X 1), pa (CCONJ 957, X 1)

The 10 most frequent ambiguous types: de (X 9, VERB 1), sta (AUX 196, VERB 6, X 4), V. (X 3, NUM 1), Les (PROPN 2, X 2), da (SCONJ 1730, VERB 9, X 2, PART 1), mu (PRON 159, X 2), A (CCONJ 31, NOUN 7, ADV 1, X 1), Art (PROPN 1, X 1), Life (NOUN 1, X 1), National (PROPN 2, X 1)

Morphology

The form / lemma ratio of X is 1.005882 (the average of all parts of speech is 1.892155).
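The ratio above is consistent with dividing the number of X types by the number of X lemmas reported earlier on this page (171 / 170). The following is a minimal sketch of that arithmetic, assuming this is how the figure is derived; the function name form_lemma_ratio is hypothetical and not part of any UD tool.

```python
def form_lemma_ratio(num_types: int, num_lemmas: int) -> float:
    """Average number of distinct word forms (types) per lemma."""
    if num_lemmas <= 0:
        raise ValueError("num_lemmas must be positive")
    return num_types / num_lemmas


# Counts reported on this page for the X tag (171 types, 170 lemmas).
ratio = form_lemma_ratio(171, 170)
print(f"{ratio:.6f}")  # 1.005882
```

A value this close to 1 means that almost every X lemma occurs in a single form, which matches the list that follows: only "european" has two forms.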

The 1st highest number of forms (2) was observed with the lemma “european”: EUROPEAN, European.

The 2nd highest number of forms (1) was observed with the lemma “A.”: A..

The 3rd highest number of forms (1) was observed with the lemma “B.”: B..

X occurs with 2 features: Abbr (164; 47% instances), Foreign (121; 35% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Abbr=Yes (164 tokens). Examples: dr., oz., t., d., sv., P., i., M., j., o.
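Counts like the ones above can be gathered from the FEATS column of a CoNLL-U file. The sketch below is illustrative only: it assumes the FEATS values of the X tokens have already been extracted into a list of strings, it treats the empty value "_" as a feature combination of its own (an assumption about how combinations are counted), and feature_stats is a hypothetical helper name. The sample data is a toy input, not taken from the treebank.

```python
from collections import Counter


def feature_stats(feats_column):
    """Count feature names and full feature combinations.

    Each item is a CoNLL-U FEATS value such as "Abbr=Yes",
    "Abbr=Yes|Foreign=Yes", or "_" (no features).
    """
    features = Counter()
    combinations = Counter()
    for feats in feats_column:
        combinations[feats] += 1  # the whole string is one combination
        if feats == "_":
            continue  # no individual features to count
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
    return features, combinations


# Toy input (illustrative, not real treebank data).
sample = ["Abbr=Yes", "Abbr=Yes", "Foreign=Yes", "_"]
features, combinations = feature_stats(sample)
for name, count in features.most_common():
    share = 100 * count / len(sample)
    print(f"{name} ({count}; {share:.0f}% instances)")
```

Applied to the figures on this page, the same percentage arithmetic gives 164 / 348 ≈ 47% for Abbr and 121 / 348 ≈ 35% for Foreign.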

Relations

X nodes are attached to their parents using 13 different relations: nmod (122; 35% instances), root (58; 17% instances), flat:foreign (55; 16% instances), fixed (21; 6% instances), amod (17; 5% instances), cc (14; 4% instances), nsubj (12; 3% instances), appos (9; 3% instances), flat (9; 3% instances), conj (8; 2% instances), dep (8; 2% instances), obl (8; 2% instances), obj (7; 2% instances)

Parents of X nodes belong to 10 different parts of speech: X (116; 33% instances), PROPN (78; 22% instances), no parent, i.e. the X node is the root (58; 17% instances), NOUN (54; 16% instances), VERB (33; 9% instances), ADJ (4; 1% instances), NUM (2; 1% instances), DET (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)

217 (62%) X nodes are leaves.

45 (13%) X nodes have one child.

56 (16%) X nodes have two children.

30 (9%) X nodes have three or more children.

The highest child degree of an X node is 8.
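The four child-count figures above sum to 348 (217 + 45 + 56 + 30), the total number of X tokens. A distribution of this kind can be derived from the ID, UPOS and HEAD columns of each sentence. The sketch below assumes those columns are already available as (id, upos, head) tuples with head 0 marking the root; child_degree_histogram is a hypothetical helper, and the sample sentence is a toy input rather than treebank data.

```python
from collections import Counter


def child_degree_histogram(tokens, upos="X"):
    """Map child count -> number of nodes with the given UPOS tag.

    tokens: list of (id, upos, head) tuples for one sentence;
    head 0 means the token is attached to the artificial root.
    """
    # Number of children per node id, counted via the HEAD column.
    children = Counter(head for _id, _upos, head in tokens)
    # A Counter returns 0 for ids that never occur as a head (leaves).
    return Counter(
        children[tok_id]
        for tok_id, tok_upos, _head in tokens
        if tok_upos == upos
    )


# Toy sentence: token 1 is the root and governs tokens 2, 3 and 4.
sentence = [(1, "X", 0), (2, "X", 1), (3, "X", 1), (4, "PUNCT", 1)]
histogram = child_degree_histogram(sentence)
print("leaves:", histogram[0])
print("highest child degree:", max(histogram))
```

For a whole treebank, the per-sentence histograms would be summed; keeping the counts per sentence avoids confusing token IDs that restart at 1 in every sentence.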

Children of X nodes are attached using 12 different relations: punct (118; 43% instances), flat:foreign (55; 20% instances), nmod (40; 15% instances), fixed (20; 7% instances), amod (9; 3% instances), case (8; 3% instances), conj (8; 3% instances), flat (7; 3% instances), appos (3; 1% instances), cc (3; 1% instances), acl (2; 1% instances), nummod (2; 1% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (118; 43% instances), X (116; 42% instances), ADJ (10; 4% instances), PROPN (9; 3% instances), ADP (6; 2% instances), NOUN (6; 2% instances), CCONJ (3; 1% instances), NUM (2; 1% instances), SCONJ (2; 1% instances), VERB (2; 1% instances), DET (1; 0% instances)