
This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: PART

There are 46 PART lemmas (1%), 46 PART types (1%) and 2678 PART tokens (9%). Out of 16 observed tags, the rank of PART is: 10 in number of lemmas, 11 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: ne, ja, tudi, še, že, no, samo, pač, seveda, naj

The 10 most frequent PART types: ne, ja, tudi, še, že, no, samo, pač, seveda, naj

The most frequent ambiguous lemmas (only 7 are listed): ne (PART 787, CCONJ 2), prav (PART 18, ADV 8), da (SCONJ 533, PART 16, X 1), več (DET 27, PART 16), sicer (CCONJ 15, PART 14), ravno (PART 4, ADV 1), edino (ADV 1, PART 1)

The 10 most frequent ambiguous types: ne (PART 787, CCONJ 2), še (PART 203, X 1), samo (PART 113, ADJ 1), naj (PART 27, X 1), ma (PART 19, X 2), prav (PART 18, ADV 8, VERB 1), da (SCONJ 533, PART 16, VERB 15, X 1), več (DET 27, PART 16), sicer (CCONJ 15, PART 14), celo (PART 7, ADJ 6)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.570645).
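As a minimal sketch of how such a ratio could be computed, the snippet below assumes the ratio is the number of distinct word forms divided by the number of distinct lemmas for a given tag, which is consistent with the 46 PART types and 46 PART lemmas reported above. The function name `form_lemma_ratio`, the lowercasing of forms and the toy rows are illustrative assumptions, not the actual code behind this page.

```python
from collections import defaultdict

def form_lemma_ratio(rows, upos):
    """Distinct forms per distinct lemma for one UPOS tag.

    rows: iterable of (form, lemma, upos) tuples.
    Forms are lowercased before counting (an assumption of this sketch).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in rows:
        if tag == upos:
            forms_by_lemma[lemma].add(form.lower())
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)

# Toy rows for illustration only; not taken from the treebank files.
sample = [
    ("ne", "ne", "PART"),
    ("ja", "ja", "PART"),
    ("tudi", "tudi", "PART"),
    ("je", "biti", "AUX"),
    ("sem", "biti", "AUX"),
]

ratio_part = form_lemma_ratio(sample, "PART")  # each lemma has one form
ratio_aux = form_lemma_ratio(sample, "AUX")    # one lemma with two forms
```

A ratio of exactly 1 means that no lemma of that tag appears in more than one form, which is the expected situation for an uninflected class such as PART.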

No PART lemma has more than one form. The first three lemmas listed, each with a single form, are “alora” (alora), “arki” (arki) and “baje” (baje).

PART occurs with 1 feature: Polarity (292; 11% instances)

PART occurs with 1 feature-value pair: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (2386 tokens). Examples: ja, ne, tudi, še, že, no, samo, pač, seveda, naj

Relations

PART nodes are attached to their parents using 16 different relations: advmod (1195; 45% instances), discourse (1149; 43% instances), root (193; 7% instances), fixed (41; 2% instances), cc (37; 1% instances), parataxis (20; 1% instances), conj (12; 0% instances), reparandum (9; 0% instances), cc:preconj (7; 0% instances), advcl (3; 0% instances), mark (3; 0% instances), orphan (3; 0% instances), ccomp (2; 0% instances), csubj (2; 0% instances), case (1; 0% instances), conj:extend (1; 0% instances)
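The counts and percentages in the relation list above can be checked with a short sketch. It assumes the percentages are each count divided by the 2678 PART tokens and rounded to the nearest integer; the variable names are illustrative.

```python
# Relation counts copied from the list above.
relation_counts = {
    "advmod": 1195, "discourse": 1149, "root": 193, "fixed": 41,
    "cc": 37, "parataxis": 20, "conj": 12, "reparandum": 9,
    "cc:preconj": 7, "advcl": 3, "mark": 3, "orphan": 3,
    "ccomp": 2, "csubj": 2, "case": 1, "conj:extend": 1,
}

# Every PART token has exactly one incoming relation,
# so the counts should add up to the number of PART tokens.
total = sum(relation_counts.values())

# Percentage of PART tokens per relation, rounded to the nearest integer
# (the rounding rule is an assumption of this sketch).
percentages = {
    rel: round(100 * count / total)
    for rel, count in relation_counts.items()
}

for rel, count in relation_counts.items():
    print(f"{rel}: {count} ({percentages[rel]}%)")
```

Under that assumption the rounded values agree with the figures listed, including the relations shown as 0%, which are rare rather than absent.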

Parents of PART nodes belong to 14 different parts of speech: VERB (1596; 60% instances), NOUN (265; 10% instances), no part of speech, i.e. the artificial root (193; 7% instances), ADJ (166; 6% instances), ADV (138; 5% instances), PART (107; 4% instances), DET (64; 2% instances), PROPN (38; 1% instances), PRON (36; 1% instances), INTJ (26; 1% instances), NUM (20; 1% instances), X (16; 1% instances), CCONJ (11; 0% instances), ADP (2; 0% instances)

2488 (93%) PART nodes are leaves.

98 (4%) PART nodes have one child.

56 (2%) PART nodes have two children.

36 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 7.
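The leaf and child-degree figures above could be derived from the HEAD column of a CoNLL-U file roughly as follows. This is a sketch under stated assumptions: the function `child_degree_histogram`, the helper `row` and the four-token sample sentence are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
from collections import Counter

def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes tagged `upos`."""
    hist = Counter()
    sentence = []  # (id, upos, head) for the current sentence

    def flush():
        # Count how many tokens point to each head id.
        children = Counter(head for _, _, head in sentence)
        for tok_id, tag, _ in sentence:
            if tag == upos:
                hist[children[tok_id]] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        line = line.strip()
        if not line:            # blank line ends a sentence
            flush()
            continue
        if line.startswith("#"):
            continue            # sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue            # skip multiword ranges and empty nodes
        # CoNLL-U columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
        sentence.append((cols[0], cols[3], cols[6]))
    flush()                     # last sentence may lack a trailing blank line
    return hist

def row(tok_id, form, upos, head, deprel):
    # Build one 10-column CoNLL-U line; the lemma is set equal to the form.
    return "\t".join([tok_id, form, form, upos, "_", "_", head, deprel, "_", "_"])

# Invented example sentence for illustration only.
sample = "\n".join([
    "# text = ne vem ja no",
    row("1", "ne", "PART", "2", "advmod"),
    row("2", "vem", "VERB", "0", "root"),
    row("3", "ja", "PART", "2", "discourse"),
    row("4", "no", "PART", "3", "discourse"),
]) + "\n"

hist = child_degree_histogram(sample, "PART")
leaves = hist[0]
max_degree = max(hist)
```

In the sample, two PART nodes are leaves and one has a single child, so the highest child degree is 1.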

Children of PART nodes are attached using 19 different relations: discourse (93; 27% instances), punct (82; 24% instances), fixed (41; 12% instances), advmod (26; 8% instances), reparandum (24; 7% instances), cc (19; 5% instances), discourse:filler (17; 5% instances), mark (8; 2% instances), parataxis (8; 2% instances), nmod (5; 1% instances), nsubj (5; 1% instances), obj (5; 1% instances), conj (4; 1% instances), expl (3; 1% instances), orphan (2; 1% instances), advcl (1; 0% instances), obl (1; 0% instances), parataxis:discourse (1; 0% instances), vocative (1; 0% instances)

Children of PART nodes belong to 14 different parts of speech: PART (107; 31% instances), PUNCT (82; 24% instances), ADV (31; 9% instances), INTJ (28; 8% instances), CCONJ (25; 7% instances), X (22; 6% instances), SCONJ (18; 5% instances), PRON (9; 3% instances), VERB (8; 2% instances), NOUN (6; 2% instances), DET (4; 1% instances), ADP (3; 1% instances), ADJ (2; 1% instances), PROPN (1; 0% instances)