home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovak-SNK: POS Tags: PART

There are 146 PART lemmas (1%), 144 PART types (1%) and 1908 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 8 in number of lemmas, 9 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: aj, však, nie, len, už, až, a, iba, ani, ešte

The 10 most frequent PART types: aj, však, nie, len, už, až, a, iba, ani, ešte

The 10 most frequent ambiguous lemmas: aj (PART 227, CCONJ 101), však (PART 124, CCONJ 54), len (PART 95, CCONJ 19), (PART 95, ADV 52), (PART 92, SCONJ 9), a (CCONJ 2507, PART 89, PROPN 9, X 6), ani (PART 68, CCONJ 61), ešte (PART 63, ADV 19), tak (ADV 79, PART 47, CCONJ 23, X 1), možno (PART 35, ADV 4)

The 10 most frequent ambiguous types: aj (PART 200, CCONJ 87), však (PART 122, CCONJ 54), len (PART 88, CCONJ 8), (PART 69, ADV 39), (PART 83, SCONJ 9), a (CCONJ 2379, X 3, PART 1, PRON 1), ani (CCONJ 52, PART 52), ešte (PART 53, ADV 16), tak (ADV 68, CCONJ 23, PART 19, X 1), možno (PART 6, ADV 3)

Morphology

The form / lemma ratio of PART is 0.986301 (the average of all parts of speech is 1.802691).

The 1st highest number of forms (2) was observed with the lemma “tuším”: TUŠÍM, Tuším.

The 2nd highest number of forms (2) was observed with the lemma “áno”: Ano, áno.

The 3rd highest number of forms (1) was observed with the lemma “I”: I.

PART occurs with 1 features: Typo (1; 0% instances)

PART occurs with 1 feature-value pairs: Typo=Yes

PART occurs with 2 feature combinations. The most frequent feature combination is _ (1907 tokens). Examples: aj, však, nie, len, už, až, a, iba, ani, ešte

Relations

PART nodes are attached to their parents using 15 different relations: advmod:emph (863; 45% instances), advmod (763; 40% instances), cc (97; 5% instances), root (75; 4% instances), dep (67; 4% instances), mark (14; 1% instances), conj (12; 1% instances), fixed (4; 0% instances), orphan (4; 0% instances), discourse (3; 0% instances), parataxis (2; 0% instances), case (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)

Parents of PART nodes belong to 13 different parts of speech: VERB (777; 41% instances), NOUN (452; 24% instances), ADJ (162; 8% instances), ADV (159; 8% instances), (75; 4% instances), NUM (73; 4% instances), PROPN (65; 3% instances), PRON (63; 3% instances), DET (45; 2% instances), PART (27; 1% instances), X (5; 0% instances), CCONJ (4; 0% instances), INTJ (1; 0% instances)

1757 (92%) PART nodes are leaves.

63 (3%) PART nodes have one child.

59 (3%) PART nodes have two children.

29 (2%) PART nodes have three or more children.

The highest child degree of a PART node is 6.

Children of PART nodes are attached using 10 different relations: punct (204; 73% instances), dep (30; 11% instances), conj (19; 7% instances), cc (11; 4% instances), advmod:emph (8; 3% instances), orphan (4; 1% instances), xcomp (2; 1% instances), cop (1; 0% instances), csubj (1; 0% instances), obj (1; 0% instances)

Children of PART nodes belong to 10 different parts of speech: PUNCT (204; 73% instances), PART (27; 10% instances), VERB (20; 7% instances), ADV (8; 3% instances), NOUN (7; 2% instances), CCONJ (6; 2% instances), PRON (3; 1% instances), AUX (2; 1% instances), DET (2; 1% instances), PROPN (2; 1% instances)