home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: X

There are 687 X lemmas (24%), 687 X types (14%) and 2140 X tokens (6%). Out of 15 observed tags, the rank of X is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent X lemmas: a), b), c), a, d), z, e), hospodaření, f), pohledávky

The 10 most frequent X types: a), b), c), a, d), z, e), hospodaření, f), pohledávky

The 10 most frequent ambiguous lemmas: a (CCONJ 1241, X 40), z (ADP 203, X 29), hospodaření (NOUN 30, X 27), výsledek (NOUN 33, X 15), za (ADP 190, X 21), účetní (ADJ 1467, NOUN 22, X 18), dlouhodobý (ADJ 81, X 12), finanční (ADJ 94, X 12), majetek (NOUN 309, X 17), na (ADP 341, X 17)

The 10 most frequent ambiguous types: a (CCONJ 1230, X 40), z (ADP 165, X 29), hospodaření (NOUN 30, X 27), pohledávky (NOUN 27, X 7), náklady (NOUN 61, X 15), výsledek (X 15, NOUN 8), za (ADP 173, X 21), závazky (NOUN 60, X 4), účetní (ADJ 873, NOUN 21, X 18), dlouhodobý (ADJ 14, X 12)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.725132).

The 1st highest number of forms (1) was observed with the lemma “***Výsledek”: ***Výsledek.

The 2nd highest number of forms (1) was observed with the lemma “+”: +.

The 3rd highest number of forms (1) was observed with the lemma “1.ledna”: 1.ledna.

X does not occur with any features.

Relations

X nodes are attached to their parents using 9 different relations: nmod (1666; 78% instances), obl (146; 7% instances), root (130; 6% instances), conj (124; 6% instances), dep (69; 3% instances), obj (2; 0% instances), appos (1; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances)

Parents of X nodes belong to 8 different parts of speech: X (1009; 47% instances), NOUN (722; 34% instances), VERB (160; 7% instances), (130; 6% instances), ADJ (83; 4% instances), NUM (33; 2% instances), ADV (2; 0% instances), ADP (1; 0% instances)

1556 (73%) X nodes are leaves.

213 (10%) X nodes have one child.

52 (2%) X nodes have two children.

319 (15%) X nodes have three or more children.

The highest child degree of a X node is 16.

Children of X nodes are attached using 22 different relations: nmod (928; 43% instances), punct (631; 29% instances), case (235; 11% instances), conj (170; 8% instances), cc (55; 3% instances), obl (39; 2% instances), advmod:emph (28; 1% instances), obl:arg (21; 1% instances), nsubj (20; 1% instances), obj (8; 0% instances), dep (6; 0% instances), advcl (4; 0% instances), advmod (4; 0% instances), mark (4; 0% instances), expl:pv (3; 0% instances), orphan (3; 0% instances), xcomp (3; 0% instances), csubj (2; 0% instances), amod (1; 0% instances), appos (1; 0% instances), det (1; 0% instances), nummod (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: X (1009; 47% instances), PUNCT (631; 29% instances), ADP (235; 11% instances), NOUN (170; 8% instances), CCONJ (45; 2% instances), PART (38; 2% instances), VERB (10; 0% instances), PRON (8; 0% instances), ADV (6; 0% instances), NUM (6; 0% instances), SCONJ (5; 0% instances), ADJ (3; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)