home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: X

There are 687 X lemmas (24%), 687 X types (14%) and 2120 X tokens (6%). Out of 15 observed tags, the rank of X is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent X lemmas: a), b), c), a, d), z, hospodaření, e), pohledávky, f)

The 10 most frequent X types: a), b), c), a, d), z, hospodaření, e), pohledávky, f)

The 10 most frequent ambiguous lemmas: a (CCONJ 1215, X 40), z (ADP 200, X 29), hospodaření (NOUN 30, X 27), výsledek (NOUN 30, X 15), za (ADP 186, X 21), účetní (ADJ 1451, NOUN 21, X 18), dlouhodobý (ADJ 80, X 12), finanční (ADJ 91, X 12), majetek (NOUN 303, X 17), na (ADP 329, X 17)

The 10 most frequent ambiguous types: a (CCONJ 1204, X 40), z (ADP 162, X 29), hospodaření (NOUN 30, X 27), pohledávky (NOUN 27, X 7), náklady (NOUN 58, X 15), výsledek (X 15, NOUN 8), za (ADP 169, X 21), závazky (NOUN 59, X 4), účetní (ADJ 860, NOUN 20, X 18), dlouhodobý (ADJ 14, X 12)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.713272).

The 1st highest number of forms (1) was observed with the lemma “***Výsledek”: ***Výsledek.

The 2nd highest number of forms (1) was observed with the lemma “+”: +.

The 3rd highest number of forms (1) was observed with the lemma “1.ledna”: 1.ledna.

X does not occur with any features.

Relations

X nodes are attached to their parents using 9 different relations: nmod (1317; 62% instances), obl (503; 24% instances), root (130; 6% instances), conj (96; 5% instances), dep (69; 3% instances), obj (2; 0% instances), appos (1; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances)

Parents of X nodes belong to 8 different parts of speech: X (981; 46% instances), NOUN (740; 35% instances), VERB (153; 7% instances), (130; 6% instances), ADJ (80; 4% instances), NUM (33; 2% instances), ADV (2; 0% instances), ADP (1; 0% instances)

1575 (74%) X nodes are leaves.

212 (10%) X nodes have one child.

48 (2%) X nodes have two children.

285 (13%) X nodes have three or more children.

The highest child degree of a X node is 16.

Children of X nodes are attached using 19 different relations: nmod (920; 46% instances), punct (603; 30% instances), case (202; 10% instances), conj (139; 7% instances), cc (50; 2% instances), advmod:emph (28; 1% instances), obl:arg (21; 1% instances), nsubj (20; 1% instances), obl (13; 1% instances), obj (8; 0% instances), advmod (3; 0% instances), expl:pv (3; 0% instances), orphan (3; 0% instances), xcomp (3; 0% instances), amod (1; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), nummod (1; 0% instances)

Children of X nodes belong to 12 different parts of speech: X (981; 49% instances), PUNCT (603; 30% instances), ADP (202; 10% instances), NOUN (130; 6% instances), CCONJ (40; 2% instances), PART (38; 2% instances), PRON (8; 0% instances), NUM (6; 0% instances), ADV (5; 0% instances), VERB (5; 0% instances), ADJ (2; 0% instances), SCONJ (1; 0% instances)