home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: X

There are 962 X lemmas (4%), 972 X types (2%) and 1794 X tokens (1%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: de, New, the, of, Open, play, off, York, Nature, and

The 10 most frequent X types: de, New, the, of, play, Open, off, Nature, York, and

The 10 most frequent ambiguous lemmas: York (X 20, PROPN 16), s (ADP 2504, NOUN 27, X 10, PART 6), Gondwana (X 8, PROPN 2), a (CCONJ 7162, NOUN 17, X 6), O (NOUN 14, X 6), cup (NOUN 5, X 1), Telecom (PROPN 7, X 5), set (NOUN 17, X 4), to (PART 97, X 2), Jersey (PROPN 5, X 4)

The 10 most frequent ambiguous types: York (X 19, PROPN 1), s (ADP 1960, NOUN 72, X 10, PART 6), a (CCONJ 6945, ADJ 32, NOUN 17, X 6), O (ADP 147, NOUN 14, X 6), Telecom (PROPN 5, X 5), set (DET 20, X 4, ADJ 1, NOUN 1), to (DET 1275, PART 93, X 2), Jersey (PROPN 5, X 4), Palace (X 4, PROPN 3), ad (PROPN 3, X 3, ADJ 1, ADP 1)

Morphology

The form / lemma ratio of X is 1.010395 (the average of all parts of speech is 1.964432).

The 1st highest number of forms (2) was observed with the lemma “And”: AND, And.

The 2nd highest number of forms (2) was observed with the lemma “Banshees”: BANSHEES, Banshees.

The 3rd highest number of forms (2) was observed with the lemma “Cup”: CUP, Cup.

X occurs with 1 features: Foreign (1794; 100% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 1 feature combinations. The most frequent feature combination is Foreign=Yes (1794 tokens). Examples: de, New, the, of, play, Open, off, Nature, York, and

Relations

X nodes are attached to their parents using 18 different relations: flat:foreign (827; 46% instances), nmod (552; 31% instances), obl (76; 4% instances), conj (72; 4% instances), nsubj (57; 3% instances), dep (53; 3% instances), root (53; 3% instances), appos (51; 3% instances), obj (20; 1% instances), case (14; 1% instances), cc (7; 0% instances), orphan (5; 0% instances), obl:arg (2; 0% instances), advcl (1; 0% instances), advmod:emph (1; 0% instances), amod (1; 0% instances), fixed (1; 0% instances), xcomp (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: X (889; 50% instances), NOUN (508; 28% instances), PROPN (148; 8% instances), VERB (144; 8% instances), (53; 3% instances), ADJ (22; 1% instances), NUM (22; 1% instances), ADV (7; 0% instances), AUX (1; 0% instances)

1013 (56%) X nodes are leaves.

307 (17%) X nodes have one child.

232 (13%) X nodes have two children.

242 (13%) X nodes have three or more children.

The highest child degree of a X node is 14.

Children of X nodes are attached using 23 different relations: flat:foreign (821; 46% instances), punct (360; 20% instances), case (143; 8% instances), nmod (116; 6% instances), nummod (73; 4% instances), amod (63; 4% instances), conj (61; 3% instances), appos (44; 2% instances), cc (32; 2% instances), dep (26; 1% instances), acl:relcl (13; 1% instances), det (7; 0% instances), cop (6; 0% instances), orphan (6; 0% instances), advmod:emph (4; 0% instances), mark (4; 0% instances), nsubj (3; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), advmod (2; 0% instances), aux (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (889; 50% instances), PUNCT (360; 20% instances), ADP (134; 7% instances), NOUN (116; 6% instances), NUM (88; 5% instances), PROPN (69; 4% instances), ADJ (64; 4% instances), VERB (22; 1% instances), CCONJ (20; 1% instances), ADV (8; 0% instances), DET (8; 0% instances), AUX (7; 0% instances), SCONJ (4; 0% instances), SYM (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)