home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Polish-PUD: POS Tags: X

There are 82 X lemmas (2%), 82 X types (1%) and 104 X tokens (1%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: of, the, de, a, 2004, 2016, El, association, von, ‘ya

The 10 most frequent X types: of, the, de, a, 2004, 2016, Association, El, Von, ‘Ya

The 10 most frequent ambiguous lemmas: a (CCONJ 56, PART 3, X 3), 2004 (X 2, ADJ 1), 2016 (X 2, ADJ 1), 1918 (ADJ 1, X 1), 1991 (ADJ 1, X 1), 1992 (ADJ 2, X 1), 1994 (ADJ 1, X 1), 1997 (ADJ 2, X 1), 2008 (ADJ 1, X 1), 2013 (ADJ 3, X 1)

The 10 most frequent ambiguous types: a (CCONJ 53, X 2, PART 1), 2004 (X 2, ADJ 1), 2016 (X 2, ADJ 1), 1918 (ADJ 1, X 1), 1991 (ADJ 1, X 1), 1992 (ADJ 2, X 1), 1994 (ADJ 1, X 1), 1997 (ADJ 2, X 1), 2008 (ADJ 1, X 1), 2013 (ADJ 3, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.436422).

The 1st highest number of forms (1) was observed with the lemma “’ya”: ‘Ya.

The 2nd highest number of forms (1) was observed with the lemma “1165”: 1.165.

The 3rd highest number of forms (1) was observed with the lemma “1918”: 1918.

X occurs with 2 features: Foreign (81; 78% instances), NumForm (21; 20% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, NumForm=Digit

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (81 tokens). Examples: of, the, de, a, Association, El, Von, ‘Ya, America, Assistant

Relations

X nodes are attached to their parents using 12 different relations: flat:foreign (38; 37% instances), amod (16; 15% instances), flat (15; 14% instances), conj (10; 10% instances), nmod (8; 8% instances), obl (4; 4% instances), nmod:arg (3; 3% instances), nsubj (3; 3% instances), appos (2; 2% instances), fixed (2; 2% instances), iobj (2; 2% instances), advcl:cmp (1; 1% instances)

Parents of X nodes belong to 6 different parts of speech: X (51; 49% instances), NOUN (29; 28% instances), PROPN (12; 12% instances), VERB (8; 8% instances), ADJ (2; 2% instances), ADP (2; 2% instances)

29 (28%) X nodes are leaves.

49 (47%) X nodes have one child.

13 (13%) X nodes have two children.

13 (13%) X nodes have three or more children.

The highest child degree of a X node is 5.

Children of X nodes are attached using 12 different relations: flat:foreign (38; 32% instances), punct (38; 32% instances), flat (18; 15% instances), conj (8; 7% instances), cc (5; 4% instances), case (3; 3% instances), amod (2; 2% instances), mark (2; 2% instances), nmod (1; 1% instances), nmod:arg (1; 1% instances), nmod:flat (1; 1% instances), nmod:poss (1; 1% instances)

Children of X nodes belong to 7 different parts of speech: X (51; 43% instances), PUNCT (38; 32% instances), PROPN (17; 14% instances), CCONJ (5; 4% instances), ADP (3; 3% instances), NOUN (2; 2% instances), SCONJ (2; 2% instances)