home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-PUD: POS Tags: X

There are 87 X lemmas (2%), 87 X types (1%) and 95 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: a, and, my, really, the, you, American, Amnesty, Associated, Casa

The 10 most frequent X types: a, and, My, Really, The, You, American, Amnesty, Anyway, Associated

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.496727).

The 1st highest number of forms (1) was observed with the lemma “American”: American.

The 2nd highest number of forms (1) was observed with the lemma “Amnesty”: Amnesty.

The 3rd highest number of forms (1) was observed with the lemma “Associated”: Associated.

X occurs with 2 features: Foreign (95; 100% instances), Abbr (4; 4% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (91 tokens). Examples: a, and, My, Really, The, You, American, Amnesty, Anyway, Associated

Relations

X nodes are attached to their parents using 9 different relations: flat:foreign (60; 63% instances), flat (24; 25% instances), appos (4; 4% instances), obl (2; 2% instances), conj (1; 1% instances), iobj (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), root (1; 1% instances)

Parents of X nodes belong to 5 different parts of speech: X (58; 61% instances), NOUN (28; 29% instances), PROPN (4; 4% instances), VERB (4; 4% instances), (1; 1% instances)

61 (64%) X nodes are leaves.

9 (9%) X nodes have one child.

4 (4%) X nodes have two children.

21 (22%) X nodes have three or more children.

The highest child degree of a X node is 9.

Children of X nodes are attached using 8 different relations: flat:foreign (63; 55% instances), punct (43; 38% instances), case (3; 3% instances), appos (1; 1% instances), cc (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)

Children of X nodes belong to 9 different parts of speech: X (58; 51% instances), PUNCT (43; 38% instances), PROPN (6; 5% instances), ADP (2; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), NOUN (1; 1% instances), SCONJ (1; 1% instances), VERB (1; 1% instances)