home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: X

There are 358 X lemmas (1%), 357 X types (0%) and 511 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: of, and, the, in, for, de, mignews.com, capture, di, money

The 10 most frequent X types: of, and, the, in, for, MIGNews.com, de, capture, di, money

The 10 most frequent ambiguous lemmas: а (CCONJ 5975, INTJ 5, NOUN 4, PART 4, X 4), daily (X 3, ADJ 1), g (X 3, NOUN 1), i (NUM 26, X 3), б (X 3, NOUN 1), и (CCONJ 24778, PART 4708, X 3, VERB 1), аль (PART 10, X 2), * (PUNCT 1, X 1), robots.txt (NOUN 5, X 1), англ (ADJ 1, X 1)

The 10 most frequent ambiguous types: а (CCONJ 4172, X 4, INTJ 3, PART 1), daily (X 3, ADJ 1), g (X 3, NOUN 1), б (AUX 15, X 3), и (CCONJ 22604, PART 4668, X 3), аль (PART 9, X 2), * (PUNCT 1, X 1), PS (PROPN 1, X 1), S (PROPN 2, X 1), homo (PROPN 4, X 1)

Morphology

The form / lemma ratio of X is 0.997207 (the average of all parts of speech is 2.589377).

The 1st highest number of forms (1) was observed with the lemma “*”: *.

The 2nd highest number of forms (1) was observed with the lemma “12:04”: 12:04.

The 3rd highest number of forms (1) was observed with the lemma “15.08.2008”: 15.08.2008.

X occurs with 1 features: Foreign (508; 99% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (508 tokens). Examples: of, and, the, in, for, MIGNews.com, de, capture, di, money

Relations

X nodes are attached to their parents using 16 different relations: flat:foreign (351; 69% instances), appos (74; 14% instances), nsubj (19; 4% instances), nmod (14; 3% instances), parataxis (14; 3% instances), obl (10; 2% instances), root (7; 1% instances), conj (6; 1% instances), obj (6; 1% instances), fixed (2; 0% instances), nsubj:pass (2; 0% instances), orphan (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), compound (1; 0% instances), flat:name (1; 0% instances)

Parents of X nodes belong to 11 different parts of speech: NOUN (298; 58% instances), VERB (69; 14% instances), PROPN (55; 11% instances), X (47; 9% instances), ADJ (24; 5% instances), (7; 1% instances), NUM (4; 1% instances), ADV (3; 1% instances), DET (2; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

296 (58%) X nodes are leaves.

118 (23%) X nodes have one child.

59 (12%) X nodes have two children.

38 (7%) X nodes have three or more children.

The highest child degree of a X node is 8.

Children of X nodes are attached using 19 different relations: punct (256; 65% instances), appos (41; 10% instances), flat:foreign (26; 7% instances), parataxis (15; 4% instances), case (11; 3% instances), cc (9; 2% instances), amod (8; 2% instances), conj (5; 1% instances), nmod (4; 1% instances), advmod (3; 1% instances), fixed (3; 1% instances), mark (3; 1% instances), acl (2; 1% instances), det (2; 1% instances), nsubj (2; 1% instances), obl (2; 1% instances), orphan (2; 1% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (256; 65% instances), X (47; 12% instances), NOUN (32; 8% instances), ADJ (11; 3% instances), ADP (11; 3% instances), PROPN (11; 3% instances), CCONJ (9; 2% instances), VERB (9; 2% instances), ADV (2; 1% instances), DET (2; 1% instances), PART (2; 1% instances), PRON (2; 1% instances), SCONJ (2; 1% instances)