Treebank Statistics: UD_Russian-SynTagRus: POS Tags: SYM
There are 18 SYM lemmas (0%), 17 SYM types (0%) and 1279 SYM tokens (0%).
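Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the `SAMPLE` fragment are hypothetical, and it assumes that types are distinct word forms compared case-sensitively.

```python
from collections import Counter

# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
# text = 20 % , 5 $
1\t20\t20\tNUM\t_\t_\t2\tnummod\t_\t_
2\t%\t%\tSYM\t_\t_\t0\troot\t_\t_
3\t,\t,\tPUNCT\t_\t_\t5\tpunct\t_\t_
4\t5\t5\tNUM\t_\t_\t5\tnummod\t_\t_
5\t$\t$\tSYM\t_\t_\t2\tconj\t_\t_
"""

def upos_counts(conllu_text, upos):
    """Count distinct lemmas, distinct forms (types) and tokens for one UPOS tag."""
    lemmas, types = Counter(), Counter()
    total_tokens = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total_tokens += 1
        if cols[3] == upos:  # column 4 is UPOS
            types[cols[1]] += 1   # column 2 is FORM
            lemmas[cols[2]] += 1  # column 3 is LEMMA
    tokens = sum(types.values())
    share = tokens / total_tokens if total_tokens else 0.0
    return {"lemmas": len(lemmas), "types": len(types),
            "tokens": tokens, "share": share}

print(upos_counts(SAMPLE, "SYM"))
```

The `share` value corresponds to the percentage in parentheses; for a rare tag such as SYM it rounds to 0% on a treebank of this size.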
Out of 17 observed tags, the rank of SYM is: 15 in number of lemmas, 17 in number of types and 15 in number of tokens.
The 10 most frequent SYM lemmas: %, $, №, °, &, €, +, =, №№, x
The 10 most frequent SYM types: %, $, №, °, &, €, +, =, №№, х
The 10 most frequent ambiguous lemmas: & (SYM 11, X 1), + (SYM 7, PUNCT 1), x (ADJ 5, NUM 4, PUNCT 2, SYM 2), - (PUNCT 24673, SYM 1), 0 (NUM 21, SYM 1), ? (PUNCT 4767, SYM 1), g (X 3, SYM 1)
The 10 most frequent ambiguous types: & (SYM 11, X 1), + (SYM 7, PUNCT 1), х (SYM 3, CCONJ 1, NOUN 1), - (PUNCT 24674, SYM 1), 0 (NUM 21, SYM 1), ? (PUNCT 4767, SYM 1), g (X 4, SYM 1)
- &
- +
- х
  - SYM 3: А , скажем , в библиотеках достаточно трубопроводов прямоугольного сечения ( 20 х 40 см ) и контейнеров грузоподъемностью 25 кг .
  - CCONJ 1: Естественно , что первые десять манипуляторов при этом изготовят 10 х 10 = 100 штук манипуляторов , уменьшенных , однако , уже в 16 раз …
  - NOUN 1: Голова по-армянски : глух’ , с коротким придыханием после “ х “ и мягким “ л “ …
- -
  - PUNCT 24674: - Детерминированность ( определённость ) .
  - SYM 1: Хотя на Земле и есть организмы , которые могут использовать для фотосинтеза не превращение воды ( H2O ) в кислород и четыре иона водорода ( H+ ) с четырьмя электронами ( e - ) , а реакцию , где сероводород H2S даёт два атома серы … и те же четыре электрона с четырьмя ионами водорода .
- 0
- ?
- g
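The ambiguity lists above pair each form with the tags it receives and their frequencies. A minimal sketch of that grouping follows, assuming the input is a sequence of (form, tag) pairs; the function name `ambiguous_with` and the toy input are hypothetical, with the counts for х mirroring the figures reported above.

```python
from collections import Counter, defaultdict

def ambiguous_with(pairs, upos):
    """Return forms tagged with `upos` at least once and with some other tag too."""
    by_form = defaultdict(Counter)
    for form, tag in pairs:
        by_form[form][tag] += 1
    return {
        form: tags.most_common()  # most frequent tag first
        for form, tags in by_form.items()
        if upos in tags and len(tags) > 1
    }

# Toy input: х occurs as SYM 3 times, CCONJ once and NOUN once; % is always SYM.
pairs = [("х", "SYM")] * 3 + [("х", "CCONJ"), ("х", "NOUN"), ("%", "SYM"), ("%", "SYM")]
for form, tags in ambiguous_with(pairs, "SYM").items():
    print(form, ", ".join(f"{tag} {n}" for tag, n in tags))
```

Unambiguous forms such as % are filtered out because they only ever carry one tag.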
Morphology
The form / lemma ratio of SYM is 0.944444 (the average of all parts of speech is 2.654430).
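The reported ratio is consistent with dividing the number of SYM types (17) by the number of SYM lemmas (18). A short sketch of that arithmetic, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Distinct word forms divided by distinct lemmas for one part of speech."""
    if n_lemmas == 0:
        raise ValueError("no lemmas observed for this tag")
    return n_forms / n_lemmas

ratio = form_lemma_ratio(17, 18)
print(f"{ratio:.6f}")  # 0.944444
```

A ratio below 1 is possible only if forms are counted as distinct strings across the whole tag, so that at least one form is shared by more than one lemma.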
The 1st highest number of forms (1) was observed with the lemma “$”: $.
The 2nd highest number of forms (1) was observed with the lemma “%”: %.
The 3rd highest number of forms (1) was observed with the lemma “&”: &.
SYM does not occur with any features.
Relations
SYM nodes are attached to their parents using 21 different relations: obl (535; 42% instances), nsubj (172; 13% instances), nmod (170; 13% instances), parataxis (96; 8% instances), conj (95; 7% instances), nummod:entity (41; 3% instances), compound (30; 2% instances), orphan (26; 2% instances), root (26; 2% instances), obj (22; 2% instances), nsubj:pass (19; 1% instances), appos (11; 1% instances), flat:foreign (11; 1% instances), flat (6; 0% instances), iobj (6; 0% instances), case (5; 0% instances), advcl (2; 0% instances), ccomp (2; 0% instances), list (2; 0% instances), cc (1; 0% instances), fixed (1; 0% instances)
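The percentages above appear to be each count divided by the total number of SYM tokens, rounded to the nearest integer. The sketch below checks that reading against the reported figures; the names `RELATIONS` and `format_distribution` are hypothetical, and the rounding rule is an assumption that happens to reproduce the numbers shown.

```python
# Counts copied from the sentence above (SYM nodes by incoming relation).
RELATIONS = {
    "obl": 535, "nsubj": 172, "nmod": 170, "parataxis": 96, "conj": 95,
    "nummod:entity": 41, "compound": 30, "orphan": 26, "root": 26, "obj": 22,
    "nsubj:pass": 19, "appos": 11, "flat:foreign": 11, "flat": 6, "iobj": 6,
    "case": 5, "advcl": 2, "ccomp": 2, "list": 2, "cc": 1, "fixed": 1,
}

def format_distribution(counts):
    """Render counts as 'name (count; pct% instances)', most frequent first."""
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda kv: -kv[1])  # stable for ties
    return [f"{name} ({n}; {round(100 * n / total)}% instances)"
            for name, n in ordered]

print(sum(RELATIONS.values()))  # 1279, the number of SYM tokens
print(", ".join(format_distribution(RELATIONS)[:3]))
```

The counts sum to 1279, which matches the SYM token total, so every SYM node is accounted for by exactly one incoming relation.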
Parents of SYM nodes belong to 10 different parts of speech: VERB (699; 55% instances), NOUN (315; 25% instances), SYM (55; 4% instances), NUM (48; 4% instances), ADV (47; 4% instances), ADJ (41; 3% instances), PROPN (29; 2% instances), [artificial root, no part of speech] (26; 2% instances), X (15; 1% instances), PRON (4; 0% instances)
66 (5%) SYM nodes are leaves.
197 (15%) SYM nodes have one child.
546 (43%) SYM nodes have two children.
470 (37%) SYM nodes have three or more children.
The highest child degree of a SYM node is 9.
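The child-degree breakdown above can be derived from the HEAD column alone. A minimal sketch under stated assumptions: the function `degree_buckets` and the five-node toy tree are hypothetical and do not come from the treebank.

```python
from collections import Counter

def degree_buckets(heads, targets):
    """Bucket target nodes by number of children: '0', '1', '2' or '3+'.

    heads maps each node id to the id of its head (0 for the root).
    """
    n_children = Counter(heads.values())  # how many nodes point at each head
    buckets = Counter()
    for node in targets:
        degree = n_children[node]  # 0 for nodes that head nothing
        buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets

# Toy tree: node 2 is the root with children 1, 3 and 4; node 5 hangs off node 4.
heads = {1: 2, 2: 0, 3: 2, 4: 2, 5: 4}
print(degree_buckets(heads, targets=[2, 4, 5]))
```

In a real run, `targets` would be the ids of the SYM nodes in each sentence, with the buckets accumulated over the whole treebank.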
Children of SYM nodes are attached using 24 different relations: nummod (1112; 38% instances), nmod (512; 17% instances), case (496; 17% instances), punct (346; 12% instances), conj (74; 3% instances), advmod (68; 2% instances), nummod:entity (66; 2% instances), obl (54; 2% instances), cc (53; 2% instances), nsubj (39; 1% instances), parataxis (36; 1% instances), orphan (28; 1% instances), amod (17; 1% instances), nummod:gov (7; 0% instances), appos (6; 0% instances), det (6; 0% instances), cop (5; 0% instances), acl:relcl (4; 0% instances), acl (3; 0% instances), mark (3; 0% instances), expl (2; 0% instances), flat (2; 0% instances), discourse (1; 0% instances), flat:foreign (1; 0% instances)
Children of SYM nodes belong to 16 different parts of speech: NUM (1088; 37% instances), NOUN (607; 21% instances), ADP (487; 17% instances), PUNCT (346; 12% instances), ADV (100; 3% instances), ADJ (59; 2% instances), SYM (55; 2% instances), CCONJ (50; 2% instances), PROPN (48; 2% instances), PART (36; 1% instances), VERB (34; 1% instances), PRON (13; 0% instances), DET (8; 0% instances), AUX (5; 0% instances), SCONJ (4; 0% instances), INTJ (1; 0% instances)