home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-TDT: POS Tags: SYM

There are 200 SYM lemmas (1%), 202 SYM types (0%) and 479 SYM tokens (0%). Out of 15 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent SYM lemmas: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4

The 10 most frequent SYM types: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4

The 10 most frequent ambiguous lemmas: :) (SYM 64, PUNCT 1), % (SYM 42, NOUN 9), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 22, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), K (PROPN 1, SYM 1), V (ADJ 10, NOUN 1, SYM 1), × (PROPN 4, SYM 1)

The 10 most frequent ambiguous types: :) (SYM 64, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), A (NOUN 10, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), V (ADJ 7, NOUN 1, SYM 1), × (PROPN 4, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.010000 (the average of all parts of speech is 2.060628).

The 1st highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.

The 2nd highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.

The 3rd highest number of forms (1) was observed with the lemma “#”: #.

SYM occurs with 1 features: Case (2; 0% instances)

SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (477 tokens). Examples: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4

Relations

SYM nodes are attached to their parents using 24 different relations: discourse (121; 25% instances), flat:name (95; 20% instances), nmod (50; 10% instances), punct (36; 8% instances), obj (29; 6% instances), appos (27; 6% instances), nsubj (20; 4% instances), obl (17; 4% instances), conj (16; 3% instances), root (11; 2% instances), compound:nn (10; 2% instances), cc (9; 2% instances), nsubj:cop (8; 2% instances), advcl (6; 1% instances), compound (6; 1% instances), nummod (4; 1% instances), dep (3; 1% instances), parataxis (3; 1% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), advmod (1; 0% instances), case (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: NOUN (158; 33% instances), VERB (149; 31% instances), SYM (82; 17% instances), ADJ (33; 7% instances), PROPN (26; 5% instances), (11; 2% instances), NUM (8; 2% instances), ADV (5; 1% instances), PRON (4; 1% instances), X (3; 1% instances)

310 (65%) SYM nodes are leaves.

44 (9%) SYM nodes have one child.

67 (14%) SYM nodes have two children.

58 (12%) SYM nodes have three or more children.

The highest child degree of a SYM node is 14.

Children of SYM nodes are attached using 20 different relations: punct (153; 35% instances), flat:name (90; 21% instances), nummod (56; 13% instances), nmod (22; 5% instances), nsubj:cop (18; 4% instances), conj (17; 4% instances), cop (15; 3% instances), cc (13; 3% instances), compound:nn (12; 3% instances), advmod (11; 3% instances), acl:relcl (5; 1% instances), appos (5; 1% instances), compound (4; 1% instances), obl (4; 1% instances), orphan (4; 1% instances), mark (3; 1% instances), acl (2; 0% instances), amod (2; 0% instances), advcl (1; 0% instances), case (1; 0% instances)

Children of SYM nodes belong to 13 different parts of speech: PUNCT (153; 35% instances), SYM (82; 19% instances), NUM (74; 17% instances), NOUN (64; 15% instances), AUX (15; 3% instances), CCONJ (13; 3% instances), ADV (12; 3% instances), VERB (9; 2% instances), ADJ (5; 1% instances), PROPN (5; 1% instances), PRON (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)