This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-OOD: POS Tags: SYM

There are 116 SYM lemmas (2%), 117 SYM types (1%) and 196 SYM tokens (1%). Out of 15 observed tags, the rank of SYM is: 7 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent SYM lemmas: %, ->, –>, :), +, ;), :D, →, =, ~

The 10 most frequent SYM types: %, ->, –>, :), +, ;), :D, →, =, ~

The 10 most frequent ambiguous lemmas: % (SYM 22, NOUN 3), = (PUNCT 3, SYM 3)

The 10 most frequent ambiguous types: = (PUNCT 3, SYM 3)

Morphology

The form / lemma ratio of SYM is 1.008621 (the average of all parts of speech is 1.566190).
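The reported ratio is consistent with the counts given at the top of this page. The following is a minimal check, assuming the ratio is simply the number of SYM types divided by the number of SYM lemmas (the page does not state the formula explicitly; the variable names are illustrative):

```python
# Counts reported on this page for SYM in UD_Finnish-OOD.
sym_types = 117
sym_lemmas = 116

# Assumption: form / lemma ratio = distinct word forms (types)
# divided by distinct lemmas.
form_lemma_ratio = sym_types / sym_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 1.008621
```

Under this reading, a ratio this close to 1 means that almost every SYM lemma occurs with a single form; the only lemma listed with two forms is “https”.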

The highest number of forms (2) was observed with the lemma “https”: https://t.co/O7y8YXnXJM, https://t.co/l4w6clYexv.

The 2nd highest number of forms (1) was observed with the lemma “#2”: #2.

The 3rd highest number of forms (1) was observed with the lemma “%”: %.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 15 different relations: discourse (141; 72% instances), nmod:poss (10; 5% instances), obl (8; 4% instances), cc (7; 4% instances), advmod (5; 3% instances), conj (4; 2% instances), parataxis (4; 2% instances), root (4; 2% instances), appos (3; 2% instances), flat:foreign (3; 2% instances), nummod (2; 1% instances), orphan (2; 1% instances), case (1; 1% instances), nmod (1; 1% instances), obj (1; 1% instances)
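A distribution like the one above could be recomputed from the treebank's CoNLL-U files. The sketch below is not the script that generated this page; the function name `sym_relation_counts` and the two-token sample sentence are hypothetical, and the rounding convention is an assumption.

```python
from collections import Counter

def sym_relation_counts(conllu_text):
    """Count the DEPREL value of every token whose UPOS is SYM."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # CoNLL-U token lines have 10 tab-separated columns; IDs such as
        # "1-2" (multiword tokens) or "1.1" (empty nodes) are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == "SYM":        # column 4: UPOS
            counts[cols[7]] += 1    # column 8: DEPREL
    return counts

# Made-up sample sentence, for illustration only (not from the treebank).
sample = "\n".join([
    "# text = Hyvä :)",
    "1\tHyvä\thyvä\tADJ\t_\t_\t0\troot\t_\t_",
    "2\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_",
])

counts = sym_relation_counts(sample)
total = sum(counts.values())
for rel, n in counts.most_common():
    # Assumption: percentages are rounded to the nearest integer.
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

Run over the full treebank, the counts should sum to the 196 SYM tokens reported at the top of this page.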

Parents of SYM nodes belong to 12 different parts of speech: VERB (93; 47% instances), NOUN (50; 26% instances), PROPN (14; 7% instances), NUM (10; 5% instances), ADJ (7; 4% instances), PRON (5; 3% instances), SYM (5; 3% instances), [no tag; the artificial root node] (4; 2% instances), X (4; 2% instances), AUX (2; 1% instances), ADV (1; 1% instances), INTJ (1; 1% instances)

164 (84%) SYM nodes are leaves.

18 (9%) SYM nodes have one child.

8 (4%) SYM nodes have two children.

6 (3%) SYM nodes have three or more children.

The highest child degree of a SYM node is 7.
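The child-degree figures above (leaves, one child, two children, and so on) can be derived by counting, for each SYM token, how many tokens in the same sentence name it as their HEAD. The following is a rough sketch under that assumption; `sym_child_degrees` and the sample sentences are hypothetical and not taken from the treebank.

```python
from collections import Counter

def sym_child_degrees(conllu_text):
    """Map number of children -> number of SYM nodes with that many children."""
    degrees = Counter()
    # Sentences in CoNLL-U are separated by a blank line.
    for sentence in conllu_text.strip().split("\n\n"):
        upos = {}             # token ID -> UPOS tag
        children = Counter()  # token ID -> number of dependents
        for line in sentence.splitlines():
            if line.startswith("#"):
                continue
            cols = line.split("\t")
            # Keep only regular tokens (integer IDs, 10 columns).
            if len(cols) != 10 or not cols[0].isdigit():
                continue
            upos[cols[0]] = cols[3]
            children[cols[6]] += 1  # column 7: HEAD
        for token_id, tag in upos.items():
            if tag == "SYM":
                degrees[children[token_id]] += 1
    return degrees

# Made-up sample: "%" heads a numeral, ":)" is a leaf.
sample = "\n\n".join([
    "1\t10\t10\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\t%\t%\tSYM\t_\t_\t0\troot\t_\t_",
    "1\tHyvä\thyvä\tADJ\t_\t_\t0\troot\t_\t_\n"
    "2\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_",
])

degrees = sym_child_degrees(sample)
print(degrees[0], degrees[1])  # SYM leaves, SYM nodes with one child
```

On the real data, the four buckets reported above (164, 18, 8 and 6 nodes) add up to the 196 SYM tokens in the treebank.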

Children of SYM nodes are attached using 10 different relations: nummod (25; 41% instances), punct (14; 23% instances), conj (6; 10% instances), nsubj:cop (5; 8% instances), obl (3; 5% instances), case (2; 3% instances), cc (2; 3% instances), discourse (2; 3% instances), acl:relcl (1; 2% instances), advmod (1; 2% instances)

Children of SYM nodes belong to 9 different parts of speech: NUM (26; 43% instances), PUNCT (14; 23% instances), NOUN (7; 11% instances), SYM (5; 8% instances), VERB (3; 5% instances), ADP (2; 3% instances), CCONJ (2; 3% instances), ADV (1; 2% instances), PRON (1; 2% instances)