This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

SYM: symbol

A symbol is a word-like entity that differs from ordinary words by form, function, or both.

Examples


Treebank Statistics (UD_Finnish)

There are 196 SYM lemmas (1%), 198 SYM types (0%) and 458 SYM tokens (0%). Out of 15 observed tags, SYM ranks 8th in number of lemmas, 9th in number of types and 13th in number of tokens.

The 10 most frequent SYM lemmas: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21

The 10 most frequent SYM types: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21

The 10 most frequent ambiguous lemmas: :) (SYM 68, PUNCT 1), % (SYM 37, NOUN 9), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 21, PROPN 7, SYM 1), B (NOUN 3, SYM 1, PROPN 1), K (SYM 1, PROPN 1), V (ADJ 10, NOUN 1, SYM 1), × (PROPN 4, SYM 1)

The most frequent ambiguous types (only 7 were observed): :) (SYM 68, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), A (NOUN 9, PROPN 7, SYM 1), B (NOUN 3, SYM 1, PROPN 1), V (ADJ 7, NOUN 1, SYM 1), × (PROPN 4, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.010204 (the average of all parts of speech is 2.036755).
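The reported ratio agrees with dividing the number of SYM types by the number of SYM lemmas (198 / 196). The following is a minimal sketch of how such a figure could be recomputed from a CoNLL-U file. It assumes that a "type" is a distinct, case-sensitive word form; the function name `form_lemma_ratio` and the sample rows are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
def form_lemma_ratio(conllu_text, upos="SYM"):
    """Return (distinct forms, distinct lemmas, forms / lemmas) for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # Comments, blank lines and multiword-token ranges are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 holds the universal POS tag
            forms.add(cols[1])   # column 2: word form
            lemmas.add(cols[2])  # column 3: lemma
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms), len(lemmas), len(forms) / len(lemmas)


# Hypothetical sample (assumption): three tiny sentences, written with
# spaces here and converted to tab-separated CoNLL-U columns below.
ROWS = [
    "1 20 20 NUM _ _ 2 nummod _ _",
    "2 °C °C SYM _ _ 0 root _ _",
    "",
    "1 5 5 NUM _ _ 2 nummod _ _",
    "2 °C:ta °C SYM _ Case=Par 0 root _ _",
    "",
    "1 10 10 NUM _ _ 2 nummod _ _",
    "2 % % SYM _ _ 0 root _ _",
]
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)

print(form_lemma_ratio(SAMPLE))  # (3, 2, 1.5)
print(round(198 / 196, 6))       # 1.010204
```

In the sample, the lemma °C has two forms and % has one, so the ratio is 3 / 2. A ratio close to 1, as reported for SYM, means that almost no symbol is inflected.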

The highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.

The second-highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.

The third-highest number of forms (1) was observed with the lemma “#”: #.

SYM occurs with 1 feature: fi-feat/Case (2; 0% instances)

SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par

SYM occurs with 3 feature combinations. The most frequent feature combination is _, that is, no features at all (456 tokens). Examples: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21

Relations

SYM nodes are attached to their parents using 23 different relations: fi-dep/discourse (118; 26% instances), fi-dep/name (95; 21% instances), fi-dep/nmod (67; 15% instances), fi-dep/dobj (27; 6% instances), fi-dep/appos (26; 6% instances), fi-dep/punct (26; 6% instances), fi-dep/nsubj (19; 4% instances), fi-dep/conj (12; 3% instances), fi-dep/root (11; 2% instances), fi-dep/compound:nn (10; 2% instances), fi-dep/cc (9; 2% instances), fi-dep/nsubj:cop (7; 2% instances), fi-dep/advcl (6; 1% instances), fi-dep/compound (6; 1% instances), fi-dep/remnant (4; 1% instances), fi-dep/dep (3; 1% instances), fi-dep/nummod (3; 1% instances), fi-dep/parataxis (3; 1% instances), fi-dep/acl:relcl (2; 0% instances), fi-dep/advmod (1; 0% instances), fi-dep/amod (1; 0% instances), fi-dep/case (1; 0% instances), fi-dep/vocative (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (147; 32% instances), NOUN (138; 30% instances), SYM (86; 19% instances), ADJ (33; 7% instances), PROPN (26; 6% instances), ROOT (11; 2% instances), NUM (8; 2% instances), ADV (4; 1% instances), X (3; 1% instances), PRON (2; 0% instances)
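The two distributions above (relation to the parent, and part of speech of the parent) can be tallied per sentence from the HEAD and DEPREL columns. The sketch below is one hedged way to do it, not the script that produced this page. The helper names `parse_sentences`, `attachment_stats` and `as_percentages` are hypothetical, as is the sample, and rounding to whole percentages is an assumption based on the figures shown.

```python
from collections import Counter


def parse_sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows for regular words."""
    sentence = []
    for line in conllu_text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        # Skip comments, multiword-token ranges and malformed lines.
        if len(cols) == 10 and cols[0].isdigit():
            sentence.append(cols)
    if sentence:
        yield sentence


def attachment_stats(conllu_text, upos="SYM"):
    """Count the relations and the parent POS tags of nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sentence in parse_sentences(conllu_text):
        tag_by_id = {row[0]: row[3] for row in sentence}
        for row in sentence:
            if row[3] != upos:
                continue
            relations[row[7]] += 1  # column 8: DEPREL
            head = row[6]           # column 7: HEAD, "0" marks the root
            parent_tags["ROOT" if head == "0" else tag_by_id.get(head, "?")] += 1
    return relations, parent_tags


def as_percentages(counter):
    """Map each key to its share of the total, rounded to a whole percent."""
    total = sum(counter.values())
    return {key: round(100 * n / total) for key, n in counter.most_common()}


# Hypothetical sample (assumption), converted to tab-separated columns.
ROWS = [
    "1 Kiitos kiitos NOUN _ _ 0 root _ _",
    "2 :) :) SYM _ _ 1 discourse _ _",
    "",
    "1 5 5 NUM _ _ 2 nummod _ _",
    "2 % % SYM _ _ 0 root _ _",
]
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)

relations, parent_tags = attachment_stats(SAMPLE)
print(as_percentages(relations))
print(as_percentages(parent_tags))
```

Applied to the reported counts, the same rounding gives 118 / 458 ≈ 26% for fi-dep/discourse, matching the figure listed above.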

299 (65%) SYM nodes are leaves.

39 (9%) SYM nodes have one child.

64 (14%) SYM nodes have two children.

56 (12%) SYM nodes have three or more children.

The highest child degree of a SYM node is 13.
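The leaf and child counts above partition the 458 SYM tokens (299 + 39 + 64 + 56 = 458). A minimal sketch of how such a child-degree breakdown could be derived from the HEAD column follows. The names `sentences` and `child_degree_summary`, the bucket labels and the sample are all hypothetical, introduced only for illustration.

```python
from collections import Counter


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows for regular words."""
    block = []
    # The extra "" guarantees that the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if line.strip():
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                block.append(cols)
        elif block:
            yield block
            block = []


def child_degree_summary(conllu_text, upos="SYM"):
    """Bucket nodes tagged `upos` by number of children; also return the maximum."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    summary = Counter()
    highest = 0
    for block in sentences(conllu_text):
        # Each occurrence of an ID in the HEAD column is one child of that node.
        n_children = Counter(row[6] for row in block)
        for row in block:
            if row[3] != upos:
                continue
            degree = n_children[row[0]]
            highest = max(highest, degree)
            summary[labels.get(degree, "three or more")] += 1
    return summary, highest


# Hypothetical sample (assumption), converted to tab-separated columns.
ROWS = [
    "1 Kiitos kiitos NOUN _ _ 0 root _ _",
    "2 :) :) SYM _ _ 1 discourse _ _",
    "",
    "1 5 5 NUM _ _ 2 nummod _ _",
    "2 % % SYM _ _ 0 root _ _",
]
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)

summary, highest = child_degree_summary(SAMPLE)
print(dict(summary), highest)
```

In the sample, the emoticon has no dependents and is a leaf, while the percent sign governs the numeral and therefore has one child.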

Children of SYM nodes are attached using 19 different relations: fi-dep/punct (152; 36% instances), fi-dep/name (90; 21% instances), fi-dep/nummod (51; 12% instances), fi-dep/nmod (21; 5% instances), fi-dep/nsubj:cop (18; 4% instances), fi-dep/conj (17; 4% instances), fi-dep/cop (15; 4% instances), fi-dep/cc (13; 3% instances), fi-dep/compound:nn (12; 3% instances), fi-dep/advmod (11; 3% instances), fi-dep/acl:relcl (5; 1% instances), fi-dep/appos (5; 1% instances), fi-dep/compound (4; 1% instances), fi-dep/remnant (4; 1% instances), fi-dep/mark (3; 1% instances), fi-dep/acl (2; 0% instances), fi-dep/amod (2; 0% instances), fi-dep/advcl (1; 0% instances), fi-dep/case (1; 0% instances)

Children of SYM nodes belong to 12 different parts of speech: PUNCT (152; 36% instances), SYM (86; 20% instances), NUM (69; 16% instances), NOUN (57; 13% instances), VERB (24; 6% instances), CONJ (13; 3% instances), ADV (12; 3% instances), ADJ (5; 1% instances), PRON (3; 1% instances), PROPN (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)


Treebank Statistics (UD_Finnish-FTB)

There are 6 SYM lemmas (0%), 6 SYM types (0%) and 22 SYM tokens (0%). Out of 16 observed tags, SYM ranks 16th in number of lemmas, 16th in number of types and 16th in number of tokens.

All 6 SYM lemmas, from most to least frequent: %, &, /, +, *, @

All 6 SYM types, from most to least frequent: %, &, /, +, *, @

No ambiguous SYM lemmas are listed for this treebank.

No ambiguous SYM types are listed for this treebank.

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.044212).

The highest number of forms (1) was observed with the lemma “%”: %.

The second-highest number of forms (1) was observed with the lemma “&”: &.

The third-highest number of forms (1) was observed with the lemma “*”: *.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 1 relation: fi-dep/dep (22; 100% instances)

Parents of SYM nodes belong to 3 different parts of speech: NOUN (11; 50% instances), PROPN (7; 32% instances), VERB (4; 18% instances)

13 (59%) SYM nodes are leaves.

7 (32%) SYM nodes have one child.

2 (9%) SYM nodes have two children.

The highest child degree of a SYM node is 2.

Children of SYM nodes are attached using 2 different relations: fi-dep/nummod (8; 73% instances), fi-dep/punct (3; 27% instances)

Children of SYM nodes belong to 2 different parts of speech: NUM (8; 73% instances), PUNCT (3; 27% instances)


SYM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]