home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: SYM

There are 15 SYM lemmas (0%), 15 SYM types (0%) and 260 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 16 in number of lemmas, 17 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, km, $, 1,500+m, 10+m, 20+ha, 200+m, 216+m, 30+m, 3000+m

The 10 most frequent SYM types: %, km, $, 1,500m, 10m, 200m, 20ha, 216m, 3000m, 30m

The 10 most frequent ambiguous lemmas: km (X 7, SYM 4), C (X 2, SYM 1)

The 10 most frequent ambiguous types: km (X 7, SYM 4), C (X 2, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 0.998034).

The 1st highest number of forms (1) was observed with the lemma “$”: $.

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “1,500+m”: 1,500m.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 6 different relations: nummod (237; 91% instances), dep (9; 3% instances), root (6; 2% instances), conj (5; 2% instances), compound (2; 1% instances), obj (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (90; 35% instances), NOUN (70; 27% instances), SCONJ (27; 10% instances), ADV (22; 8% instances), NUM (21; 8% instances), ADJ (10; 4% instances), CCONJ (7; 3% instances), (6; 2% instances), SYM (4; 2% instances), PROPN (3; 1% instances)

5 (2%) SYM nodes are leaves.

211 (81%) SYM nodes have one child.

26 (10%) SYM nodes have two children.

18 (7%) SYM nodes have three or more children.

The highest child degree of a SYM node is 3.

Children of SYM nodes are attached using 6 different relations: nummod (250; 79% instances), punct (62; 20% instances), compound (2; 1% instances), dislocated (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)

Children of SYM nodes belong to 4 different parts of speech: NUM (246; 78% instances), PUNCT (62; 20% instances), NOUN (5; 2% instances), SYM (4; 1% instances)