
This page pertains to UD version 2.

Treebank Statistics: UD_English-CTeTex: POS Tags: SYM

There is 1 SYM lemma (6%), 19 SYM types (1%) and 98 SYM tokens (1%). Out of 17 observed tags, SYM ranks 15th in number of lemmas, 12th in number of types and 14th in number of tokens.
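Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment and the function name `upos_counts` are made up for illustration, and it assumes that a "type" is a distinct word form and that multiword-token ranges and empty nodes are skipped.

```python
# Hypothetical CoNLL-U fragment; columns are
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = (
    "# text = 5 % / 10 %\n"
    "1\t5\t_\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\t%\t_\tSYM\t_\t_\t0\troot\t_\t_\n"
    "3\t/\t_\tSYM\t_\t_\t5\tcc\t_\t_\n"
    "4\t10\t_\tNUM\t_\t_\t5\tnummod\t_\t_\n"
    "5\t%\t_\tSYM\t_\t_\t2\tconj\t_\t_\n"
)


def upos_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # assumption: skip ranges such as 1-2 and empty nodes such as 1.1
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = upos_counts(SAMPLE, "SYM")
```

As in this treebank, every lemma in the sample is the placeholder `_`, so the lemma count for SYM is 1 regardless of how many forms occur.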

The 10 most frequent SYM lemmas: _

The 10 most frequent SYM types: /, =, %, -, &, °, +, #, x, ###

The 10 most frequent ambiguous lemmas: _ (NOUN 2649, PUNCT 1455, DET 936, ADP 781, VERB 721, ADJ 647, AUX 492, NUM 317, PROPN 293, CCONJ 267, ADV 185, PART 165, SCONJ 163, SYM 98, PRON 83, X 17, INTJ 4)

The 10 most frequent ambiguous types: / (SYM 39, PUNCT 2), - (PUNCT 143, SYM 7), & (SYM 6, CCONJ 1), O (ADJ 2, NOUN 1, SYM 1)

Morphology

The form / lemma ratio of SYM is 19.000000 (the average of all parts of speech is 125.235294).
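The reported value is consistent with 19 types divided by 1 lemma. A minimal sketch of that calculation follows, assuming the ratio is the average number of distinct forms per lemma; the function name `form_lemma_ratio` and the sample pairs are illustrative only.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    pairs: iterable of (form, lemma) tuples, all carrying the same POS tag.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0  # no tokens with this tag
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Four tokens, three distinct forms, one placeholder lemma.
pairs = [("/", "_"), ("=", "_"), ("%", "_"), ("/", "_")]
ratio = form_lemma_ratio(pairs)
```

When all tokens share the lemma `_`, the ratio collapses to the number of distinct forms, which is why it equals the type count here.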

The highest number of forms (19) was observed with the lemma “_”: #, ###, %, &, , **, +, -, .spec, /, =, >, O, airport.xml, http://www.whitehouse.gov/omb/egov/a-2-EAModelsNEW2.html, www.cdc.gov/phin, x, °, ±.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 17 different relations: cc (35; 36% of instances), flat (11; 11% of instances), case (10; 10% of instances), list (6; 6% of instances), appos (5; 5% of instances), conj (5; 5% of instances), nmod (5; 5% of instances), obl (5; 5% of instances), advmod (4; 4% of instances), nsubj:pass (3; 3% of instances), advcl (2; 2% of instances), parataxis (2; 2% of instances), compound (1; 1% of instances), iobj (1; 1% of instances), nsubj (1; 1% of instances), nummod (1; 1% of instances), obj (1; 1% of instances).

Parents of SYM nodes belong to 9 different parts of speech: NOUN (48; 49% of instances), SYM (14; 14% of instances), VERB (14; 14% of instances), NUM (8; 8% of instances), ADJ (5; 5% of instances), PROPN (4; 4% of instances), CCONJ (3; 3% of instances), ADP (1; 1% of instances), DET (1; 1% of instances).
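The relation and parent-POS distributions above can be tallied as in the following sketch. It is illustrative rather than the generator's actual code: the tuple layout, the names `parent_stats` and `percentages`, and the choice to leave root nodes out of the parent-POS count are assumptions.

```python
from collections import Counter


def parent_stats(sentence, upos):
    """Count incoming relations and parent POS tags for nodes tagged `upos`.

    sentence: list of (id, upos, head, deprel) tuples for one tree.
    """
    pos_by_id = {tok_id: tok_upos for tok_id, tok_upos, _, _ in sentence}
    relations, parent_pos = Counter(), Counter()
    for tok_id, tok_upos, head, deprel in sentence:
        if tok_upos != upos:
            continue
        relations[deprel] += 1
        if head != 0:  # assumption: the artificial root has no POS to count
            parent_pos[pos_by_id[head]] += 1
    return relations, parent_pos


def percentages(counter):
    """Share of each key, rounded to a whole percent."""
    total = sum(counter.values())
    return {key: round(100 * n / total) for key, n in counter.items()}


# Hypothetical five-token tree.
sentence = [
    (1, "NUM", 2, "nummod"),
    (2, "NOUN", 0, "root"),
    (3, "SYM", 4, "cc"),
    (4, "NOUN", 2, "conj"),
    (5, "SYM", 4, "case"),
]
rels, parents = parent_stats(sentence, "SYM")
```

The rounding agrees with the figures on this page: 35 of 98 instances of cc is 35.7%, reported as 36%.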

60 (61%) SYM nodes are leaves.

9 (9%) SYM nodes have one child.

13 (13%) SYM nodes have two children.

16 (16%) SYM nodes have three or more children.

The highest child degree of a SYM node is 10.
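The child-degree figures above (leaves, one child, two children, three or more, and the maximum) can be derived from the HEAD column alone. A minimal sketch under stated assumptions: the tuple layout, the function names `child_degrees` and `bucket`, and the sample tree are hypothetical.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Number of children of each node tagged `upos`.

    sentence: list of (id, upos, head) tuples for one tree.
    """
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[tok_id] for tok_id, tok_upos, _ in sentence if tok_upos == upos]


def bucket(degrees):
    """Group degrees into the four classes used on this page."""
    buckets = Counter()
    for d in degrees:
        if d == 0:
            buckets["leaf"] += 1
        elif d == 1:
            buckets["one"] += 1
        elif d == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets


# Hypothetical tree: token 2 is the root, tokens 1 and 5 attach to it,
# tokens 3 and 4 attach to token 5.
sentence = [(1, "NUM", 2), (2, "SYM", 0), (3, "SYM", 5), (4, "NUM", 5), (5, "SYM", 2)]
degrees = child_degrees(sentence, "SYM")
highest = max(degrees)
```

Summing the four buckets gives the token count for the tag, which matches the page: 60 + 9 + 13 + 16 = 98.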

Children of SYM nodes are attached using 16 different relations: punct (15; 15% of instances), nummod (14; 14% of instances), case (11; 11% of instances), nsubj (11; 11% of instances), obj (11; 11% of instances), nmod (8; 8% of instances), compound (6; 6% of instances), list (6; 6% of instances), conj (5; 5% of instances), cc (4; 4% of instances), advmod (2; 2% of instances), flat (2; 2% of instances), mark (2; 2% of instances), acl (1; 1% of instances), amod (1; 1% of instances), det (1; 1% of instances).

Children of SYM nodes belong to 12 different parts of speech: NOUN (28; 28% of instances), NUM (18; 18% of instances), PUNCT (15; 15% of instances), SYM (14; 14% of instances), ADP (12; 12% of instances), PROPN (5; 5% of instances), ADV (2; 2% of instances), CCONJ (2; 2% of instances), ADJ (1; 1% of instances), DET (1; 1% of instances), SCONJ (1; 1% of instances), VERB (1; 1% of instances).