home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GENTLE: POS Tags: SYM

There are 20 SYM lemmas (1%), 20 SYM types (0%) and 167 SYM tokens (1%). Out of 17 observed tags, the rank of SYM is: 13 in number of lemmas, 15 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤

The 10 most frequent SYM types: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤

The 10 most frequent ambiguous lemmas: (SYM 31, NOUN 1), (SYM 25, NOUN 1), = (SYM 20, PUNCT 1), - (PUNCT 130, SYM 13), / (SYM 10, PUNCT 5, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)

The 10 most frequent ambiguous types: (SYM 31, NOUN 1), (SYM 25, NOUN 1), = (SYM 20, PUNCT 1), - (PUNCT 129, SYM 13), / (SYM 10, PUNCT 5, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.147634).

The 1st highest number of forms (1) was observed with the lemma “$”: $.

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “+”: +.

SYM occurs with 1 features: Number (7; 4% instances)

SYM occurs with 1 feature-value pairs: Number=Sing

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (160 tokens). Examples: ⪯, ∈, =, -, ⋅, /, +, $, ≤, >

Relations

SYM nodes are attached to their parents using 19 different relations: case (30; 18% instances), conj (26; 16% instances), root (23; 14% instances), cc (22; 13% instances), advcl (11; 7% instances), appos (9; 5% instances), nmod (7; 4% instances), nsubj (7; 4% instances), parataxis (6; 4% instances), ccomp (5; 3% instances), xcomp (5; 3% instances), acl (4; 2% instances), obj (3; 2% instances), obl (3; 2% instances), compound (2; 1% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), csubj (1; 1% instances), orphan (1; 1% instances)

Parents of SYM nodes belong to 8 different parts of speech: NOUN (65; 39% instances), SYM (29; 17% instances), (23; 14% instances), NUM (22; 13% instances), VERB (19; 11% instances), ADJ (6; 4% instances), PROPN (2; 1% instances), CCONJ (1; 1% instances)

65 (39%) SYM nodes are leaves.

18 (11%) SYM nodes have one child.

20 (12%) SYM nodes have two children.

64 (38%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 21 different relations: nsubj (61; 20% instances), punct (32; 10% instances), nmod:unmarked (29; 9% instances), obj (24; 8% instances), obl:unmarked (23; 7% instances), conj (21; 7% instances), dep (21; 7% instances), advmod (16; 5% instances), cc (15; 5% instances), mark (15; 5% instances), nummod (15; 5% instances), case (12; 4% instances), advcl (9; 3% instances), obl (6; 2% instances), dislocated (3; 1% instances), parataxis (3; 1% instances), nmod (2; 1% instances), acl:relcl (1; 0% instances), cc:preconj (1; 0% instances), cop (1; 0% instances), nsubj:outer (1; 0% instances)

Children of SYM nodes belong to 12 different parts of speech: NOUN (154; 50% instances), PUNCT (32; 10% instances), SYM (29; 9% instances), NUM (23; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), ADP (12; 4% instances), SCONJ (9; 3% instances), ADJ (8; 3% instances), PROPN (7; 2% instances), AUX (1; 0% instances), VERB (1; 0% instances)