Treebank Statistics: UD_English-GENTLE: POS Tags: SYM
There are 20 SYM lemmas (1%), 20 SYM types (0%) and 169 SYM tokens (1%).
Out of 17 observed tags, the rank of SYM is: 12 in number of lemmas, 15 in number of types and 16 in number of tokens.
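As a rough illustration of how lemma, type and token counts like these could be derived, the sketch below tallies them for one UPOS tag from CoNLL-U text. The sample sentences and the helper name `upos_counts` are hypothetical and not taken from the treebank. Counting a type as a distinct case-sensitive word form is an assumption about how the figures above were produced.

```python
# Hypothetical two-sentence sample in CoNLL-U format (not from UD_English-GENTLE).
SAMPLE = (
    "1\tx\tx\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\t=\t=\tSYM\t_\t_\t0\troot\t_\t_\n"
    "3\t5\t5\tNUM\t_\t_\t2\tobj\t_\t_\n"
    "\n"
    "1\t10\t10\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\t%\t%\tSYM\t_\t_\t0\troot\t_\t_\n"
)


def upos_counts(conllu: str, upos: str) -> tuple[int, int, int]:
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges and empty nodes
        if cols[3] == upos:
            lemmas.add(cols[2])  # LEMMA column
            types.add(cols[1])   # FORM column
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "SYM"))  # (2, 2, 2)
```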
The 10 most frequent SYM lemmas: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤
The 10 most frequent SYM types: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤
The 10 most frequent ambiguous lemmas: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), - (PUNCT 131, SYM 13), / (SYM 11, PUNCT 4, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)
The 10 most frequent ambiguous types: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), - (PUNCT 129, SYM 13), / (SYM 11, PUNCT 4, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)
- ⪯
- ∈
- -
- /
- +
  - SYM 9: Weeks 15 + - Final Project
  - CCONJ 1: From Middle English nexte , nexste , nixte , from Old English nīehsta , nīehste , etc. , inflected forms of nīehst ( “ nearest , next ” ) , superlative form of nēah ( “ nigh , near ” ) , corresponding to Proto-Germanic *nēhwist ( “ nearest , closest ” ) ; equivalent to nigh + -est .
- x
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.148610).
The 1st highest number of forms (1) was observed with the lemma “$”: $.
The 2nd highest number of forms (1) was observed with the lemma “%”: %.
The 3rd highest number of forms (1) was observed with the lemma “+”: +.
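The form / lemma ratio can be read as the number of distinct forms divided by the number of distinct lemmas. A minimal sketch follows, assuming that each form is counted once per lemma. The exact definition used by the statistics generator is not stated on this page, and the function name and sample pairs are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma, given (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Symbols do not inflect, so each lemma has exactly one form.
sym_pairs = [("$", "$"), ("%", "%"), ("+", "+"), ("+", "+")]
print(form_lemma_ratio(sym_pairs))  # 1.0

# An inflecting lemma pushes the ratio above 1.
verb_pairs = [("is", "be"), ("was", "be"), ("go", "go")]
print(form_lemma_ratio(verb_pairs))  # 1.5
```

With 20 forms and 20 lemmas, the same calculation gives the ratio of 1.0 reported for SYM.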
SYM occurs with 2 features: ExtPos (31; 18% instances), Number (7; 4% instances)
SYM occurs with 2 feature-value pairs: ExtPos=ADP, Number=Sing
SYM occurs with 3 feature combinations.
The most frequent feature combination is _ (131 tokens).
Examples: ⪯, ∈, =, +, /, $, ≤, >, ⊆, ∖
Relations
SYM nodes are attached to their parents using 19 different relations: case (31; 18% instances), conj (27; 16% instances), root (23; 14% instances), cc (22; 13% instances), advcl (12; 7% instances), appos (9; 5% instances), nmod (7; 4% instances), nsubj (7; 4% instances), parataxis (6; 4% instances), ccomp (5; 3% instances), xcomp (5; 3% instances), acl (4; 2% instances), obj (3; 2% instances), compound (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), csubj (1; 1% instances), orphan (1; 1% instances)
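The percentages in the relation list can be reproduced from the raw counts. The sketch below copies the counts from the line above and assumes that each share is rounded to the nearest whole percent. The variable names are illustrative only.

```python
# Counts of incoming relations for SYM nodes, copied from the list above.
relation_counts = {
    "case": 31, "conj": 27, "root": 23, "cc": 22, "advcl": 12,
    "appos": 9, "nmod": 7, "nsubj": 7, "parataxis": 6, "ccomp": 5,
    "xcomp": 5, "acl": 4, "obj": 3, "compound": 2, "obl": 2,
    "acl:relcl": 1, "advmod": 1, "csubj": 1, "orphan": 1,
}

# The counts should add up to the number of SYM tokens.
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                                     # 169
print(percentages["case"], percentages["conj"])  # 18 16
```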
Parents of SYM nodes belong to 8 different parts of speech: NOUN (65; 38% instances), SYM (30; 18% instances), NUM (23; 14% instances), [no part of speech: the artificial root] (23; 14% instances), VERB (19; 11% instances), ADJ (6; 4% instances), PROPN (2; 1% instances), CCONJ (1; 1% instances)
66 (39%) SYM nodes are leaves.
18 (11%) SYM nodes have one child.
21 (12%) SYM nodes have two children.
64 (38%) SYM nodes have three or more children.
The highest child degree of a SYM node is 7.
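The four child-degree buckets above add up to the 169 SYM tokens (66 + 18 + 21 + 64). As a sketch of how such a distribution could be computed, the code below counts the children of each node from the HEAD column of a single sentence. The tuple layout `(id, upos, head)`, the function name and the sample sentence are hypothetical.

```python
from collections import Counter


def degree_buckets(sentence, upos):
    """Bucket the nodes with the given UPOS by their number of children.

    `sentence` is a list of (id, upos, head) tuples; head 0 marks the root.
    """
    children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, tag, _ in sentence:
        if tag != upos:
            continue
        n = children[node_id]  # a missing key counts as 0 children
        if n == 0:
            buckets["leaf"] += 1
        elif n == 1:
            buckets["one"] += 1
        elif n == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets


# "x = 5": the symbol heads both operands, so it has two children.
sentence = [(1, "NOUN", 2), (2, "SYM", 0), (3, "NUM", 2)]
print(degree_buckets(sentence, "SYM"))  # Counter({'two': 1})
```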
Children of SYM nodes are attached using 20 different relations: nsubj (62; 20% instances), punct (32; 10% instances), nmod:unmarked (29; 9% instances), obj (24; 8% instances), obl:unmarked (23; 7% instances), conj (22; 7% instances), dep (21; 7% instances), advmod (16; 5% instances), mark (16; 5% instances), cc (15; 5% instances), nummod (15; 5% instances), case (11; 4% instances), advcl (9; 3% instances), obl (6; 2% instances), dislocated (3; 1% instances), parataxis (3; 1% instances), nmod (2; 1% instances), acl:relcl (1; 0% instances), cc:preconj (1; 0% instances), cop (1; 0% instances)
Children of SYM nodes belong to 12 different parts of speech: NOUN (154; 49% instances), PUNCT (32; 10% instances), SYM (30; 10% instances), NUM (23; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), ADP (11; 4% instances), SCONJ (10; 3% instances), ADJ (8; 3% instances), PROPN (7; 2% instances), AUX (1; 0% instances), VERB (1; 0% instances)