Treebank Statistics: UD_English-GENTLE: POS Tags: SYM
There are 20 SYM
lemmas (1%), 20 SYM
types (0%) and 167 SYM
tokens (1%).
Out of 17 observed tags, the rank of SYM
is: 13 in number of lemmas, 15 in number of types and 16 in number of tokens.
The 10 most frequent SYM
lemmas: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤
The 10 most frequent SYM
types: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤
The 10 most frequent ambiguous lemmas: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), = (SYM 20, PUNCT 1), - (PUNCT 130, SYM 13), / (SYM 10, PUNCT 5, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)
The 10 most frequent ambiguous types: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), = (SYM 20, PUNCT 1), - (PUNCT 129, SYM 13), / (SYM 10, PUNCT 5, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)
- ⪯
- ∈
- =
- -
- /
- +
- SYM 9: Weeks 15 + - Final Project
- CCONJ 1: From Middle English nexte , nexste , nixte , from Old English nīehsta , nīehste , etc. , inflected forms of nīehst ( “ nearest , next ” ) , superlative form of nēah ( “ nigh , near ” ) , corresponding to Proto-Germanic *nēhwist ( “ nearest , closest ” ) ; equivalent to nigh + -est .
- x
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.147634).
The 1st highest number of forms (1) was observed with the lemma “$”: $.
The 2nd highest number of forms (1) was observed with the lemma “%”: %.
The 3rd highest number of forms (1) was observed with the lemma “+”: +.
SYM
occurs with 1 features: Number (7; 4% instances)
SYM
occurs with 1 feature-value pairs: Number=Sing
SYM
occurs with 2 feature combinations.
The most frequent feature combination is _
(160 tokens).
Examples: ⪯, ∈, =, -, ⋅, /, +, $, ≤, >
Relations
SYM
nodes are attached to their parents using 19 different relations: case (30; 18% instances), conj (26; 16% instances), root (23; 14% instances), cc (22; 13% instances), advcl (11; 7% instances), appos (9; 5% instances), nmod (7; 4% instances), nsubj (7; 4% instances), parataxis (6; 4% instances), ccomp (5; 3% instances), xcomp (5; 3% instances), acl (4; 2% instances), obj (3; 2% instances), obl (3; 2% instances), compound (2; 1% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), csubj (1; 1% instances), orphan (1; 1% instances)
Parents of SYM
nodes belong to 8 different parts of speech: NOUN (65; 39% instances), SYM (29; 17% instances), (23; 14% instances), NUM (22; 13% instances), VERB (19; 11% instances), ADJ (6; 4% instances), PROPN (2; 1% instances), CCONJ (1; 1% instances)
65 (39%) SYM
nodes are leaves.
18 (11%) SYM
nodes have one child.
20 (12%) SYM
nodes have two children.
64 (38%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 8.
Children of SYM
nodes are attached using 21 different relations: nsubj (61; 20% instances), punct (32; 10% instances), nmod:unmarked (29; 9% instances), obj (24; 8% instances), obl:unmarked (23; 7% instances), conj (21; 7% instances), dep (21; 7% instances), advmod (16; 5% instances), cc (15; 5% instances), mark (15; 5% instances), nummod (15; 5% instances), case (12; 4% instances), advcl (9; 3% instances), obl (6; 2% instances), dislocated (3; 1% instances), parataxis (3; 1% instances), nmod (2; 1% instances), acl:relcl (1; 0% instances), cc:preconj (1; 0% instances), cop (1; 0% instances), nsubj:outer (1; 0% instances)
Children of SYM
nodes belong to 12 different parts of speech: NOUN (154; 50% instances), PUNCT (32; 10% instances), SYM (29; 9% instances), NUM (23; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), ADP (12; 4% instances), SCONJ (9; 3% instances), ADJ (8; 3% instances), PROPN (7; 2% instances), AUX (1; 0% instances), VERB (1; 0% instances)