Statistics of SYM in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_English-GENTLE: POS Tags: `SYM`

There are 20 SYM lemmas (1%), 20 SYM types (0%) and 169 SYM tokens (1%). Out of 17 observed tags, the rank of SYM is: 12 in number of lemmas, 15 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤

The 10 most frequent SYM types: ⪯, ∈, =, -, ⋅, /, %, +, $, ≤

The 10 most frequent ambiguous lemmas: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), - (PUNCT 131, SYM 13), / (SYM 11, PUNCT 4, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)

The 10 most frequent ambiguous types: ⪯ (SYM 31, NOUN 1), ∈ (SYM 25, NOUN 1), - (PUNCT 129, SYM 13), / (SYM 11, PUNCT 4, CCONJ 3), + (SYM 9, CCONJ 1), x (NOUN 47, ADJ 2, ADV 1, SYM 1)

⪯
- SYM 31: Let U = ( S , ⪯ S ) ∪ ( T , ⪯ T )
- NOUN 1: We claim that ⪯ is a well - ordering .
∈
- SYM 25: Let x , y , z ∈ U .
- NOUN 1: Suppose x ∈ T and x ≠ min T .
-
- PUNCT 129: next - day
- SYM 13: It ‘s a 3 - 1 victory , Genoa was just some BS .
/
- SYM 11: VITALS : BP : 120 / 74 .
- PUNCT 4: enPR : nĕ kst , IPA ( key ) : / nɛkst /
- CCONJ 3: 12. UI / UX
+
- SYM 9: Weeks 15 + - Final Project
- CCONJ 1: From Middle English nexte , nexste , nixte , from Old English nīehsta , nīehste , etc. , inflected forms of nīehst ( “ nearest , next ” ) , superlative form of nēah ( “ nigh , near ” ) , corresponding to Proto-Germanic *nēhwist ( “ nearest , closest ” ) ; equivalent to nigh + -est .
x
- NOUN 47: Let this vertex be labeled x .
- ADJ 2: Let x ⪯ y and y ⪯ z .
- ADV 1: Define the following relation ⪯ on U : ∀ x , y ∈ U : x ⪯ y if and only if : x , y ∈ S : x ⪯ S y or : x , y ∈ T : x ⪯ T y or : x ∈ S , y ∈ T
- SYM 1: Suppose x ∈ T and x ≠ min T .

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.148610).

The 1st highest number of forms (1) was observed with the lemma “$”: $.

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “+”: +.

SYM occurs with 2 features: ExtPos (31; 18% instances), Number (7; 4% instances)

SYM occurs with 2 feature-value pairs: ExtPos=ADP, Number=Sing

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (131 tokens). Examples: ⪯, ∈, =, +, /, $, ≤, >, ⊆, ∖

Relations

SYM nodes are attached to their parents using 19 different relations: case (31; 18% instances), conj (27; 16% instances), root (23; 14% instances), cc (22; 13% instances), advcl (12; 7% instances), appos (9; 5% instances), nmod (7; 4% instances), nsubj (7; 4% instances), parataxis (6; 4% instances), ccomp (5; 3% instances), xcomp (5; 3% instances), acl (4; 2% instances), obj (3; 2% instances), compound (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), csubj (1; 1% instances), orphan (1; 1% instances)

Parents of SYM nodes belong to 8 different parts of speech: NOUN (65; 38% instances), SYM (30; 18% instances), NUM (23; 14% instances), (23; 14% instances), VERB (19; 11% instances), ADJ (6; 4% instances), PROPN (2; 1% instances), CCONJ (1; 1% instances)

66 (39%) SYM nodes are leaves.

18 (11%) SYM nodes have one child.

21 (12%) SYM nodes have two children.

64 (38%) SYM nodes have three or more children.

The highest child degree of a SYM node is 7.

Children of SYM nodes are attached using 20 different relations: nsubj (62; 20% instances), punct (32; 10% instances), nmod:unmarked (29; 9% instances), obj (24; 8% instances), obl:unmarked (23; 7% instances), conj (22; 7% instances), dep (21; 7% instances), advmod (16; 5% instances), mark (16; 5% instances), cc (15; 5% instances), nummod (15; 5% instances), case (11; 4% instances), advcl (9; 3% instances), obl (6; 2% instances), dislocated (3; 1% instances), parataxis (3; 1% instances), nmod (2; 1% instances), acl:relcl (1; 0% instances), cc:preconj (1; 0% instances), cop (1; 0% instances)

Children of SYM nodes belong to 12 different parts of speech: NOUN (154; 49% instances), PUNCT (32; 10% instances), SYM (30; 10% instances), NUM (23; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), ADP (11; 4% instances), SCONJ (10; 3% instances), ADJ (8; 3% instances), PROPN (7; 2% instances), AUX (1; 0% instances), VERB (1; 0% instances)

Treebank Statistics: UD_English-GENTLE: POS Tags: SYM

Morphology

Relations

Treebank Statistics: UD_English-GENTLE: POS Tags: `SYM`