
This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-AnCora: POS Tags: SYM

There are 46 SYM lemmas (0%), 47 SYM types (0%) and 427 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 10 in number of lemmas, 11 in number of types and 14 in number of tokens.
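The three counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy `SAMPLE` sentences are made up for illustration, and it assumes that "types" means distinct case-sensitive word forms and that "lemmas" means distinct LEMMA values.

```python
def upos_counts(conllu_text, upos="SYM"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # FORM, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


# Hypothetical toy data, not taken from UD_Spanish-AnCora.
SAMPLE = (
    "1-2\tdel\t_\t_\t_\t_\t_\t_\t_\t_\n"
    "1\tde\tde\tADP\t_\t_\t4\tcase\t_\t_\n"
    "2\tel\tel\tDET\t_\t_\t4\tdet\t_\t_\n"
    "3\t5\t5\tNUM\t_\tNumForm=Digit\t4\tnummod\t_\t_\n"
    "4\t%\t%\tSYM\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\tIbex-35\tibex-35\tSYM\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tsubió\tsubir\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t%\t%\tSYM\t_\t_\t2\tobj\t_\t_\n"
)

print(upos_counts(SAMPLE))
```

On the toy sample this counts two lemmas, two types and three tokens for SYM; run over the full treebank files, the same function would be expected to approach the figures reported above if the assumptions hold.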

The 10 most frequent SYM lemmas: %, ibex-35, .14, .23, .46, 23, a, a-7, b, cac-40

The 10 most frequent SYM types: %, Ibex-35, A-7, CAC-40, a, b, m.14:, m.46, sub’23, *

The 10 most frequent ambiguous lemmas: ibex-35 (SYM 3, PROPN 1), 23 (NUM 48, SYM 2, NOUN 1), a (ADP 13407, NOUN 9, SYM 2), b (NOUN 4, SYM 2), 12 (NUM 89, NOUN 1, SYM 1), 2000 (NUM 73, NOUN 53, SYM 1), 22 (NUM 39, SYM 1), 24 (NUM 66, SYM 1), 8 (NUM 31, SYM 1), 89 (NUM 5, SYM 1)

The 10 most frequent ambiguous types: Ibex-35 (SYM 3, PROPN 1), A-7 (PROPN 3, SYM 2), a (ADP 12951, DET 6, NOUN 6, SYM 2), b (SYM 2, NOUN 1), 22 (NUM 39, SYM 1), 23 (NUM 47, NOUN 1, SYM 1), 8 (NUM 31, SYM 1), 9 (NUM 33, SYM 1), ARC`2000 (PROPN 2, SYM 1), c (NOUN 1, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.021739 (the average of all parts of speech is 1.505634).

The 1st highest number of forms (2) was observed with the lemma “23”: 23, sub`23.

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “*”: *.
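The form / lemma ratio is consistent with dividing the 47 SYM types by the 46 SYM lemmas (47 / 46 ≈ 1.021739). The sketch below groups forms by lemma and computes that ratio; it assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, which matches the reported value but is not confirmed as the generator's exact method. The name `form_lemma_ratio` and the toy pairs are illustrative.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma). Return (ratio, forms grouped by lemma)."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)


# Toy subset: one lemma with two forms, two lemmas with one form each.
pairs = [("23", "23"), ("sub`23", "23"), ("%", "%"), ("*", "*"), ("%", "%")]
ratio, grouped = form_lemma_ratio(pairs)

# Lemmas ranked by number of distinct forms, highest first.
ranking = sorted(grouped, key=lambda lemma: len(grouped[lemma]), reverse=True)

# Arithmetic check against the figures reported on this page.
reported = round(47 / 46, 6)
```

Repeated (form, lemma) pairs are counted once, so token frequency does not affect the ratio.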

SYM occurs with 1 feature: NumForm (20; 5% instances)

SYM occurs with 1 feature-value pair: NumForm=Digit

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (407 tokens). Examples: %, Ibex-35, A-7, CAC-40, a, b, m.14:, m.46, sub’23, *

Relations

SYM nodes are attached to their parents using 17 different relations: nmod (107; 25% instances), obj (99; 23% instances), appos (61; 14% instances), obl (50; 12% instances), nsubj (37; 9% instances), conj (22; 5% instances), advmod (13; 3% instances), obl:arg (10; 2% instances), dep (7; 2% instances), nummod (5; 1% instances), ccomp (4; 1% instances), root (4; 1% instances), acl (2; 0% instances), advcl (2; 0% instances), parataxis (2; 0% instances), compound (1; 0% instances), obl:agent (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (209; 49% instances), NOUN (113; 26% instances), PROPN (32; 7% instances), NUM (26; 6% instances), SYM (22; 5% instances), ADJ (11; 3% instances), ADV (8; 2% instances), no part of speech, i.e. the artificial root (4; 1% instances), ADP (1; 0% instances), PRON (1; 0% instances)
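Both distributions above (incoming relation and parent part of speech) can be tallied in one pass over the HEAD and DEPREL columns. This is a hedged sketch under stated assumptions: `parse_sentences`, `attachment_stats`, the `"ROOT"` label for HEAD = 0, and the toy `SAMPLE` are all illustrative names and data, not part of the page's actual tooling.

```python
from collections import Counter


def parse_sentences(conllu_text):
    """Yield each sentence as a dict mapping token ID -> list of columns."""
    sent = {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():          # blank line closes a sentence
            if sent:
                yield sent
            sent = {}
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():     # ignore ranges and empty nodes
                sent[cols[0]] = cols


def attachment_stats(conllu_text, upos="SYM"):
    """Count DEPREL and parent UPOS for every node tagged `upos`."""
    deprels, parent_upos = Counter(), Counter()
    for sent in parse_sentences(conllu_text):
        for cols in sent.values():
            if cols[3] != upos:
                continue
            deprels[cols[7]] += 1
            head = cols[6]
            # HEAD 0 is the artificial root, which has no part of speech.
            parent_upos["ROOT" if head == "0" else sent[head][3]] += 1
    return deprels, parent_upos


# Hypothetical toy data, not taken from UD_Spanish-AnCora.
SAMPLE = (
    "1\tEl\tel\tDET\t_\t_\t3\tdet\t_\t_\n"
    "2\t5\t5\tNUM\t_\tNumForm=Digit\t3\tnummod\t_\t_\n"
    "3\t%\t%\tSYM\t_\t_\t4\tnsubj\t_\t_\n"
    "4\tsubió\tsubir\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\t%\t%\tSYM\t_\t_\t0\troot\t_\t_\n"
)

deprels, parent_upos = attachment_stats(SAMPLE)
```

Under this scheme, SYM nodes attached as `root` appear with the placeholder parent label instead of a UPOS tag, so the number of `root` relations equals the number of parents without a part of speech.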

42 (10%) SYM nodes are leaves.

10 (2%) SYM nodes have one child.

86 (20%) SYM nodes have two children.

289 (68%) SYM nodes have three or more children.

The highest child degree of a SYM node is 9.

Children of SYM nodes are attached using 16 different relations: nummod (355; 28% instances), det (329; 26% instances), punct (159; 12% instances), case (155; 12% instances), nmod (127; 10% instances), advmod (45; 4% instances), cc (21; 2% instances), appos (20; 2% instances), conj (20; 2% instances), cop (12; 1% instances), obl (12; 1% instances), amod (11; 1% instances), nsubj (8; 1% instances), advcl (4; 0% instances), mark (4; 0% instances), parataxis (1; 0% instances)

Children of SYM nodes belong to 14 different parts of speech: NUM (355; 28% instances), DET (329; 26% instances), PUNCT (159; 12% instances), ADP (152; 12% instances), NOUN (129; 10% instances), ADV (45; 4% instances), SYM (22; 2% instances), PROPN (21; 2% instances), ADJ (18; 1% instances), CCONJ (18; 1% instances), PRON (15; 1% instances), AUX (12; 1% instances), VERB (5; 0% instances), SCONJ (3; 0% instances)
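The child-degree buckets (leaves, one child, two children, three or more) and the relations of children can be derived by counting how often each token ID occurs as a HEAD. The sketch below works on already-parsed sentences represented as `(id, upos, head, deprel)` tuples; that representation, the function name `child_stats`, and the toy sentences are assumptions made for illustration.

```python
from collections import Counter


def child_stats(sentences, upos="SYM"):
    """Return (degree buckets, child deprel counts) for nodes tagged `upos`.

    Bucket keys are 0, 1, 2 and 3, where 3 stands for "three or more".
    """
    degree_buckets, child_deprels = Counter(), Counter()
    for sent in sentences:
        n_children = Counter(head for _, _, head, _ in sent)
        upos_of = {tid: pos for tid, pos, _, _ in sent}
        for tid, pos, head, deprel in sent:
            if pos == upos:
                degree_buckets[min(n_children[tid], 3)] += 1
            # upos_of.get(0) is None, so root attachments are skipped here.
            if upos_of.get(head) == upos:
                child_deprels[deprel] += 1
    return degree_buckets, child_deprels


# Hypothetical toy sentences, not taken from UD_Spanish-AnCora.
sentences = [
    [(1, "DET", 3, "det"), (2, "NUM", 3, "nummod"),
     (3, "SYM", 4, "nsubj"), (4, "VERB", 0, "root")],
    [(1, "VERB", 0, "root"), (2, "SYM", 1, "obj")],
]

degree_buckets, child_deprels = child_stats(sentences)

# The four buckets reported on this page cover every SYM token.
assert 42 + 10 + 86 + 289 == 427
```

In the toy data one SYM node has two children (a determiner and a numeral, the two most common child types reported above) and one SYM node is a leaf.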