
This page pertains to UD version 2.

Treebank Statistics: UD_French: POS Tags: SYM

There are 72 SYM lemmas (0%), 71 SYM types (0%) and 559 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 10 in number of lemmas, 12 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: %, €, °, &, +, $, =, H, m, “

The 10 most frequent SYM types: %, €, °, &, +, $, =, n°, H, m

The 10 most frequent ambiguous lemmas: + (SYM 18, PUNCT 3), m (NOUN 95, SYM 1), (PUNCT 954, SYM 4), * (SYM 3, PUNCT 2), A (PROPN 11, X 7, NOUN 4, SYM 3), x (PUNCT 3, SYM 2, X 2), (PUNCT 31, SYM 2), C (NOUN 14, SYM 2, PROPN 1), / (PUNCT 126, SYM 1), > (PUNCT 1, SYM 1)

The 10 most frequent ambiguous types: + (SYM 18, PUNCT 3), (SYM 13, NOUN 1), H (SYM 6, NOUN 2), m (NOUN 76, SYM 1), (PUNCT 954, SYM 4), C (NOUN 14, SYM 4, PRON 1, PROPN 1), * (SYM 3, PUNCT 2), A (ADP 87, PROPN 11, X 7, DET 5, NOUN 4, SYM 3, AUX 1), x (PUNCT 3, SYM 2, X 2), (PUNCT 31, SYM 2)

Morphology

The form / lemma ratio of SYM is 0.986111 (the average of all parts of speech is 1.306238).
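The reported value matches the number of distinct SYM types (71) divided by the number of distinct SYM lemmas (72). A minimal sketch of that calculation follows, assuming the input is a list of (form, lemma) pairs already filtered to a single tag. The name `form_lemma_ratio` and the sample pairs are illustrative and not part of any UD tooling; whether the original statistics normalize case before counting is not stated on this page.

```python
def form_lemma_ratio(pairs):
    """Return distinct forms / distinct lemmas for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Illustrative pairs only: the lemma "°" appears with two forms.
sample = [("n°", "°"), ("°", "°"), ("%", "%")]
ratio = form_lemma_ratio(sample)  # 3 distinct forms over 2 distinct lemmas

# Check against the figures reported on this page.
print(round(71 / 72, 6))  # 0.986111
```

A ratio below 1 means that some forms are shared by more than one lemma, so there are fewer distinct forms than distinct lemmas.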

The 1st highest number of forms (2) was observed with the lemma “°”: n°, °.

The 2nd highest number of forms (1) was observed with the lemma “””: .

The 3rd highest number of forms (1) was observed with the lemma “#”: #.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 19 different relations: nmod (122; 22% instances), obl (105; 19% instances), appos (69; 12% instances), obj (56; 10% instances), conj (52; 9% instances), nsubj (38; 7% instances), cc (29; 5% instances), compound (21; 4% instances), dep (15; 3% instances), flat:name (14; 3% instances), punct (12; 2% instances), root (10; 2% instances), case (4; 1% instances), xcomp (4; 1% instances), nsubj:pass (3; 1% instances), orphan (2; 0% instances), ccomp (1; 0% instances), discourse (1; 0% instances), nummod (1; 0% instances)
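Each percentage above is the relation's share of the 559 SYM tokens, rounded to a whole percent (the 19 counts sum to 559). A short sketch of that tally follows, assuming the relation labels of the SYM nodes have already been collected into a list. The function name `relation_shares` and the toy input are hypothetical.

```python
from collections import Counter


def relation_shares(deprels):
    """Return (relation, count, rounded percent) sorted by frequency."""
    counts = Counter(deprels)
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Toy input, not treebank data.
shares = relation_shares(["nmod", "nmod", "nmod", "obl"])

# Check one reported figure: nmod has 122 of 559 instances.
print(round(100 * 122 / 559))  # 22
```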

Parents of SYM nodes belong to 13 different parts of speech: NOUN (193; 35% instances), VERB (188; 34% instances), SYM (66; 12% instances), PROPN (49; 9% instances), ADJ (27; 5% instances), NUM (11; 2% instances), [no parent; root nodes] (10; 2% instances), X (9; 2% instances), ADV (2; 0% instances), ADP (1; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), PRON (1; 0% instances)

107 (19%) SYM nodes are leaves.

75 (13%) SYM nodes have one child.

184 (33%) SYM nodes have two children.

193 (35%) SYM nodes have three or more children.

The highest child degree of a SYM node is 14.
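The four buckets above cover all 559 SYM tokens (107 + 75 + 184 + 193). A sketch of how such a distribution could be derived from CoNLL-U data follows, assuming the standard ten tab-separated columns (ID in column 1, UPOS in column 4, HEAD in column 7). The helper names and the one-sentence sample are made up for illustration and are not taken from UD_French or from the script that generated this page.

```python
from collections import Counter

# Made-up sample sentence in CoNLL-U format, for illustration only.
SAMPLE = (
    "# text = 10 % de croissance\n"
    "1\t10\t10\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\t%\t%\tSYM\t_\t_\t0\troot\t_\t_\n"
    "3\tde\tde\tADP\t_\t_\t4\tcase\t_\t_\n"
    "4\tcroissance\tcroissance\tNOUN\t_\t_\t2\tnmod\t_\t_\n"
    "\n"
)


def read_sentences(text):
    """Yield each sentence as a list of column lists (basic nodes only)."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword token range or empty node
        sentence.append(cols)
    if sentence:
        yield sentence


def child_degree_buckets(text, upos="SYM"):
    """Count nodes of the given tag with 0, 1, 2, or 3+ children (key 3)."""
    buckets = Counter()
    for sentence in read_sentences(text):
        n_children = Counter(cols[6] for cols in sentence)  # HEAD column
        for cols in sentence:
            if cols[3] == upos:
                buckets[min(n_children[cols[0]], 3)] += 1
    return buckets


buckets = child_degree_buckets(SAMPLE)
```

In the sample, the SYM node "%" has two children ("10" and "croissance"), so it falls into the two-children bucket.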

Children of SYM nodes are attached using 18 different relations: nummod (412; 35% instances), case (217; 18% instances), nmod (199; 17% instances), punct (164; 14% instances), conj (42; 4% instances), compound (36; 3% instances), cc (24; 2% instances), det (21; 2% instances), advmod (12; 1% instances), cop (12; 1% instances), nsubj (12; 1% instances), appos (11; 1% instances), amod (6; 1% instances), acl (2; 0% instances), advcl (2; 0% instances), flat:name (2; 0% instances), acl:relcl (1; 0% instances), mark (1; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: NUM (429; 36% instances), ADP (213; 18% instances), NOUN (198; 17% instances), PUNCT (163; 14% instances), SYM (66; 6% instances), DET (21; 2% instances), ADV (14; 1% instances), CCONJ (14; 1% instances), PROPN (13; 1% instances), AUX (12; 1% instances), PRON (10; 1% instances), SCONJ (9; 1% instances), ADJ (8; 1% instances), VERB (5; 0% instances), X (1; 0% instances)