home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-PUD: POS Tags: SYM

There are 1 SYM lemmas (5%), 4 SYM types (0%) and 34 SYM tokens (0%). Out of 16 observed tags, the rank of SYM is: 14 in number of lemmas, 15 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: _

The 10 most frequent SYM types: %, £, °, /

The 10 most frequent ambiguous lemmas: _ (NOUN 4804, ADP 3324, DET 3037, VERB 3024, PUNCT 2548, ADJ 1607, PRON 1335, PROPN 1241, ADV 1035, CCONJ 562, NUM 458, AUX 274, SCONJ 206, X 48, SYM 34, PART 9)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 4.000000 (the average of all parts of speech is 309.550000).

The 1st highest number of forms (4) was observed with the lemma “_”: %, /, £, °.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 9 different relations: obl (10; 29% instances), obj (8; 24% instances), nmod (7; 21% instances), nsubj (4; 12% instances), advmod (1; 3% instances), appos (1; 3% instances), conj (1; 3% instances), dep (1; 3% instances), nsubj:pass (1; 3% instances)

Parents of SYM nodes belong to 5 different parts of speech: VERB (22; 65% instances), NOUN (5; 15% instances), SYM (3; 9% instances), ADJ (2; 6% instances), PROPN (2; 6% instances)

1 (3%) SYM nodes are leaves.

3 (9%) SYM nodes have one child.

23 (68%) SYM nodes have two children.

7 (21%) SYM nodes have three or more children.

The highest child degree of a SYM node is 3.

Children of SYM nodes are attached using 7 different relations: nummod (33; 47% instances), case (17; 24% instances), nmod (14; 20% instances), compound (2; 3% instances), punct (2; 3% instances), cc (1; 1% instances), conj (1; 1% instances)

Children of SYM nodes belong to 6 different parts of speech: NUM (34; 49% instances), ADP (17; 24% instances), NOUN (13; 19% instances), SYM (3; 4% instances), PUNCT (2; 3% instances), CCONJ (1; 1% instances)