home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hindi-PUD: POS Tags: SYM

There are 1 SYM lemmas (6%), 3 SYM types (0%) and 30 SYM tokens (0%). Out of 16 observed tags, the rank of SYM is: 14 in number of lemmas, 16 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: _

The 10 most frequent SYM types: %, £, €

The 10 most frequent ambiguous lemmas: _ (NOUN 5597, ADP 4849, PUNCT 2297, VERB 2058, ADJ 1995, AUX 1776, PROPN 1358, PRON 1128, DET 876, CCONJ 545, NUM 452, SCONJ 382, PART 316, ADV 159, SYM 30, X 11)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 3.000000 (the average of all parts of speech is 345.375000).

The 1st highest number of forms (3) was observed with the lemma “_”: %, £, €.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 4 different relations: dep (24; 80% instances), discourse (4; 13% instances), compound (1; 3% instances), nmod (1; 3% instances)

Parents of SYM nodes belong to 2 different parts of speech: NUM (27; 90% instances), NOUN (3; 10% instances)

27 (90%) SYM nodes are leaves.

2 (7%) SYM nodes have one child.

1 (3%) SYM nodes have two children.

The highest child degree of a SYM node is 2.

Children of SYM nodes are attached using 2 different relations: nummod (3; 75% instances), case (1; 25% instances)

Children of SYM nodes belong to 2 different parts of speech: NUM (3; 75% instances), ADP (1; 25% instances)