home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: SYM

There are 4 SYM lemmas (0%), 4 SYM types (0%) and 388 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 16 in number of lemmas, 16 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: %، +، /، <

The 10 most frequent SYM types: %، +، /، <

The 10 most frequent ambiguous lemmas: / (PUNCT 754, SYM 14)

The 10 most frequent ambiguous types: / (PUNCT 754, SYM 14)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.761966).

The 1st highest number of forms (1) was observed with the lemma “%”: %.

The 2nd highest number of forms (1) was observed with the lemma “+”: +.

The 3rd highest number of forms (1) was observed with the lemma “/”: /.

SYM occurs with 1 features: ConjType (5; 1% instances)

SYM occurs with 1 feature-value pairs: ConjType=Oper

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (383 tokens). Examples: %، +، /

Relations

SYM nodes are attached to their parents using 3 different relations: nmod (340; 88% instances), cc (43; 11% instances), dep (5; 1% instances)

Parents of SYM nodes belong to 4 different parts of speech: NUM (346; 89% instances), X (25; 6% instances), NOUN (11; 3% instances), VERB (6; 2% instances)

386 (99%) SYM nodes are leaves.

2 (1%) SYM nodes have one child.

The highest child degree of a SYM node is 1.

Children of SYM nodes are attached using 2 different relations: ccomp (1; 50% instances), nmod (1; 50% instances)

Children of SYM nodes belong to 2 different parts of speech: NOUN (1; 50% instances), VERB (1; 50% instances)