home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PUD: POS Tags: X

There are 4 X lemmas (0%), 4 X types (0%) and 5 X tokens (0%). Out of 16 observed tags, the rank of X is: 14 in number of lemmas, 15 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: yAng_1، >asA-u_1، >ay~_1، surEAna_1

The 10 most frequent X types: يانغ، سرعان، فاي، ياسو

The 10 most frequent ambiguous lemmas: >ay~_1 (DET 13, PROPN 2, ADV 1, NOUN 1, X 1), surEAna_1 (ADV 2, X 1)

The 10 most frequent ambiguous types: سرعان (ADV 2, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.409765).

The 1st highest number of forms (1) was observed with the lemma “>asA-u_1”: ياسو.

The 2nd highest number of forms (1) was observed with the lemma “>ay~_1”: فاي.

The 3rd highest number of forms (1) was observed with the lemma “surEAna_1”: سرعان.

X does not occur with any features.

Relations

X nodes are attached to their parents using 2 different relations: goeswith (4; 80% instances), discourse (1; 20% instances)

Parents of X nodes belong to 3 different parts of speech: PROPN (3; 60% instances), NOUN (1; 20% instances), VERB (1; 20% instances)

3 (60%) X nodes are leaves.

2 (40%) X nodes have one child.

The highest child degree of a X node is 1.

Children of X nodes are attached using 1 different relations: punct (2; 100% instances)

Children of X nodes belong to 1 different parts of speech: PUNCT (2; 100% instances)