home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: X

There are 6 X lemmas (0%), 56 X types (1%) and 78 X tokens (0%). Out of 16 observed tags, the rank of X is: 15 in number of lemmas, 13 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: , X, d, d\, que, z

The 10 most frequent X types: allah, ce, non, tout, 3alou, book, jour, la, a, abi

The 10 most frequent ambiguous lemmas: _ (X 70, DET 11, PUNCT 5, NOUN 1), X (VERB 6, X 4), d (ADP 1, X 1), que (SCONJ 90, PART 60, ADV 14, ADP 12, PRON 12, X 1)

The 10 most frequent ambiguous types: allah (PROPN 91, X 9), ce (PRON 10, DET 7, X 4), non (INTJ 5, X 4), tout (PRON 10, ADJ 9, ADV 3, X 3, DET 2), jour (NOUN 3, X 2), la (DET 210, PART 86, ADP 6, ADV 5, SCONJ 2, VERB 2, X 2, ADJ 1, CCONJ 1, PRON 1, PROPN 1), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), alah (PROPN 19, X 1), ana (PRON 50, SCONJ 4, ADP 1, X 1), bik (PRON 4, X 1)

Morphology

The form / lemma ratio of X is 9.333333 (the average of all parts of speech is 1.474223).

The 1st highest number of forms (49) was observed with the lemma “_”: 3alou, a, abi, alah, allah, ana, balle, bik, bla, book, ce, cha, chalah, chalahe, chalahh, challah, da3imou, dhou, etre, fakiroune, fin, fondez, frique, g3alha, gum, homa, jour, koune, la, ladkom, lah, llah, ma, nara, non, om, qoi, shalah, somble, tife, tou, tous, tout, tt, venu, when, where, zam, être.

The 2nd highest number of forms (4) was observed with the lemma “X”: ahe, fara7koum, ina, non.

The 3rd highest number of forms (1) was observed with the lemma “d”: d.

X does not occur with any features.

Relations

X nodes are attached to their parents using 8 different relations: goeswith (70; 90% instances), dep (2; 3% instances), flat (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), obl (1; 1% instances), xcomp (1; 1% instances)

Parents of X nodes belong to 10 different parts of speech: INTJ (22; 28% instances), ADV (14; 18% instances), VERB (14; 18% instances), PROPN (7; 9% instances), NOUN (6; 8% instances), SCONJ (6; 8% instances), CCONJ (3; 4% instances), PRON (3; 4% instances), ADP (2; 3% instances), ADJ (1; 1% instances)

77 (99%) X nodes are leaves.

1 (1%) X nodes have one child.

The highest child degree of a X node is 1.

Children of X nodes are attached using 1 different relations: case (1; 100% instances)

Children of X nodes belong to 1 different parts of speech: ADP (1; 100% instances)