Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: PUNCT
There are 43 PUNCT
lemmas (1%), 44 PUNCT
types (1%) and 251 PUNCT
tokens (1%).
Out of 16 observed tags, the rank of PUNCT
is: 11 in number of lemmas, 15 in number of types and 12 in number of tokens.
The 10 most frequent PUNCT
lemmas: ,, (, ?, ), !, -, /, ;, !!!, !!
The 10 most frequent PUNCT
types: ,, (, ), ?, -, !!!, !!, ;, /, !!!!
The 10 most frequent ambiguous lemmas: _ (X 70, DET 11, PUNCT 5, NOUN 1), plus (ADV 34, PUNCT 5, ADJ 2)
The 10 most frequent ambiguous types: / (PUNCT 8, ADP 1), + (PUNCT 5, ADV 4)
- /
- +
Morphology
The form / lemma ratio of PUNCT
is 1.023256 (the average of all parts of speech is 1.474223).
The 1st highest number of forms (7) was observed with the lemma “!”: !, !!, !!!, !!!!, !!!!!, !!!!!!, !!!!!!!!.
The 2nd highest number of forms (5) was observed with the lemma “?”: ?, ??, ???, ????, ?????.
The 3rd highest number of forms (1) was observed with the lemma “!!”: !!.
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (251; 100% instances)
Parents of PUNCT
nodes belong to 10 different parts of speech: VERB (139; 55% instances), NOUN (41; 16% instances), NUM (23; 9% instances), PROPN (17; 7% instances), ADJ (14; 6% instances), PRON (9; 4% instances), INTJ (4; 2% instances), ADV (2; 1% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)
251 (100%) PUNCT
nodes are leaves.
The highest child degree of a PUNCT
node is 0.