Treebank Statistics: UD_Arabic-PADT: Features: NumForm
This feature is language-specific.
It occurs with 2 different values: Digit, Word.
7758 tokens (3%) have a non-empty value of NumForm.
1083 types (4%) occur at least once with a non-empty value of NumForm.
993 lemmas (6%) occur at least once with a non-empty value of NumForm.
The feature is used with 1 part-of-speech tag: NUM (7758; 3% instances).
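Counts like the ones above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the `SAMPLE` fragment is made up, `parse_feats` and `feature_stats` are hypothetical helper names, and "types" are assumed to be raw word forms without any case folding or normalization.

```python
from collections import Counter

# Made-up CoNLL-U fragment (10 tab-separated columns), for illustration only.
SAMPLE = "\n".join([
    "# text = toy example",
    "1\t15\t15\tNUM\t_\tNumForm=Digit\t0\troot\t_\t_",
    "2\tمليون\tمليون\tNUM\t_\tCase=Gen|NumForm=Word\t1\tconj\t_\t_",
    "3\t15\t15\tNUM\t_\tNumForm=Digit\t1\tconj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
])

def parse_feats(column):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if column == "_":
        return {}
    return dict(pair.split("=", 1) for pair in column.split("|"))

def feature_stats(conllu_text, feature):
    """Count tokens, types, lemmas, values and UPOS tags carrying `feature`."""
    total = 0
    with_feature = 0
    types, lemmas = set(), set()
    values, tags = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2), empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        value = parse_feats(cols[5]).get(feature)
        if value is None:
            continue
        with_feature += 1
        types.add(cols[1])    # FORM column
        lemmas.add(cols[2])   # LEMMA column
        values[value] += 1
        tags[cols[3]] += 1    # UPOS column
    return {
        "total": total,
        "with_feature": with_feature,
        "types": len(types),
        "lemmas": len(lemmas),
        "values": values,
        "tags": tags,
    }

stats = feature_stats(SAMPLE, "NumForm")
```

To run this on a real treebank, read the `.conllu` file as UTF-8 text and pass its contents instead of `SAMPLE`.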
NUM
7758 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: Number=EMPTY (6316; 81%), Definite=EMPTY (5551; 72%), Case=EMPTY (5550; 72%).
NUM tokens may have the following values of NumForm:
Digit (5521; 71% of non-empty NumForm): 15، 3، 6، 2، 8، 7، 4، 11، 10، 12
Word (2237; 29% of non-empty NumForm): مليون، مليار، ألف، ثلاثة، ملايين، المئة، بليون، الف، المائة، عشرة
NumForm seems to be a lexical feature of NUM. 100% of lemmas (993) occur with only one value of NumForm.
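The lexical-feature claim amounts to checking, for each lemma, how many distinct values of the feature it is seen with. The sketch below illustrates that check under stated assumptions: the data in `SAMPLE` is invented, `lexical_share` is a hypothetical helper name, and the exact criterion used to generate this page may differ.

```python
from collections import defaultdict

# Made-up CoNLL-U fragment: two spellings sharing one lemma, plus a digit.
SAMPLE = "\n".join([
    "1\tألف\tألف\tNUM\t_\tNumForm=Word\t0\troot\t_\t_",
    "2\tالف\tألف\tNUM\t_\tCase=Gen|NumForm=Word\t1\tconj\t_\t_",
    "3\t15\t15\tNUM\t_\tNumForm=Digit\t1\tconj\t_\t_",
    "",
])

def lexical_share(conllu_text, feature, upos):
    """Return (lemmas with exactly one value, lemmas with any value)."""
    values_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines with the requested UPOS tag.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        pairs = [] if cols[5] == "_" else cols[5].split("|")
        for pair in pairs:
            name, _, value = pair.partition("=")
            if name == feature:
                values_by_lemma[cols[2]].add(value)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, len(values_by_lemma)

single, total = lexical_share(SAMPLE, "NumForm", "NUM")
```

When `single == total`, every lemma occurs with only one value, which corresponds to the 100% figure reported here.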
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (878; 96%),
NUM –[appos]–> NUM (82; 91%),
NUM –[dep]–> NUM (10; 100%),
NUM –[compound]–> NUM (7; 100%),
NUM –[nsubj]–> NUM (5; 83%),
NUM –[obl]–> NUM (5; 100%).
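Agreement counts of this kind can be sketched by walking each dependency and comparing the feature value on the child with the value on its parent. This is an illustrative sketch only: it assumes the left side of each arrow is the parent, that the percentage is taken over parent-child pairs where both nodes carry the feature, and that sentences are separated by a blank line with `\n` line endings. `agreement_counts` is a hypothetical helper name and the data in `SAMPLE` is made up.

```python
from collections import Counter

# Two made-up sentences separated by a blank line.
SAMPLE = "\n".join([
    "1\t10\t10\tNUM\t_\tNumForm=Digit\t0\troot\t_\t_",
    "2\t12\t12\tNUM\t_\tNumForm=Digit\t1\tconj\t_\t_",
    "3\tألف\tألف\tNUM\t_\tNumForm=Word\t1\tconj\t_\t_",
    "",
    "1\t3\t3\tNUM\t_\tNumForm=Digit\t0\troot\t_\t_",
    "2\t4\t4\tNUM\t_\tNumForm=Digit\t1\tappos\t_\t_",
    "",
])

def agreement_counts(conllu_text, feature):
    """Count agreeing and comparable pairs per (parent UPOS, deprel, child UPOS)."""
    agree, total = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        tokens = {}
        for line in block.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword ranges (1-2) and empty nodes (1.1).
            if len(cols) != 10 or not cols[0].isdigit():
                continue
            feats = {} if cols[5] == "_" else dict(
                pair.split("=", 1) for pair in cols[5].split("|"))
            # id -> (UPOS, feature value or None, head id, relation)
            tokens[int(cols[0])] = (cols[3], feats.get(feature),
                                    int(cols[6]), cols[7])
        for upos, value, head, deprel in tokens.values():
            parent = tokens.get(head)  # None for the root (head 0)
            if parent is None or value is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            total[key] += 1
            if parent[1] == value:
                agree[key] += 1
    return agree, total

agree, total = agreement_counts(SAMPLE, "NumForm")
```

Dividing `agree[key]` by `total[key]` gives an agreement rate per relation, comparable in spirit to the percentages in the list above.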