home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: Features: NumForm

This feature is language-specific. It occurs with 4 different values: Combi, Cyril, Digit, Word.

2313 tokens (2%) have a non-empty value of NumForm. 738 types (3%) occur at least once with a non-empty value of NumForm. 276 lemmas (3%) occur at least once with a non-empty value of NumForm. The feature is used with 2 part-of-speech tags: NUM (1289; 1% instances), ADJ (1024; 1% instances).

NUM

1289 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (1145; 89%), Gender=EMPTY (733; 57%).

NUM tokens may have the following values of NumForm:

Paradigm 3CyrilDigitWord
Case=Acc|Gender=Masc3
Case=Accг[3]
Case=Nom|Gender=Masc3, [3.], [3]
Case=Nom|Gender=Fem3
Case=Nom3, [3.]3-

NumForm seems to be lexical feature of NUM. 95% lemmas (191) occur only with one value of NumForm.

ADJ

1024 ADJ tokens (11% of all ADJ tokens) have a non-empty value of NumForm.

The most frequent other feature values with which ADJ and NumForm co-occurred: Degree=EMPTY (1024; 100%), Variant=EMPTY (1024; 100%), NumType=Ord (1014; 99%), Number=Sing (971; 95%), Gender=Masc (860; 84%), Case=Gen (548; 54%).

ADJ tokens may have the following values of NumForm:

Paradigm 1655CyrilDigit
Case=Genах҃нє, ах҃не, ах҃нѕ, а҃хнѕ[1655]
Case=Locах҃нє

Relations with Agreement in NumForm

The 10 most frequent relations where parent and child node agree in NumForm: ADJ –[compound]–> NUM (31; 91%), NUM –[conj]–> NUM (12; 86%), NUM –[compound]–> NUM (4; 100%), ADJ –[nmod]–> ADJ (2; 67%), NUM –[nmod]–> NUM (2; 100%), NUM –[nummod:gov]–> NUM (2; 100%), NUM –[nummod]–> NUM (2; 100%), ADJ –[nmod]–> NUM (1; 100%), NUM –[conj]–> ADJ (1; 100%), NUM –[parataxis]–> NUM (1; 100%).