home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: Features: NumForm

This feature is language-specific. It occurs with 5 different values: Combi, Cyril, Digit, Roman, Word.

3283 tokens (3%) have a non-empty value of NumForm. 664 types (3%) occur at least once with a non-empty value of NumForm. 405 lemmas (5%) occur at least once with a non-empty value of NumForm. The feature is used with 2 part-of-speech tags: NUM (2537; 3% instances), ADJ (746; 1% instances).

NUM

2537 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (2372; 93%), Case=Nom (1685; 66%), Gender=EMPTY (1631; 64%).

NUM tokens may have the following values of NumForm:

Paradigm 3CombiDigit
Case=Acc|Gender=Masc3
Case=Acc|Gender=Fem3
Case=Acc3
Case=Dat|Gender=Masc3
Case=Gen|Gender=Masc3-х3
Case=Gen|Gender=Fem3-х
Case=Gen3
Case=Loc|Gender=Masc3-х
Case=Loc|Gender=Fem3-х
Case=Nom|Gender=Masc3
Case=Nom|Gender=Fem3
Case=Nom|Gender=Neut3
Case=Nom3

NumForm seems to be lexical feature of NUM. 98% lemmas (280) occur only with one value of NumForm.

ADJ

746 ADJ tokens (9% of all ADJ tokens) have a non-empty value of NumForm.

The most frequent other feature values with which ADJ and NumForm co-occurred: Variant=EMPTY (738; 99%), Number=Sing (726; 97%), Degree=EMPTY (681; 91%), Gender=Masc (619; 83%).

ADJ tokens may have the following values of NumForm:

NumForm seems to be lexical feature of ADJ. 100% lemmas (160) occur only with one value of NumForm.

Relations with Agreement in NumForm

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (142; 99%), NUM –[compound]–> NUM (51; 100%), NUM –[nsubj]–> NUM (25; 93%), ADJ –[conj]–> ADJ (11; 85%), NUM –[nmod]–> NUM (5; 83%), NUM –[flat]–> NUM (4; 80%), ADJ –[conj]–> NUM (2; 67%), NUM –[conj]–> ADJ (2; 100%), NUM –[orphan]–> NUM (2; 100%), NUM –[nummod]–> NUM (1; 100%).