
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: Features: NumForm

This feature is language-specific. It occurs with 4 different values: Combi, Digit, Roman, Word.

3799 tokens (2%) have a non-empty value of NumForm. 680 types (2%) occur at least once with a non-empty value of NumForm. 521 lemmas (3%) occur at least once with a non-empty value of NumForm. The feature is used with 2 part-of-speech tags: NUM (3082; 2% of instances), ADJ (717; 0% of instances).
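Counts like these can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes the standard 10-column CoNLL-U layout, treats each distinct word form as a type, and uses a made-up two-sentence sample (`SAMPLE`) and hypothetical helper names (`parse_feats`, `numform_counts`).

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Gen|NumForm=Digit' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def numform_counts(conllu_text, feature="NumForm"):
    """Count tokens, distinct forms, distinct lemmas and UPOS tags carrying `feature`."""
    tokens = 0
    types = set()
    lemmas = set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if feature in parse_feats(cols[5]):
            tokens += 1
            types.add(cols[1])     # FORM column
            lemmas.add(cols[2])    # LEMMA column
            by_upos[cols[3]] += 1  # UPOS column
    return tokens, len(types), len(lemmas), dict(by_upos)

# Toy data for illustration only; not taken from UD_Russian-Taiga.
SAMPLE = (
    "# text = toy example\n"
    "1\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t2\tnummod\t_\t_\n"
    "2\tдома\tдом\tNOUN\t_\tCase=Gen\t0\troot\t_\t_\n"
    "\n"
    "1\t2-х\t2\tNUM\t_\tCase=Gen|NumForm=Combi|NumType=Card\t2\tnummod\t_\t_\n"
    "2\tлет\tгод\tNOUN\t_\tCase=Gen\t0\troot\t_\t_\n"
)

print(numform_counts(SAMPLE))  # (2, 2, 1, {'NUM': 2})
```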

NUM

3082 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (2802; 91%), Gender=EMPTY (2724; 88%), Case=EMPTY (1931; 63%).
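The co-occurrence figures can be approximated as follows. This sketch assumes that `EMPTY` marks a feature that is absent from the token's FEATS column; the function name `cooccurring_values` and the sample strings are hypothetical.

```python
from collections import Counter

def cooccurring_values(feats_list, target="NumForm",
                       others=("NumType", "Gender", "Case")):
    """For tokens that carry `target`, count the values of the `others` features.

    A feature missing from a token is counted as '<name>=EMPTY' (assumed convention).
    """
    counts = Counter()
    total = 0
    for feats in feats_list:
        pairs = dict(p.split("=", 1) for p in feats.split("|") if "=" in p)
        if target not in pairs:
            continue
        total += 1
        for name in others:
            counts[f"{name}={pairs.get(name, 'EMPTY')}"] += 1
    return total, counts

# Toy FEATS strings for illustration only.
FEATS = [
    "NumForm=Digit|NumType=Card",
    "Case=Gen|Gender=Masc|NumForm=Combi|NumType=Card",
    "Case=Gen",  # no NumForm, so this token is ignored
]

total, counts = cooccurring_values(FEATS)
for value, n in counts.most_common():
    print(f"{value} ({n}; {100 * n // total}%)")
```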

NUM tokens may have the following values of NumForm:

Paradigm 2                          Combi      Digit
Case=Gen|Gender=Masc|NumType=Card   2-х
Case=Gen|NumType=Card               2-х, 2х
NumType=Card                                   2
NumType=Sets                                   2

NumForm seems to be a lexical feature of NUM. 98% of lemmas (367) occur with only one value of NumForm.
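The "lexical feature" check can be sketched like this: group the observed NumForm values by lemma and count the lemmas that show exactly one value. This is an illustration under assumptions, not the page's own generator; `single_value_share` and the sample rows are hypothetical.

```python
from collections import defaultdict

def single_value_share(rows, upos, feature="NumForm"):
    """Return (lemmas with exactly one value, lemmas with any value) for one UPOS tag.

    `rows` is an iterable of (lemma, upos, feats) triples taken from CoNLL-U columns.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in rows:
        if tag != upos:
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == feature:
                values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, total

# Toy rows for illustration only.
ROWS = [
    ("2", "NUM", "NumForm=Digit|NumType=Card"),
    ("2", "NUM", "Case=Gen|NumForm=Combi|NumType=Card"),
    ("пять", "NUM", "Case=Nom|NumForm=Word|NumType=Card"),
    ("XX", "ADJ", "NumForm=Roman"),
]

print(single_value_share(ROWS, "NUM"))  # (1, 2)
print(single_value_share(ROWS, "ADJ"))  # (1, 1)
```

A share close to 100% suggests that the value is a property of the lemma rather than of the individual word form.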

ADJ

717 ADJ tokens (4% of all ADJ tokens) have a non-empty value of NumForm.

The most frequent other feature values with which ADJ and NumForm co-occurred: Variant=EMPTY (717; 100%), Gender=EMPTY (431; 60%), Degree=Pos (363; 51%).

ADJ tokens may have the following values of NumForm:

NumForm seems to be a lexical feature of ADJ. 100% of lemmas (189) occur with only one value of NumForm.

Relations with Agreement in NumForm

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[nmod]–> NUM (157; 99%), NUM –[conj]–> NUM (70; 100%), ADJ –[nmod]–> ADJ (24; 96%), NUM –[parataxis]–> NUM (10; 91%), ADJ –[conj]–> ADJ (9; 75%), NUM –[list]–> NUM (7; 100%), ADJ –[nmod]–> NUM (3; 75%), NUM –[nummod:gov]–> NUM (3; 100%), NUM –[nummod]–> NUM (3; 100%), NUM –[advcl]–> NUM (2; 100%).
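Agreement counts of this kind can be sketched as below. The code assumes that the percentage is the share of agreeing pairs among parent-child pairs in which both nodes carry NumForm; that reading, the token dictionaries, and the names `feature_value` and `agreement_by_relation` are assumptions made for illustration.

```python
from collections import Counter

def feature_value(feats, feature):
    """Return the value of `feature` in a FEATS string, or None if it is absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == feature:
            return value
    return None

def agreement_by_relation(sentences, feature="NumForm"):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, pairs where both have the feature)."""
    agree = Counter()
    both = Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root node (head 0) or head outside the sentence
            child_value = feature_value(child["feats"], feature)
            parent_value = feature_value(parent["feats"], feature)
            if child_value is None or parent_value is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if child_value == parent_value:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}

# Toy sentence for illustration only.
SENTENCES = [[
    {"id": 1, "upos": "NUM", "feats": "NumForm=Digit", "head": 0, "deprel": "root"},
    {"id": 2, "upos": "NUM", "feats": "NumForm=Digit", "head": 1, "deprel": "conj"},
    {"id": 3, "upos": "NUM", "feats": "NumForm=Word", "head": 1, "deprel": "conj"},
]]

print(agreement_by_relation(SENTENCES))  # {('NUM', 'conj', 'NUM'): (1, 2)}
```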