This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: NumForm

This feature is language-specific. It occurs with 4 different values: Combi, Digit, Roman, Word.

5498 tokens (3%) have a non-empty value of NumForm. 1007 types (3%) occur at least once with a non-empty value of NumForm. 938 lemmas (5%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tag: NUM (5498; 3% instances).
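Counts of this kind could be reproduced from the treebank's CoNLL-U files roughly as follows. This is a minimal sketch, not the script that generated this page. The rows in `ROWS` are invented for illustration, the helper names `parse_feats`, `read_tokens` and `numform_counts` are hypothetical, and treating a "type" as a distinct, case-sensitive word form is an assumption.

```python
from collections import Counter

# Toy rows in CoNLL-U column order (ID, FORM, LEMMA, UPOS, XPOS, FEATS,
# HEAD, DEPREL, DEPS, MISC). Invented for illustration, not treebank data.
ROWS = [
    ("1", "doi", "doi", "NUM", "_",
     "Gender=Masc|Number=Plur|NumForm=Word|NumType=Card", "2", "nummod", "_", "_"),
    ("2", "ani", "an", "NOUN", "_", "Gender=Masc|Number=Plur", "0", "root", "_", "_"),
    ("3", "2004", "2004", "NUM", "_", "NumForm=Digit|NumType=Card", "2", "nmod", "_", "_"),
    ("4", "II", "doi", "NUM", "_", "NumForm=Roman|NumType=Ord", "2", "nmod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield one dict per regular token line.

    Comments, blank lines, multiword-token ranges (e.g. 1-2) and empty
    nodes (e.g. 1.1) are skipped because their ID is not a plain integer.
    """
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield {"form": cols[1], "lemma": cols[2], "upos": cols[3],
               "feats": parse_feats(cols[5])}


def numform_counts(tokens, feature="NumForm"):
    """Count tokens, types and lemmas that carry a non-empty `feature`."""
    tokens = list(tokens)
    marked = [t for t in tokens if feature in t["feats"]]
    return {
        "tokens": len(marked),
        "token_share": len(marked) / len(tokens),
        "types": len({t["form"] for t in marked}),    # assumption: type = word form
        "lemmas": len({t["lemma"] for t in marked}),
        "values": dict(Counter(t["feats"][feature] for t in marked)),
        "upos": dict(Counter(t["upos"] for t in marked)),
    }


stats = numform_counts(read_tokens(SAMPLE))
print(stats)
```

On a real treebank, `SAMPLE` would be replaced by the contents of a `.conllu` file read as UTF-8 text.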

NUM

5498 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (4764; 87%), Gender=EMPTY (4606; 84%), Number=Sing (4419; 80%).

NUM tokens may have the following values of NumForm:

Paradigm doi                          Roman  Word
Foreign=Yes|Number=Sing|NumType=Ord   II
Gender=Masc|Number=Sing|NumType=Ord          doilea, secund
Gender=Masc|Number=Plur|NumType=Card         doi
Gender=Fem|Number=Sing|NumType=Ord           doua
Gender=Fem|Number=Plur|NumType=Card          două
Number=Sing|NumType=Ord               ii

NumForm seems to be a lexical feature of NUM. 97% of lemmas (908) occur with only one value of NumForm.
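The 97% figure corresponds to 908 of the 938 lemmas that occur with NumForm. One way such a share could be computed is sketched below. The token list is invented for illustration, and the helper names `parse_feats` and `single_value_share` are hypothetical, not part of any UD tool.

```python
from collections import defaultdict

# Toy (lemma, UPOS, FEATS) triples, invented for illustration only.
TOKENS = [
    ("doi", "NUM", "Gender=Masc|Number=Plur|NumForm=Word|NumType=Card"),
    ("doi", "NUM", "NumForm=Roman|NumType=Ord"),
    ("trei", "NUM", "NumForm=Word|NumType=Card"),
    ("2004", "NUM", "NumForm=Digit|NumType=Card"),
    ("an", "NOUN", "Gender=Masc|Number=Plur"),
]


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def single_value_share(tokens, feature="NumForm", upos="NUM"):
    """Return (lemmas with exactly one value, all lemmas with the feature)."""
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        value = parse_feats(feats).get(feature)
        if tag == upos and value is not None:
            values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_value_share(TOKENS)
print(f"{single} of {total} lemmas occur with only one value of NumForm")
```

In the toy data the lemma doi occurs with both Word and Roman, mirroring the paradigm above, so it does not count as a single-value lemma.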

Relations with Agreement in NumForm

The 9 relations in which parent and child node agree in NumForm, ordered by frequency: NUM –[conj]–> NUM (271; 99%), NUM –[nummod]–> NUM (75; 67%), NUM –[compound]–> NUM (27; 57%), NUM –[fixed]–> NUM (2; 100%), NUM –[parataxis]–> NUM (2; 67%), NUM –[acl]–> NUM (1; 100%), NUM –[appos]–> NUM (1; 100%), NUM –[dep]–> NUM (1; 100%), NUM –[nmod]–> NUM (1; 100%).
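Agreement counts like these could be gathered per relation as sketched below. This is an illustration under stated assumptions, not the page's actual generator: the percentage is assumed to be the share of parent-child pairs in which both nodes carry NumForm and the values match, the sentences are invented, and `agreement_by_relation` is a hypothetical helper.

```python
from collections import Counter

# Toy sentences as lists of (ID, UPOS, FEATS, HEAD, DEPREL); invented data.
SENTENCES = [
    [
        (1, "NUM", "NumForm=Word|NumType=Card", 0, "root"),
        (2, "NUM", "NumForm=Word|NumType=Card", 1, "conj"),
        (3, "NUM", "NumForm=Digit|NumType=Card", 1, "conj"),
    ],
    [
        (1, "NUM", "NumForm=Word|NumType=Card", 2, "nummod"),
        (2, "NUM", "NumForm=Word|NumType=Card", 0, "root"),
    ],
]


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def agreement_by_relation(sentences, feature="NumForm"):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, all pairs).

    Only pairs where both parent and child have a value of `feature`
    are counted (an assumption about how the percentages are defined).
    """
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {token[0]: token for token in sentence}
        for _, upos, feats, head, deprel in sentence:
            parent = by_id.get(head)
            if parent is None:  # head 0 is the artificial root
                continue
            child_value = parse_feats(feats).get(feature)
            parent_value = parse_feats(parent[2]).get(feature)
            if child_value is None or parent_value is None:
                continue
            key = (parent[1], deprel, upos)
            total[key] += 1
            if child_value == parent_value:
                agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


result = agreement_by_relation(SENTENCES)
for (parent, deprel, child), (n_agree, n_all) in sorted(result.items()):
    print(f"{parent} -[{deprel}]-> {child} ({n_agree}; {100 * n_agree // n_all}%)")
```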