NumForm
: numeral form
NumForm
is a lexical feature of numerals that marks whether the number is expressed by digits or letters.
Word: number expressed as word
Examples
- en “one”, dva “two”, tri “three”
- enoj “one-fold”, dvoj “two-fold”, troj “three-fold”
Digit: number expressed using digits
Examples
- 1, 2, 3
- 1., 2., 3.
Roman: roman numeral
Examples
- I, II, III
- I., II., III.
Conversion from JOS
NumForm
is assigned to all numerals that are converted to UD NUM
. Numerals with Form=digit are converted to NumForm=Digit
, numerals with Form=roman are converted to NumForm=Roman
and numerals with Form=letter are converted to NumForm=Word
. Note, however, that (word) numerals that are converted to UD ADJ
, do not have any NumForm
.
Treebank Statistics (UD_Slovenian)
This feature is language-specific.
It occurs with 3 different values: Digit
, Roman
, Word
.
1927 tokens (1%) have a non-empty value of NumForm
.
550 types (2%) occur at least once with a non-empty value of NumForm
.
485 lemmas (3%) occur at least once with a non-empty value of NumForm
.
The feature is used with 1 part-of-speech tags: sl-pos/NUM (1927; 1% instances).
NUM
1927 sl-pos/NUM tokens (100% of all NUM
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NUM
and NumForm
co-occurred: NumType=Card (1665; 86%), Gender=EMPTY (1441; 75%), Case=EMPTY (1187; 62%), Number=EMPTY (1187; 62%).
NUM
tokens may have the following values of NumForm
:
Digit
(1166; 61% of non-emptyNumForm
): 10, 15, 2000, 1., 50, 30, 3, 20, 20., 6Roman
(21; 1% of non-emptyNumForm
): I., II, II., VI, I, III, IV., IX., V, V.Word
(740; 38% of non-emptyNumForm
): eno, tri, dveh, dva, ena, eden, tisoč, štiri, štirih, dve
NumForm
seems to be lexical feature of NUM
. 100% lemmas (485) occur only with one value of NumForm
.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm
:
NUM –[conj]–> NUM (92; 98%),
NUM –[compound]–> NUM (23; 68%).
Treebank Statistics (UD_Slovenian-SST)
This feature is language-specific.
It occurs with 1 different values: Word
.
499 tokens (2%) have a non-empty value of NumForm
.
83 types (1%) occur at least once with a non-empty value of NumForm
.
53 lemmas (1%) occur at least once with a non-empty value of NumForm
.
The feature is used with 1 part-of-speech tags: sl-pos/NUM (499; 2% instances).
NUM
499 sl-pos/NUM tokens (100% of all NUM
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NUM
and NumForm
co-occurred: NumType=Card (498; 100%), Number=Plur (287; 58%).
NUM
tokens may have the following values of NumForm
:
Word
(499; 100% of non-emptyNumForm
): eno, dva, en, ena, tri, tisoč, dvajset, dve, pet, enega
NumForm
seems to be lexical feature of NUM
. 100% lemmas (53) occur only with one value of NumForm
.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm
:
NUM –[compound]–> NUM (48; 100%),
NUM –[conj]–> NUM (29; 100%),
NUM –[mwe]–> NUM (4; 100%),
NUM –[reparandum]–> NUM (4; 100%),
NUM –[nummod]–> NUM (1; 100%),
NUM –[advmod]–> NUM (1; 100%).