Treebank Statistics: UD_Czech-PDTC: Features: NumType
This feature is universal.
It occurs with 5 different values: Card, Frac, Mult, Ord, Sets.
Some words have combined values of the feature; 1 combination has been observed: Mult|Sets.
121047 tokens (4%) have a non-empty value of NumType.
7778 types (4%) occur at least once with a non-empty value of NumType.
7225 lemmas (9%) occur at least once with a non-empty value of NumType.
The feature is used with 4 part-of-speech tags: NUM (104456; 3% of instances), ADJ (10119; 0% of instances), DET (5026; 0% of instances), ADV (1446; 0% of instances).
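Counts like the ones above can be approximated from a CoNLL-U file with a short script. The following is a minimal sketch, not the tool that produced this page: the function names (`parse_feats`, `numtype_by_upos`) and the toy rows are assumptions made for illustration, and the sample is not taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string such as 'Case=Nom|NumType=Card' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def numtype_by_upos(conllu_text):
    """Count (UPOS, NumType) pairs over all syntactic words with a non-empty NumType."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        value = parse_feats(cols[5]).get("NumType")
        if value is not None:
            counts[(cols[3], value)] += 1
    return counts


# Toy rows invented for illustration (hypothetical, not real treebank annotation).
ROWS = [
    ("1", "dva", "dva", "NUM", "_", "Case=Nom|NumType=Card", "0", "root", "_", "_"),
    ("2", "první", "první", "ADJ", "_", "Case=Nom|NumType=Ord", "1", "dep", "_", "_"),
    ("3", "několik", "několik", "DET", "_", "NumType=Card|PronType=Ind", "1", "dep", "_", "_"),
    ("4", "dvakrát", "dvakrát", "ADV", "_", "NumType=Mult", "1", "dep", "_", "_"),
    ("5", "to", "ten", "DET", "_", "PronType=Dem", "1", "dep", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

if __name__ == "__main__":
    for (upos, value), n in sorted(numtype_by_upos(SAMPLE).items()):
        print(upos, value, n)
```

Summing the counter per UPOS tag gives the per-tag totals; dividing by the number of all tokens with that tag gives the percentages reported in the per-tag sections.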
NUM
104456 NUM tokens (100% of all NUM tokens) have a non-empty value of NumType.
The most frequent other feature values with which NUM and NumType co-occurred: Gender=EMPTY (82759; 79%), Case=EMPTY (69972; 67%), Number=EMPTY (69972; 67%), NumForm=Digit (67981; 65%).
NUM tokens may have the following values of NumType:
Card (104359; 100% of non-empty NumType): 1, milionů, milionu, dva, tři, 2, jeden, miliardy, 3, 4
Frac (15; 0% of non-empty NumType): nejeden, nejedné, nejednoho, nejednomu, nejedno
Sets (82; 0% of non-empty NumType): jedny, jedni, jedněch, jedněmi, jedněm
NumType seems to be a lexical feature of NUM. 100% of lemmas (7095) occur with only one value of NumType.
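The "lexical feature" observation can be checked by collecting, for each lemma of a given tag, the set of NumType values it occurs with. The sketch below assumes that only lemmas with at least one non-empty NumType value are counted and that EMPTY is not treated as a value; the exact definition used for this page may differ. The helper names and the toy rows are hypothetical, and the toy annotation is invented so that one lemma has two values.

```python
from collections import defaultdict


def lemma_values(conllu_text, upos):
    """Map each lemma of the given UPOS to the set of non-empty NumType values seen with it."""
    values = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular 10-column word lines with the requested UPOS tag.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        for item in cols[5].split("|"):
            name, _, value = item.partition("=")
            if name == "NumType":
                values[cols[2]].add(value)
    return values


def share_single_valued(values):
    """Fraction of lemmas that occur with exactly one NumType value."""
    if not values:
        return 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single / len(values)


# Toy rows invented for illustration; the lemma/value pairing is NOT treebank data.
ROWS = [
    ("1", "dva", "dva", "NUM", "_", "NumType=Card", "0", "root", "_", "_"),
    ("2", "dvou", "dva", "NUM", "_", "NumType=Card", "1", "dep", "_", "_"),
    ("3", "tři", "tři", "NUM", "_", "NumType=Card", "1", "dep", "_", "_"),
    ("4", "jeden", "jeden", "NUM", "_", "NumType=Card", "1", "dep", "_", "_"),
    ("5", "jedny", "jeden", "NUM", "_", "NumType=Sets", "1", "dep", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

if __name__ == "__main__":
    print(share_single_valued(lemma_values(SAMPLE, "NUM")))
```

A share close to 1 suggests the value is fixed per lemma (lexical) rather than varying with inflection.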
ADJ
10119 ADJ tokens (3% of all ADJ tokens) have a non-empty value of NumType.
The most frequent other feature values with which ADJ and NumType co-occurred: Degree=EMPTY (10119; 100%), Polarity=EMPTY (10119; 100%), VerbForm=EMPTY (10119; 100%), Voice=EMPTY (10119; 100%), Number=Sing (8755; 87%), Animacy=EMPTY (6700; 66%).
ADJ tokens may have the following values of NumType:
Mult (130; 1% of non-empty NumType): obojí, dvojí, dvojím, dvoje, oboje, dvojího, obojím, troje, trojí, obého
Mult,Sets (5; 0% of non-empty NumType): devatery, třicatery, čtvery, čtverým, šestery
Ord (9984; 99% of non-empty NumType): první, druhé, třetí, prvním, druhý, prvních, třetím, druhou, druhá, prvního
EMPTY (350918): další, nové, poslední, české, velké, dalších, cenných, obchodní, hlavní, státní
NumType seems to be a lexical feature of ADJ. 100% of lemmas (67) occur with only one value of NumType.
DET
5026 DET tokens (3% of all DET tokens) have a non-empty value of NumType.
The most frequent other feature values with which DET and NumType co-occurred: Number[psor]=EMPTY (5026; 100%), Person=EMPTY (5026; 100%), Poss=EMPTY (5026; 100%), Animacy=EMPTY (5020; 100%), Gender=EMPTY (5000; 99%), Number=EMPTY (5000; 99%), PronType=Ind (3568; 71%).
DET tokens may have the following values of NumType:
Card (5000; 99% of non-empty NumType): několik, mnoho, několika, kolik, mnoha, tolik, pár, málo, mála, tolika
Ord (18; 0% of non-empty NumType): kolikáté, kolikátého, kolikátá, několikátý, bůhvíkolikátém, kolikátí, kolikátý, několikáté
Sets (8; 0% of non-empty NumType): několikeré, několikerého, několikery, několikerá, několikerý, tolikeré
EMPTY (149580): to, které, který, která, jeho, své, jejich, tím, toho, této
NumType seems to be a lexical feature of DET. 100% of lemmas (17) occur with only one value of NumType.
ADV
1446 ADV tokens (1% of all ADV tokens) have a non-empty value of NumType.
The most frequent other feature values with which ADV and NumType co-occurred: Degree=EMPTY (1446; 100%), Polarity=EMPTY (1446; 100%), PronType=EMPTY (1094; 76%).
ADV tokens may have the following values of NumType:
Mult (1446; 100% of non-empty NumType): jednou, dvakrát, třikrát, několikrát, čtyřikrát, pětkrát, kolikrát, desetkrát, šestkrát, mnohokrát
EMPTY (163747): tam, už, tak, jak, kde, pak, kdy, více, ještě, včera
NumType seems to be a lexical feature of ADV. 100% of lemmas (46) occur with only one value of NumType.
Relations with Agreement in NumType
The 10 most frequent relations where parent and child node agree in NumType:
NUM –[compound]–> NUM (8765; 100%),
NUM –[conj]–> NUM (5763; 100%),
ADJ –[conj]–> ADJ (172; 70%),
ADV –[conj]–> ADV (70; 67%),
NUM –[det:nummod]–> DET (58; 100%),
NUM –[orphan]–> NUM (38; 100%),
NUM –[dep]–> NUM (36; 100%),
NUM –[obl]–> NUM (17; 100%),
ADJ –[orphan]–> ADJ (12; 63%),
DET –[conj]–> DET (12; 80%).
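Agreement counts of this kind can be sketched by comparing the NumType value of each node with that of its parent. The code below is an illustrative assumption, not the generator of this page: it assumes the percentage is the number of agreeing pairs divided by all parent-child pairs of that pattern in which both nodes have a non-empty NumType. The function names and the toy sentences are hypothetical.

```python
from collections import Counter


def read_sentences(conllu_text):
    """Yield sentences as lists of 10-column rows, skipping comments, MWT ranges and empty nodes."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            sent.append(cols)
    if sent:
        yield sent


def numtype(cols):
    """Return the NumType value from the FEATS column, or None if absent."""
    for item in cols[5].split("|"):
        name, _, value = item.partition("=")
        if name == "NumType":
            return value
    return None


def agreement_counts(conllu_text):
    """Count (parent UPOS, deprel, child UPOS) patterns where both nodes have NumType."""
    agree, total = Counter(), Counter()
    for sent in read_sentences(conllu_text):
        by_id = {cols[0]: cols for cols in sent}
        for child in sent:
            parent = by_id.get(child[6])  # HEAD "0" (root) has no parent row
            if parent is None:
                continue
            p_val, c_val = numtype(parent), numtype(child)
            if p_val is None or c_val is None:
                continue
            key = (parent[3], child[7], child[3])
            total[key] += 1
            if p_val == c_val:
                agree[key] += 1
    return agree, total


# Toy sentences invented for illustration (hypothetical annotation).
SENT1 = [
    ("1", "dvacet", "dvacet", "NUM", "_", "NumType=Card", "0", "root", "_", "_"),
    ("2", "pět", "pět", "NUM", "_", "NumType=Card", "1", "compound", "_", "_"),
]
SENT2 = [
    ("1", "první", "první", "ADJ", "_", "NumType=Ord", "0", "root", "_", "_"),
    ("2", "druhý", "druhý", "ADJ", "_", "NumType=Ord", "1", "conj", "_", "_"),
    ("3", "dvojí", "dvojí", "ADJ", "_", "NumType=Mult", "1", "conj", "_", "_"),
]
SAMPLE = "\n\n".join("\n".join("\t".join(r) for r in s) for s in (SENT1, SENT2))

if __name__ == "__main__":
    agree, total = agreement_counts(SAMPLE)
    for key in total:
        parent_upos, deprel, child_upos = key
        pct = round(100 * agree[key] / total[key])
        print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {pct}%)")
```

Sorting the patterns by their agreeing count and keeping the first ten would give a list in the same shape as the one above.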