Treebank Statistics: UD_Czech-PDT: Features: NumType
This feature is universal.
It occurs with 5 different values: Card
, Frac
, Mult
, Ord
, Sets
.
Some words have combined values of the feature; 1 combinations have been observed: Mult|Sets
.
10967 tokens (3%) have a non-empty value of NumType
.
1554 types (3%) occur at least once with a non-empty value of NumType
.
1307 lemmas (5%) occur at least once with a non-empty value of NumType
.
The feature is used with 4 part-of-speech tags: NUM (8531; 3% instances), ADJ (1216; 0% instances), DET (1130; 0% instances), ADV (90; 0% instances).
NUM
8531 NUM tokens (100% of all NUM
tokens) have a non-empty value of NumType
.
The most frequent other feature values with which NUM
and NumType
co-occurred: Gender=EMPTY (7565; 89%), NumValue=EMPTY (7321; 86%), Case=EMPTY (6163; 72%), Number=EMPTY (6163; 72%), NumForm=Digit (6033; 71%).
NUM
tokens may have the following values of NumType
:
Card
(8530; 100% of non-emptyNumType
): 1, 2, 3, tři, dva, dvě, 4, 10, jeden, 5Frac
(1; 0% of non-emptyNumType
): nejednomu
NumType
seems to be lexical feature of NUM
. 100% lemmas (1226) occur only with one value of NumType
.
ADJ
1216 ADJ tokens (3% of all ADJ
tokens) have a non-empty value of NumType
.
The most frequent other feature values with which ADJ
and NumType
co-occurred: Degree=EMPTY (1216; 100%), Polarity=EMPTY (1216; 100%), VerbForm=EMPTY (1216; 100%), Voice=EMPTY (1216; 100%), Number=Sing (1038; 85%), Animacy=EMPTY (814; 67%).
ADJ
tokens may have the following values of NumType
:
Mult,Sets
(31; 3% of non-emptyNumType
): dvojí, obojí, dvojím, jedny, jedněch, dvoje, dvojího, jedni, obojím, obéhoOrd
(1185; 97% of non-emptyNumType
): první, druhé, prvním, třetí, druhou, druhý, prvního, prvních, druhá, druhémEMPTY
(39558): další, české, nové, poslední, státní, možné, dalších, vlastní, národní, větší
NumType
seems to be lexical feature of ADJ
. 100% lemmas (41) occur only with one value of NumType
.
DET
1130 DET tokens (8% of all DET
tokens) have a non-empty value of NumType
.
The most frequent other feature values with which DET
and NumType
co-occurred: Number[psor]=EMPTY (1130; 100%), Person=EMPTY (1130; 100%), Poss=EMPTY (1130; 100%), PronType=Int,Rel (720; 64%), Number=EMPTY (602; 53%), Animacy=EMPTY (601; 53%).
DET
tokens may have the following values of NumType
:
Card
(410; 36% of non-emptyNumType
): několik, několika, mnoho, mnoha, kolik, tolik, málo, pár, mála, nemáloOrd
(720; 64% of non-emptyNumType
): tisíc, miliónů, milionů, miliardy, miliard, tisíce, mil, sto, miliónu, setEMPTY
(12388): to, které, který, jeho, která, jejich, své, tím, tom, kteří
NumType
seems to be lexical feature of DET
. 100% lemmas (20) occur only with one value of NumType
.
ADV
90 ADV tokens (1% of all ADV
tokens) have a non-empty value of NumType
.
The most frequent other feature values with which ADV
and NumType
co-occurred: Degree=EMPTY (90; 100%), Polarity=EMPTY (90; 100%), PronType=EMPTY (61; 68%).
ADV
tokens may have the following values of NumType
:
Mult
(90; 100% of non-emptyNumType
): dvakrát, jednou, třikrát, několikrát, pětkrát, čtyřikrát, šestkrát, desetkrát, jedenkrát, vícekrátEMPTY
(16720): tak, už, také, jak, již, ještě, včera, pak, dnes, kde
NumType
seems to be lexical feature of ADV
. 100% lemmas (20) occur only with one value of NumType
.
Relations with Agreement in NumType
The 10 most frequent relations where parent and child node agree in NumType
:
NUM –[conj]–> NUM (695; 100%),
NUM –[compound]–> NUM (442; 100%),
NUM –[orphan]–> NUM (19; 100%),
ADJ –[conj]–> ADJ (17; 55%),
NUM –[dep]–> NUM (17; 100%),
DET –[det]–> DET (14; 70%),
NUM –[flat]–> NUM (11; 100%),
DET –[conj]–> DET (4; 100%),
DET –[appos]–> DET (1; 100%),
DET –[appos]–> NUM (1; 100%).