Treebank Statistics: UD_Polish-PDB: Features: NumType
This feature is universal.
It occurs with 3 different values: Card, Ord, Sets.
3359 tokens (1%) have a non-empty value of NumType.
669 types (1%) occur at least once with a non-empty value of NumType.
613 lemmas (2%) occur at least once with a non-empty value of NumType.
The feature is used with 5 part-of-speech tags: NUM (1333; 0% instances), ADJ (1159; 0% instances), NOUN (725; 0% instances), DET (136; 0% instances), PROPN (6; 0% instances).
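Counts like the ones above can be recomputed from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that produced this page: the helper names (`parse_feats`, `iter_tokens`, `numtype_summary`) and the four-token `SAMPLE` are hypothetical, and it assumes that types are counted as distinct, case-sensitive word forms.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|NumType=Card' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token line
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword token range (1-2) or empty node (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def numtype_summary(conllu_text, feature="NumType"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    tokens = 0
    types, lemmas = set(), set()
    by_upos = Counter()
    for form, lemma, upos, feats in iter_tokens(conllu_text):
        if feature in feats:
            tokens += 1
            types.add(form)    # assumption: types are case-sensitive forms
            lemmas.add(lemma)
            by_upos[upos] += 1
    return {"tokens": tokens, "types": len(types),
            "lemmas": len(lemmas), "by_upos": dict(by_upos)}

# Hypothetical toy data, NOT taken from UD_Polish-PDB.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "dwoje", "dwoje", "NUM", "_", "Case=Nom|NumType=Sets", "2", "nummod", "_", "_"],
    ["2", "dzieci", "dziecko", "NOUN", "_", "Case=Gen|NumType=Sets|Number=Plur", "0", "root", "_", "_"],
    ["3", "ile", "ile", "DET", "_", "NumType=Card|PronType=Int", "2", "det", "_", "_"],
    ["4", "dwie", "dwa", "NUM", "_", "Case=Nom|Number=Plur", "2", "nummod", "_", "_"],
])

summary = numtype_summary(SAMPLE)
```

On the toy sample, three of the four tokens carry NumType, one each under NUM, NOUN and DET; the fourth token (dwie) has an empty value and is not counted.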
NUM
1333 NUM tokens (51% of all NUM tokens) have a non-empty value of NumType.
The most frequent other feature values with which NUM and NumType co-occurred: Number=Plur (1320; 99%), NumForm=Digit (1277; 96%), Gender=Masc (1066; 80%), Animacy=Inan (872; 65%).
NUM tokens may have the following values of NumType:
Card (1278; 96% of non-empty NumType): 10, 3, 2, 30, 5, 15, 20, 4, 50, 12
Sets (55; 4% of non-empty NumType): dwoje, Troje, pięcioro, Czworo, dwanaścioro, dwojgiem, dziesięcioro, pięć, sześcioro
EMPTY (1300): dwie, dwa, dwóch, trzy, trzech, cztery, pięć, dwaj, czterech, pół
NumType seems to be a lexical feature of NUM. 100% of lemmas (372) occur with only one value of NumType.
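The "lexical feature" test can be read as: for a given part of speech, how many lemmas are attested with exactly one non-empty value of the feature. The sketch below illustrates that reading. The function name `single_value_lemmas` and the toy token list are hypothetical, and ignoring tokens with an empty value is an assumption about how the page computes the figure.

```python
from collections import defaultdict

def single_value_lemmas(tokens, upos, feature="NumType"):
    """Return (lemmas with exactly one value, lemmas with any non-empty value).

    `tokens` is an iterable of (lemma, upos, feats) triples, where feats is a
    dict of feature names to values. Tokens lacking `feature` are ignored
    (assumption: empty values do not count as a separate value).
    """
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for vals in values.values() if len(vals) == 1)
    return single, len(values)

# Hypothetical toy data, NOT taken from UD_Polish-PDB.
toy_tokens = [
    ("dwoje", "NUM", {"NumType": "Sets"}),
    ("dwoje", "NUM", {"NumType": "Sets", "Case": "Ins"}),
    ("10", "NUM", {"NumType": "Card"}),
    ("dwa", "NUM", {"Number": "Plur"}),    # empty NumType, ignored
    ("ile", "DET", {"NumType": "Card"}),   # different part of speech, ignored
]

single, total = single_value_lemmas(toy_tokens, "NUM")
share = 100 * single / total if total else 0.0
```

A share of 100% means that no lemma of that part of speech was seen with two different non-empty values.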
ADJ
1159 ADJ tokens (3% of all ADJ tokens) have a non-empty value of NumType.
The most frequent other feature values with which ADJ and NumType co-occurred: Aspect=EMPTY (1159; 100%), Polarity=EMPTY (1159; 100%), VerbForm=EMPTY (1159; 100%), Voice=EMPTY (1159; 100%), Degree=Pos (1158; 100%), Number=Sing (1130; 97%), Gender=Masc (971; 84%), Animacy=Inan (938; 81%), Case=Gen (668; 58%).
ADJ tokens may have the following values of NumType:
Ord (1159; 100% of non-empty NumType): 1, II, 2008, 2000, 2009, 2, 15, 1995, 20, XIX
EMPTY (34769): innych, jeden, sam, inne, europejskiej, pierwszy, różnych, jednym, cały, nowych
NumType seems to be a lexical feature of ADJ. 100% of lemmas (271) occur with only one value of NumType.
NOUN
725 NOUN tokens (1% of all NOUN tokens) have a non-empty value of NumType.
The most frequent other feature values with which NOUN and NumType co-occurred: Animacy=EMPTY (725; 100%), Gender=Neut (725; 100%), Number=Plur (436; 60%).
NOUN tokens may have the following values of NumType:
Sets (725; 100% of non-empty NumType): dzieci, dziecko, oczy, dziecka, oczach, dzieckiem, oczami, zwierząt, dziećmi, oczu
EMPTY (87909): mężczyzna, roku, pan, kobieta, lat, r, człowiek, pracy, chłopiec, osób
NumType seems to be a lexical feature of NOUN. 100% of lemmas (16) occur with only one value of NumType.
DET
136 DET tokens (1% of all DET tokens) have a non-empty value of NumType.
The most frequent other feature values with which DET and NumType co-occurred: Number[psor]=EMPTY (136; 100%), Person=EMPTY (136; 100%), Poss=EMPTY (136; 100%), Number=Plur (116; 85%), Case=Acc (90; 66%), PronType=Int (79; 58%), Gender=Masc (78; 57%).
DET tokens may have the following values of NumType:
Card (126; 93% of non-empty NumType): ile, tyle, ilu, tylu, ileż, iluż, tyleż
Sets (10; 7% of non-empty NumType): oboje, kilkoro, obojgu
EMPTY (9212): które, ten, który, tym, tej, tego, te, tych, która, którzy
PROPN
6 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumType.
The most frequent other feature values with which PROPN and NumType co-occurred: Animacy=EMPTY (6; 100%), Gender=Neut (6; 100%), Number=Sing (6; 100%).
PROPN tokens may have the following values of NumType:
Sets (6; 100% of non-empty NumType): Hedestad, INE, Lakis, Plovdiv, Poste, SUD
EMPTY (11994): Polsce, Polski, UE, Europy, Andrzej, Polska, Europie, Warszawie, Jerzy, SLD
Relations with Agreement in NumType
The 10 most frequent relations where parent and child node agree in NumType:
NUM –[conj]–> NUM (59; 100%),
ADJ –[conj]–> ADJ (43; 100%),
ADJ –[amod:flat]–> ADJ (5; 100%),
NUM –[flat]–> NUM (3; 100%),
ADJ –[fixed]–> ADJ (1; 100%).
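An agreement table of this kind can be approximated by walking every dependency edge and comparing the feature on parent and child. The sketch below is one possible reading, not the page's own method: the function name `agreement_relations`, the token dict layout and the toy sentence are hypothetical, and it assumes the percentage is taken over edges where both nodes have a non-empty NumType.

```python
from collections import Counter

def agreement_relations(sentences, feature="NumType"):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, comparable edges).

    Each sentence is a list of token dicts with keys 'id', 'head', 'upos',
    'deprel' and 'feats' (a dict). An edge is comparable only if both parent
    and child have a non-empty value of `feature` (assumed denominator).
    """
    agree, comparable = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root token (head 0) or dangling head
            pv = parent["feats"].get(feature)
            cv = child["feats"].get(feature)
            if pv is None or cv is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            comparable[key] += 1
            if pv == cv:
                agree[key] += 1
    return {key: (agree[key], comparable[key]) for key in comparable}

# Hypothetical toy sentence, NOT taken from UD_Polish-PDB.
toy_sentence = [
    {"id": 1, "head": 0, "upos": "NUM", "deprel": "root", "feats": {"NumType": "Card"}},
    {"id": 2, "head": 1, "upos": "NUM", "deprel": "conj", "feats": {"NumType": "Card"}},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"NumType": "Ord"}},
    {"id": 4, "head": 1, "upos": "NOUN", "deprel": "nmod", "feats": {}},
]

relations = agreement_relations([toy_sentence])
```

In the toy sentence, the NUM –[conj]–> NUM edge agrees (1 of 1), the NUM –[amod]–> ADJ edge does not (0 of 1), and the edge to the NOUN is skipped because the child has no NumType.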