This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home pl/pos issue tracker

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Polish)

There are 173 NUM lemmas (1%), 206 NUM types (1%) and 742 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: dwa, wiele, trzy, oba, cztery, 10, 30, dużo, pięć, 15

The 10 most frequent NUM types: dwa, wielu, dwóch, trzy, wiele, trzech, 10, dwie, 30, dużo

The 10 most frequent ambiguous lemmas: wiele (NUM 63, ADV 2), 10 (NUM 19, ADJ 2), 30 (NUM 17, ADJ 3), dużo (NUM 16, ADV 13), 15 (NUM 13, ADJ 4), 3 (NUM 11, ADJ 1), 50 (NUM 10, ADJ 1), 20 (NUM 9, ADJ 1), 2 (NUM 8, ADJ 7), 12 (NUM 7, ADJ 2)

The 10 most frequent ambiguous types: wiele (NUM 21, ADV 2), 10 (NUM 20, ADJ 2), 30 (NUM 17, ADJ 3), dużo (NUM 14, ADV 6), 15 (NUM 13, ADJ 4), 3 (NUM 11, ADJ 1), więcej (NUM 11, ADV 6), 50 (NUM 10, ADJ 1), 20 (NUM 9, ADJ 1), 2 (NUM 8, ADJ 7)

Morphology

The form / lemma ratio of NUM is 1.190751 (the average of all parts of speech is 1.801337).

The 1st highest number of forms (7) was observed with the lemma “dwa”: dwa, dwaj, dwie, dwiema, dwoma, dwu, dwóch.

The 2nd highest number of forms (5) was observed with the lemma “cztery”: czterech, czterej, czterem, czterema, cztery.

The 3rd highest number of forms (5) was observed with the lemma “oba”: oba, obaj, obie, obiema, obu.

NUM occurs with 4 features: Case (742; 100% instances), Number (742; 100% instances), Gender (741; 100% instances), Animacy (525; 71% instances)

NUM occurs with 14 feature-value pairs: Animacy=Anim, Animacy=Inan, Animacy=Nhum, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 31 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur (183 tokens). Examples: dwa, trzy, wiele, 10, dużo, 30, cztery, 15, 5, 7

Relations

NUM nodes are attached to their parents using 11 different relations: dobj (238; 32% instances), nsubj (191; 26% instances), advmod (162; 22% instances), nummod (99; 13% instances), conj (22; 3% instances), iobj (11; 1% instances), case (10; 1% instances), appos (4; 1% instances), root (3; 0% instances), cc (1; 0% instances), nsubjpass (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: VERB (564; 76% instances), NOUN (104; 14% instances), NUM (23; 3% instances), ADJ (15; 2% instances), PUNCT (10; 1% instances), PART (6; 1% instances), ADP (5; 1% instances), ADV (4; 1% instances), PROPN (4; 1% instances), AUX (3; 0% instances), ROOT (3; 0% instances), PRON (1; 0% instances)

50 (7%) NUM nodes are leaves.

311 (42%) NUM nodes have one child.

263 (35%) NUM nodes have two children.

118 (16%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 11 different relations: nmod (811; 66% instances), case (285; 23% instances), punct (35; 3% instances), conj (25; 2% instances), acl (21; 2% instances), amod (19; 2% instances), cc (18; 1% instances), nummod (6; 0% instances), cop (4; 0% instances), advmod (3; 0% instances), nsubj (2; 0% instances)

Children of NUM nodes belong to 13 different parts of speech: NOUN (675; 55% instances), ADP (208; 17% instances), X (104; 8% instances), PART (77; 6% instances), PUNCT (35; 3% instances), PRON (26; 2% instances), VERB (24; 2% instances), NUM (23; 2% instances), ADJ (18; 1% instances), CONJ (18; 1% instances), PROPN (17; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]