NUM: numeral
This document is a placeholder for the language-specific documentation for NUM.
Treebank Statistics (UD_Polish)
There are 173 NUM lemmas (1%), 206 NUM types (1%) and 742 NUM tokens (1%).
Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: dwa, wiele, trzy, oba, cztery, 10, 30, dużo, pięć, 15
The 10 most frequent NUM types: dwa, wielu, dwóch, trzy, wiele, trzech, 10, dwie, 30, dużo
The 10 most frequent ambiguous lemmas: wiele (NUM 63, ADV 2), 10 (NUM 19, ADJ 2), 30 (NUM 17, ADJ 3), dużo (NUM 16, ADV 13), 15 (NUM 13, ADJ 4), 3 (NUM 11, ADJ 1), 50 (NUM 10, ADJ 1), 20 (NUM 9, ADJ 1), 2 (NUM 8, ADJ 7), 12 (NUM 7, ADJ 2)
The 10 most frequent ambiguous types: wiele (NUM 21, ADV 2), 10 (NUM 20, ADJ 2), 30 (NUM 17, ADJ 3), dużo (NUM 14, ADV 6), 15 (NUM 13, ADJ 4), 3 (NUM 11, ADJ 1), więcej (NUM 11, ADV 6), 50 (NUM 10, ADJ 1), 20 (NUM 9, ADJ 1), 2 (NUM 8, ADJ 7)
Morphology
The form / lemma ratio of NUM is 1.190751 (the average of all parts of speech is 1.801337).
The 1st highest number of forms (7) was observed with the lemma “dwa”: dwa, dwaj, dwie, dwiema, dwoma, dwu, dwóch.
The 2nd highest number of forms (5) was observed with the lemma “cztery”: czterech, czterej, czterem, czterema, cztery.
The 3rd highest number of forms (5) was observed with the lemma “oba”: oba, obaj, obie, obiema, obu.
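The form / lemma ratio above is consistent with dividing the 206 NUM types by the 173 NUM lemmas. The following is a minimal sketch of that arithmetic, assuming "types" means distinct word forms tagged NUM; the function name `form_lemma_ratio` and the toy pairs are hypothetical and are not taken from the treebank tooling.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Return (ratio, forms_by_lemma) for an iterable of (form, lemma) pairs.

    Assumption: the ratio is the number of distinct forms divided by the
    number of distinct lemmas, all restricted to one part of speech.
    """
    forms = set()
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms.add(form)
        forms_by_lemma[lemma].add(form)
    return len(forms) / len(forms_by_lemma), forms_by_lemma

# Toy input (illustrative only): three forms of "dwa", two forms of "trzy".
toy_pairs = [
    ("dwa", "dwa"), ("dwóch", "dwa"), ("dwie", "dwa"),
    ("trzy", "trzy"), ("trzech", "trzy"),
]
ratio, by_lemma = form_lemma_ratio(toy_pairs)
print(ratio)                    # 2.5 (5 distinct forms / 2 lemmas)
print(len(by_lemma["dwa"]))     # 3

# The counts reported on this page give the published figure.
print(round(206 / 173, 6))      # 1.190751
```

Sorting `by_lemma` by the size of each set would yield a ranking like the "highest number of forms" lines above.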
NUM occurs with 4 features: Case (742; 100% instances), Number (742; 100% instances), Gender (741; 100% instances), Animacy (525; 71% instances)
NUM occurs with 14 feature-value pairs: Animacy=Anim, Animacy=Inan, Animacy=Nhum, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
NUM occurs with 31 feature combinations.
The most frequent feature combination is Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur (183 tokens).
Examples: dwa, trzy, wiele, 10, dużo, 30, cztery, 15, 5, 7
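A feature combination such as the one above is a string of `Feature=Value` pairs joined by `|`, with `_` standing for "no features" in the CoNLL-U FEATS column. As a rough sketch (the helper name `parse_feats` is hypothetical), it can be split into a dictionary like this:

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    "_" is the CoNLL-U placeholder for an empty feature set.
    """
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# The most frequent NUM feature combination reported above.
combo = "Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur"
feats = parse_feats(combo)
print(feats["Case"])        # Acc
print(sorted(feats))        # ['Animacy', 'Case', 'Gender', 'Number']
```

Counting the keys and the key-value pairs of such dictionaries over all NUM tokens is one way to arrive at per-feature and per-pair totals like those listed in this section.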
Relations
NUM nodes are attached to their parents using 11 different relations: dobj (238; 32% instances), nsubj (191; 26% instances), advmod (162; 22% instances), nummod (99; 13% instances), conj (22; 3% instances), iobj (11; 1% instances), case (10; 1% instances), appos (4; 1% instances), root (3; 0% instances), cc (1; 0% instances), nsubjpass (1; 0% instances)
Parents of NUM nodes belong to 12 different parts of speech: VERB (564; 76% instances), NOUN (104; 14% instances), NUM (23; 3% instances), ADJ (15; 2% instances), PUNCT (10; 1% instances), PART (6; 1% instances), ADP (5; 1% instances), ADV (4; 1% instances), PROPN (4; 1% instances), AUX (3; 0% instances), ROOT (3; 0% instances), PRON (1; 0% instances)
50 (7%) NUM nodes are leaves.
311 (42%) NUM nodes have one child.
263 (35%) NUM nodes have two children.
118 (16%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
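The child-degree figures above can be reproduced in principle by counting, for each NUM node, how many other nodes name it as their head. The sketch below assumes standard 10-column CoNLL-U input; the function name `num_child_degrees` and the one-sentence toy tree are hypothetical illustrations, not data from UD_Polish.

```python
from collections import Counter

# Toy CoNLL-U sentence (illustrative annotation, not from the treebank):
# the numeral governs the counted noun, which is consistent with nmod
# being the most frequent relation of NUM children on this page.
TOY_ROWS = [
    ["1", "Mam", "mieć", "VERB", "_", "_", "0", "root", "_", "_"],
    ["2", "pięć", "pięć", "NUM", "_", "_", "1", "dobj", "_", "_"],
    ["3", "kotów", "kot", "NOUN", "_", "_", "2", "nmod", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
]
TOY_CONLLU = "\n".join("\t".join(row) for row in TOY_ROWS) + "\n"

def num_child_degrees(conllu_text, upos="NUM"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        children_per_head = Counter(r[6] for r in rows)  # column 7 is HEAD
        degrees.extend(children_per_head[r[0]] for r in rows if r[3] == upos)
    return degrees

degrees = num_child_degrees(TOY_CONLLU)
print(degrees)                       # [1]
print(sum(d == 0 for d in degrees))  # 0 leaves in the toy sentence
```

Bucketing the returned degrees into 0, 1, 2 and "3 or more" gives a breakdown of the same shape as the leaf and child counts listed above, and `max(degrees)` corresponds to the highest child degree.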
Children of NUM nodes are attached using 11 different relations: nmod (811; 66% instances), case (285; 23% instances), punct (35; 3% instances), conj (25; 2% instances), acl (21; 2% instances), amod (19; 2% instances), cc (18; 1% instances), nummod (6; 0% instances), cop (4; 0% instances), advmod (3; 0% instances), nsubj (2; 0% instances)
Children of NUM nodes belong to 13 different parts of speech: NOUN (675; 55% instances), ADP (208; 17% instances), X (104; 8% instances), PART (77; 6% instances), PUNCT (35; 3% instances), PRON (26; 2% instances), VERB (24; 2% instances), NUM (23; 2% instances), ADJ (18; 1% instances), CONJ (18; 1% instances), PROPN (17; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances)
NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]