This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Latvian)

There are 190 NUM lemmas (5%), 212 NUM types (3%) and 424 NUM tokens (2%). Out of 16 observed tags, NUM ranks 6th in number of lemmas, 6th in number of types and 11th in number of tokens.
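As a rough illustration of how lemma, type and token counts of this kind can be derived, the sketch below counts them for one UPOS tag in CoNLL-U text. The function name `upos_counts` and the sample are hypothetical, and the sketch assumes that a type is a distinct word form compared exactly as written; the script that produced the statistics above may normalize forms differently.

```python
# Hypothetical mini-sample in CoNLL-U layout (invented, not from UD_Latvian).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "1\tdivi\tdivi\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_",
    "2\tA\ta\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "1\tdivus\tdivi\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_",
    "2\tB\tb\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "1\t3\t3\tNUM\t_\tNumType=Card\t0\troot\t_\t_",
])


def upos_counts(conllu, upos):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu.splitlines():
        cols = line.split("\t")
        # Skip blank lines, comments, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            lemmas.add(cols[2])  # LEMMA column
            types.add(cols[1])   # FORM column
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "NUM"))  # (2, 3, 3)
```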

The 10 most frequent NUM lemmas: viens, viena, trīs, 000, 25, divi, divas, 1, 3, 50

The 10 most frequent NUM types: viens, 000, 25, trīs, vienu, viena, 1, 3, 50, desmit

The most frequent ambiguous lemmas (only 4 observed): tūkstoši (NUM 4, NOUN 1), +371 (PART 1, NUM 1), 14.00 (NUM 1, SYM 1), III (ADJ 2, NUM 1)

The most frequent ambiguous types (only 3 observed): +371 (NUM 1, PART 1), 14.00 (NUM 1, SYM 1), III (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.115789 (the average of all parts of speech is 1.616894).
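The ratio appears to be the number of types divided by the number of lemmas. This is an inference from the counts reported in the treebank statistics (212 types, 190 lemmas), not a documented definition. A quick check, with illustrative variable names:

```python
num_types = 212   # NUM types reported in the treebank statistics
num_lemmas = 190  # NUM lemmas reported in the treebank statistics

ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.115789
```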

The highest number of forms (5) was observed with the lemma “divi”: divi, diviem, divos, divu, divus.

The lemma “viena” also has 5 forms: viena, vienai, vienas, vienu, vienā.

The third highest number of forms (4) was observed with the lemma “divas”: divas, divu, divām, divās.

NUM occurs with 4 features: NumType (424; 100% of instances), Number (141; 33% of instances), Case (131; 31% of instances), Gender (130; 31% of instances)

NUM occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 21 feature combinations. The most frequent feature combination is NumType=Card (283 tokens). Examples: 000, 25, viens, 1, 3, 50, trīs, 20, 200, 8000
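The three kinds of counts above (features, feature-value pairs, feature combinations) can be obtained from the FEATS column, where CoNLL-U stores features as `Name=Value` pairs separated by `|`. The sketch below is illustrative only: the function name `feature_stats` and the input strings are made up, not drawn from the treebank.

```python
from collections import Counter


def feature_stats(feats_column):
    """Summarize FEATS strings such as 'Case=Nom|NumType=Card'; '_' means no features."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the whole string is one feature combination
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Hypothetical FEATS values for four NUM tokens.
feats_column = [
    "NumType=Card",
    "NumType=Card",
    "Case=Nom|Gender=Masc|NumType=Card|Number=Sing",
    "Case=Acc|Gender=Fem|NumType=Card|Number=Sing",
]
features, pairs, combos = feature_stats(feats_column)
print(len(features), len(pairs), len(combos))  # 4 6 3
print(combos.most_common(1))  # [('NumType=Card', 2)]
```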

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (290; 68% of instances), compound (39; 9% of instances), nmod (38; 9% of instances), conj (16; 4% of instances), acl (11; 3% of instances), root (9; 2% of instances), nsubj (7; 2% of instances), name (3; 1% of instances), parataxis (3; 1% of instances), dobj (2; 0% of instances), iobj (2; 0% of instances), mwe (2; 0% of instances), ccomp (1; 0% of instances), nsubjpass (1; 0% of instances)
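The percentages appear to be each relation's share of all 424 NUM tokens, rounded to the nearest whole number, which would explain why relations with one or two occurrences show 0%. The sketch below rechecks this against the counts listed above; only the variable names are invented.

```python
# Counts of incoming relations for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 290, "compound": 39, "nmod": 38, "conj": 16, "acl": 11,
    "root": 9, "nsubj": 7, "name": 3, "parataxis": 3, "dobj": 2,
    "iobj": 2, "mwe": 2, "ccomp": 1, "nsubjpass": 1,
}

total = sum(relation_counts.values())
print(total)  # 424, the number of NUM tokens

percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
print(percentages["nummod"], percentages["nmod"], percentages["dobj"])  # 68 9 0
```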

Parents of NUM nodes belong to 11 different parts of speech: NOUN (235; 55% of instances), SYM (71; 17% of instances), NUM (52; 12% of instances), VERB (24; 6% of instances), PROPN (14; 3% of instances), PUNCT (10; 2% of instances), ROOT (9; 2% of instances), X (6; 1% of instances), ADJ (1; 0% of instances), ADV (1; 0% of instances), PRON (1; 0% of instances)

308 (73%) NUM nodes are leaves.

56 (13%) NUM nodes have one child.

41 (10%) NUM nodes have two children.

19 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
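The leaf and child-degree figures above can be derived from the HEAD column by counting how many tokens in the same sentence point at each NUM node. The following is a minimal sketch under stated assumptions: the function name `child_degree_buckets` is hypothetical, and the sample sentence uses invented annotation rather than real treebank data.

```python
from collections import Counter

# Hypothetical sample with invented annotation; columns follow CoNLL-U
# (ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC).
SAMPLE = "\n".join([
    "1\t25\t25\tNUM\t_\tNumType=Card\t2\tcompound\t_\t_",
    "2\t000\t000\tNUM\t_\tNumType=Card\t3\tnummod\t_\t_",
    "3\tX\tx\tNOUN\t_\t_\t0\troot\t_\t_",
])


def child_degree_buckets(conllu, upos="NUM"):
    """Count nodes of `upos` with 0, 1, 2 and 3-or-more children (key 3)."""
    buckets = Counter()
    sentence = []
    # The extra "" guarantees that the last sentence is flushed.
    for line in conllu.splitlines() + [""]:
        cols = line.split("\t")
        if len(cols) == 10:
            if cols[0].isdigit():  # ignore multiword ranges and empty nodes
                sentence.append(cols)
            continue
        if line.startswith("#"):
            continue
        # Any other line (normally blank) ends the current sentence.
        children = Counter(tok[6] for tok in sentence)  # HEAD id -> child count
        for tok in sentence:
            if tok[3] == upos:
                buckets[min(children[tok[0]], 3)] += 1
        sentence = []
    return buckets


buckets = child_degree_buckets(SAMPLE)
print(buckets[0], buckets[1], buckets[2], buckets[3])  # 1 1 0 0
```

Here the first NUM token is a leaf and the second has one child, so the sample yields one node in each of the first two buckets.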

Children of NUM nodes are attached using 20 different relations: compound (56; 26% of instances), advmod (27; 13% of instances), nmod (23; 11% of instances), punct (23; 11% of instances), mwe (13; 6% of instances), conj (11; 5% of instances), acl (10; 5% of instances), case (10; 5% of instances), aux (8; 4% of instances), cc (7; 3% of instances), nsubj (7; 3% of instances), amod (4; 2% of instances), nummod (4; 2% of instances), name (3; 1% of instances), discourse (2; 1% of instances), appos (1; 0% of instances), ccomp (1; 0% of instances), det (1; 0% of instances), iobj (1; 0% of instances), neg (1; 0% of instances)

Children of NUM nodes belong to 14 different parts of speech: NUM (52; 24% of instances), NOUN (35; 16% of instances), ADV (28; 13% of instances), PUNCT (25; 12% of instances), SYM (16; 8% of instances), SCONJ (13; 6% of instances), VERB (12; 6% of instances), ADP (10; 5% of instances), PROPN (6; 3% of instances), CONJ (5; 2% of instances), PART (4; 2% of instances), PRON (4; 2% of instances), ADJ (2; 1% of instances), DET (1; 0% of instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]