NUM: numeral
This document is a placeholder for the language-specific documentation for NUM.
Treebank Statistics (UD_Latvian)
There are 190 NUM lemmas (5%), 212 NUM types (3%) and 424 NUM tokens (2%).
Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 11 in number of tokens.
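The lemma, type and token counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the helper name `upos_counts` and the two-sentence sample are made up for illustration, and it assumes that types are compared case-sensitively.

```python
# Made-up CoNLL-U fragment for illustration (not taken from UD_Latvian).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "1\tdivi\tdivi\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\tgadi\tgads\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = demo-2",
    "1\tdivus\tdivi\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\tgadus\tgads\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
])


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "NUM"))  # (1, 2, 2)
```

In the sample, the single lemma "divi" appears in two forms, so the function reports one lemma, two types and two tokens.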
The 10 most frequent NUM lemmas: viens, viena, trīs, 000, 25, divi, divas, 1, 3, 50
The 10 most frequent NUM types: viens, 000, 25, trīs, vienu, viena, 1, 3, 50, desmit
The 10 most frequent ambiguous lemmas: tūkstoši (NUM 4, NOUN 1), +371 (PART 1, NUM 1), 14.00 (NUM 1, SYM 1), III (ADJ 2, NUM 1)
The 10 most frequent ambiguous types: +371 (NUM 1, PART 1), 14.00 (NUM 1, SYM 1), III (ADJ 2, NUM 1)
- +371
- 14.00
- III
  - ADJ 2: Datu centrs “ Dattum “ būs vienīgais “ Tier III “ datu centrs Baltijas valstīs un Ziemeļvalstīs .
  - NUM 1: Savukārt biznesa klienti Latvijā un ārzemēs saņems uzņēmējdarbībai vajadzīgus mākoņdatošanas un lieljaudas datu analīzes pakalpojumus , izmantojot kompānijas jauno augstas klases ( Tier III ) datu centru un datu pārraides maģistrāli no Krievijas caur Latviju uz Vāciju , Tartačuks sacīja .
Morphology
The form / lemma ratio of NUM is 1.115789 (the average of all parts of speech is 1.616894).
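The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given under Treebank Statistics. The short check below assumes that this is how the figure was derived; the variable names are illustrative.

```python
# Counts reported in the Treebank Statistics section of this page.
num_types = 212   # distinct NUM word forms
num_lemmas = 190  # distinct NUM lemmas

ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.115789
```

A ratio this close to 1 means that most NUM lemmas were observed in a single form, which fits the large share of digit tokens such as 000, 25 and 50.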
The 1st highest number of forms (5) was observed with the lemma “divi”: divi, diviem, divos, divu, divus.
The 2nd highest number of forms (5) was observed with the lemma “viena”: viena, vienai, vienas, vienu, vienā.
The 3rd highest number of forms (4) was observed with the lemma “divas”: divas, divu, divām, divās.
NUM occurs with 4 features: NumType (424; 100% instances), Number (141; 33% instances), Case (131; 31% instances), Gender (130; 31% instances)
NUM occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing
NUM occurs with 21 feature combinations.
The most frequent feature combination is NumType=Card (283 tokens).
Examples: 000, 25, viens, 1, 3, 50, trīs, 20, 200, 8000
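Feature and feature-combination counts of this kind can be derived from the FEATS column of a CoNLL-U file, where pairs are written as `Name=Value` and joined with `|`. The sketch below is only an illustration: `parse_feats`, `feature_stats` and the three sample strings are hypothetical, although the feature values used do appear in the list above.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|NumType=Card' into a dict."""
    if feats == "_":
        return {}  # '_' marks an empty FEATS column
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_column):
    """Count how often each feature and each full combination occurs."""
    feature_counts = Counter()
    combinations = Counter()
    for feats in feats_column:
        feature_counts.update(parse_feats(feats).keys())
        combinations[feats] += 1
    return feature_counts, combinations


# Hypothetical FEATS values for three NUM tokens.
SAMPLE_FEATS = [
    "NumType=Card",
    "NumType=Card",
    "Case=Nom|Gender=Masc|NumType=Card|Number=Plur",
]

feature_counts, combinations = feature_stats(SAMPLE_FEATS)
print(feature_counts["NumType"])     # 3
print(combinations.most_common(1))   # [('NumType=Card', 2)]
```

Tokens whose only feature is NumType=Card, typically numbers written in digits, form the most frequent combination in the sample, mirroring the pattern reported above.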
Relations
NUM nodes are attached to their parents using 14 different relations: nummod (290; 68% instances), compound (39; 9% instances), nmod (38; 9% instances), conj (16; 4% instances), acl (11; 3% instances), root (9; 2% instances), nsubj (7; 2% instances), name (3; 1% instances), parataxis (3; 1% instances), dobj (2; 0% instances), iobj (2; 0% instances), mwe (2; 0% instances), ccomp (1; 0% instances), nsubjpass (1; 0% instances)
Parents of NUM nodes belong to 11 different parts of speech: NOUN (235; 55% instances), SYM (71; 17% instances), NUM (52; 12% instances), VERB (24; 6% instances), PROPN (14; 3% instances), PUNCT (10; 2% instances), ROOT (9; 2% instances), X (6; 1% instances), ADJ (1; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances)
308 (73%) NUM nodes are leaves.
56 (13%) NUM nodes have one child.
41 (10%) NUM nodes have two children.
19 (4%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
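The child-degree figures above can be read as a histogram over the number of dependents of each NUM node. The sketch below shows one way to compute child degrees from the HEAD column of a sentence and then re-derives the reported percentages from the raw counts. The helper `child_degrees` and the four-token sentence are hypothetical.

```python
from collections import Counter


def child_degrees(heads):
    """heads[i] is the 1-based HEAD of token i+1 (0 marks the root).

    Returns the number of children of each token, in token order.
    """
    counts = Counter(h for h in heads if h != 0)
    return [counts.get(i, 0) for i in range(1, len(heads) + 1)]


# Hypothetical sentence: token 2 is the root, tokens 1 and 3 attach to it,
# and token 4 attaches to token 3.
heads = [2, 0, 2, 3]
degrees = child_degrees(heads)
print(degrees)  # [0, 2, 1, 0]

# Counts reported on this page for NUM nodes.
buckets = {"leaves": 308, "one child": 56, "two children": 41, "three or more": 19}
total = sum(buckets.values())
shares = {name: round(100 * n / total) for name, n in buckets.items()}
print(total)   # 424
print(shares)  # {'leaves': 73, 'one child': 13, 'two children': 10, 'three or more': 4}
```

The four buckets sum to 424, the total number of NUM tokens, so every NUM node falls into exactly one of them.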
Children of NUM nodes are attached using 20 different relations: compound (56; 26% instances), advmod (27; 13% instances), nmod (23; 11% instances), punct (23; 11% instances), mwe (13; 6% instances), conj (11; 5% instances), acl (10; 5% instances), case (10; 5% instances), aux (8; 4% instances), cc (7; 3% instances), nsubj (7; 3% instances), amod (4; 2% instances), nummod (4; 2% instances), name (3; 1% instances), discourse (2; 1% instances), appos (1; 0% instances), ccomp (1; 0% instances), det (1; 0% instances), iobj (1; 0% instances), neg (1; 0% instances)
Children of NUM nodes belong to 14 different parts of speech: NUM (52; 24% instances), NOUN (35; 16% instances), ADV (28; 13% instances), PUNCT (25; 12% instances), SYM (16; 8% instances), SCONJ (13; 6% instances), VERB (12; 6% instances), ADP (10; 5% instances), PROPN (6; 3% instances), CONJ (5; 2% instances), PART (4; 2% instances), PRON (4; 2% instances), ADJ (2; 1% instances), DET (1; 0% instances)