Treebank Statistics: UD_Latvian-LVTB: POS Tags: NUM
There are 623 NUM lemmas (3%), 691 NUM types (1%) and 3711 NUM tokens (1%).
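The three counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the toy `SAMPLE` fragment are made up for illustration, and it assumes that "types" means distinct word forms exactly as written (case-sensitive) and that multiword-token ranges and empty nodes are skipped.

```python
def pos_counts(conllu_text, upos="NUM"):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID
        # (this skips ranges like "1-2" and empty nodes like "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical toy fragment, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = vienu litru viena divi",
    "1\tvienu\tviens\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\tlitru\tlitrs\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tviena\tviens\tNUM\t_\t_\t2\tconj\t_\t_",
    "4\tdivi\tdivi\tNUM\t_\t_\t2\tconj\t_\t_",
])

print(pos_counts(SAMPLE, "NUM"))  # (2, 3, 3)
```

In the toy fragment the two forms vienu and viena share the lemma viens, which is why the type count exceeds the lemma count.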
Out of 17 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: viens, divi, trīs, otrs, četri, pieci, 20, desmit, seši, 3
The 10 most frequent NUM types: viens, trīs, vienu, viena, divas, vienā, divi, 20, desmit, otru
The 10 most frequent ambiguous lemmas: otrs (NUM 158, ADJ 8), i (PART 6, CCONJ 2, NUM 1, SYM 1), V (PROPN 6, NUM 3), 16:00 (NUM 1, SYM 1), desmits (NOUN 18, NUM 1), otrais (ADJ 130, NUM 1)
The 10 most frequent ambiguous types: vienu (NUM 149, X 1), 8 (NUM 18, X 1), otrā (ADJ 20, NUM 17), I (NUM 12, CCONJ 2), 2008 (NUM 6, ADJ 1), V (PROPN 6, NUM 3), 16:00 (NUM 1, SYM 1), desmitiem (NOUN 7, NUM 1), l (NOUN 1, NUM 1), otrās (ADJ 18, NUM 1)
- vienu
- 8
  - NUM 18: Sajūtot stiepšanu , notur 5 - 8 sekundes .
  - X 1: Ja salīdzina akcīzes nodokļa likmju starpību starp dīzeļdegvielu un biodīzeļdegvielu , tad šī starpība ir nodokļa pamatlikmes apmērā , līdz ar to nodokļa likmju starpība par vienu litru produkta 2007. gadā ir ,17 8 santīmi , 2008. gadā – 19,2 santīmi , 2011. gadā 21,1 santīmi un 2013. gadā – 23,1 santīms .
- otrā
- I
- 2008
  - NUM 6: No valsts budžeta 2007. / 08. studiju gadam izglītības jomā par budžeta vietām tika izlietots ap pieciem miljoniem latu gadā ( Izglītības un zinātnes ministrija Augstākās izglītības departaments , 2008 ) .
  - ADJ 1: 2008 gada I ceturkšņa dati liecina , ka dzimstības un mirstības tendences varētu saglabāties arī šogad .
- V
- 16:00
  - NUM 1: Šī gada 19. oktobrī Latvijas Tautas frontes muzejā ( Vecpilsētas iela 13/15 , Rīgā ) plkst. 16:00 tiks atklāta UNESCO atpazīstamības zīme programmas “ Pasaules atmiņa “ starptautiskajā reģistrā iekļautajai nominācijai “ Baltijas Ceļš - cilvēku ķēde trīs valstu vienotiem centieniem pēc brīvības “ .
  - SYM 1: To , vai vēju pilsētas basketbolistiem izdosies izcīnīt astoto panākumu mēs varēsim pārliecināties sestdienas pēcpusdienā pulksten 16:00 .
- desmitiem
- l
- otrās
Morphology
The form / lemma ratio of NUM is 1.109149 (the average of all parts of speech is 2.305217).
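The reported ratio is consistent with dividing the number of NUM types (691) by the number of NUM lemmas (623) given at the top of the page. The sketch below checks that arithmetic; the helper name `form_lemma_ratio` is illustrative, and the page does not confirm that this is exactly how the generating script computes the figure.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


ratio = form_lemma_ratio(691, 623)
print(f"{ratio:.6f}")  # 1.109149
```

A value this close to 1 means most NUM lemmas occur in a single form, which fits the many digit-only numerals such as 20 or 3 among the frequent lemmas.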
The 1st highest number of forms (11) was observed with the lemma “viens”: Vienām, viena, vienai, vienam, vienas, vieni, vieniem, vienos, viens, vienu, vienā.
The 2nd highest number of forms (10) was observed with the lemma “otrs”: otra, otrai, otram, otras, otriem, otrs, otru, otrā, otrām, otrās.
The 3rd highest number of forms (8) was observed with the lemma “divi”: divas, divi, diviem, divos, divu, divus, divām, divās.
NUM occurs with 5 features: NumType (3711; 100% instances), Number (1848; 50% instances), Case (1699; 46% instances), Gender (1699; 46% instances), Typo (5; 0% instances)
NUM occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, NumType=Frac, Number=Plur, Number=Sing, Typo=Yes
NUM occurs with 31 feature combinations.
The most frequent feature combination is NumType=Card (1860 tokens).
Examples: viens, trīs, vienu, viena, 20, 3, 2, 1, 15, 30
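The feature, feature-value and feature-combination counts in this section can all be derived from the FEATS column of the NUM tokens. The following is a minimal sketch under stated assumptions: the function name `feature_stats` and the toy input are hypothetical, FEATS strings use the standard CoNLL-U `Name=Value|Name=Value` layout with `_` for "no features", and multi-valued features such as `Case=Acc,Nom` are not split.

```python
from collections import Counter


def feature_stats(feats_values):
    """Summarise FEATS strings: per-feature counts, pairs, combinations."""
    names = Counter()         # feature name -> number of tokens carrying it
    pairs = set()             # distinct Name=Value pairs
    combinations = Counter()  # full FEATS string -> number of tokens
    for feats in feats_values:
        combinations[feats] += 1
        if feats == "_":
            continue  # token has no morphological features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs.add(pair)
    return names, pairs, combinations


# Hypothetical FEATS values for four NUM tokens.
toy = [
    "NumType=Card",
    "Case=Acc|Gender=Masc|NumType=Card|Number=Sing",
    "NumType=Card",
    "Case=Nom|Gender=Fem|NumType=Card|Number=Sing",
]
names, pairs, combinations = feature_stats(toy)
print(names["NumType"], len(pairs), len(combinations))  # 4 6 3
print(combinations.most_common(1))  # [('NumType=Card', 2)]
```

Dividing each entry of `names` by the number of tokens gives percentages of the kind shown in the feature list above.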
Relations
NUM nodes are attached to their parents using 24 different relations: nummod (2745; 74% instances), conj (192; 5% instances), parataxis (118; 3% instances), nsubj (90; 2% instances), dep (88; 2% instances), root (75; 2% instances), nmod (62; 2% instances), flat:name (58; 2% instances), compound (51; 1% instances), obj (45; 1% instances), obl (42; 1% instances), iobj (39; 1% instances), xcomp (35; 1% instances), flat (17; 0% instances), discourse (11; 0% instances), nsubj:pass (8; 0% instances), acl (7; 0% instances), ccomp (7; 0% instances), appos (6; 0% instances), orphan (5; 0% instances), advcl (4; 0% instances), flat:foreign (3; 0% instances), amod (2; 0% instances), csubj (1; 0% instances)
Parents of NUM nodes belong to 12 different parts of speech: NOUN (2529; 68% instances), VERB (450; 12% instances), NUM (242; 7% instances), SYM (208; 6% instances), PROPN (96; 3% instances), [no parent; the node is the root] (75; 2% instances), X (48; 1% instances), ADJ (39; 1% instances), ADV (14; 0% instances), PRON (8; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)
2447 (66%) NUM nodes are leaves.
838 (23%) NUM nodes have one child.
249 (7%) NUM nodes have two children.
177 (5%) NUM nodes have three or more children.
The highest child degree of a NUM node is 11.
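The four buckets above sum to 2447 + 838 + 249 + 177 = 3711, the total number of NUM tokens. A child-degree distribution of this kind can be computed from the HEAD column alone. The sketch below is illustrative, not the page's own script: the function name `child_degree_histogram` and the toy sentence are hypothetical, and each sentence is assumed to be a list of `(upos, head)` tuples with 1-based head indices and 0 for the root.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NUM"):
    """Map child count -> number of nodes with that UPOS and child count."""
    hist = Counter()
    for sent in sentences:
        # Number of children per node, keyed by the 1-based head index.
        n_children = Counter(head for _, head in sent if head > 0)
        for idx, (tag, _) in enumerate(sent, start=1):
            if tag == upos:
                hist[n_children[idx]] += 1  # missing key counts as 0
    return hist


# Hypothetical sentence: token 2 is the root, token 4 hangs off token 3.
toy = [[("NUM", 2), ("NOUN", 0), ("NUM", 2), ("PUNCT", 3)]]
hist = child_degree_histogram(toy)
print(sorted(hist.items()))  # [(0, 1), (1, 1)]
print(max(hist))             # highest child degree: 1
```

Here `hist[0]` corresponds to the leaf count, and `max(hist)` to the highest child degree reported above.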
Children of NUM nodes are attached using 27 different relations: punct (582; 27% instances), nmod (252; 12% instances), advmod (230; 11% instances), conj (198; 9% instances), case (165; 8% instances), discourse (91; 4% instances), obl (90; 4% instances), cop (86; 4% instances), cc (80; 4% instances), nsubj (72; 3% instances), compound (56; 3% instances), flat:name (35; 2% instances), dep (32; 1% instances), det (29; 1% instances), amod (28; 1% instances), acl (24; 1% instances), flat (21; 1% instances), orphan (16; 1% instances), parataxis (14; 1% instances), mark (10; 0% instances), csubj (8; 0% instances), advcl (7; 0% instances), appos (4; 0% instances), flat:foreign (4; 0% instances), nummod (3; 0% instances), iobj (2; 0% instances), goeswith (1; 0% instances)
Children of NUM nodes belong to 16 different parts of speech: PUNCT (582; 27% instances), NOUN (330; 15% instances), ADV (308; 14% instances), NUM (242; 11% instances), ADP (162; 8% instances), AUX (86; 4% instances), PART (85; 4% instances), PRON (75; 4% instances), CCONJ (64; 3% instances), VERB (52; 2% instances), PROPN (38; 2% instances), ADJ (32; 1% instances), SCONJ (29; 1% instances), DET (28; 1% instances), SYM (20; 1% instances), X (7; 0% instances)