This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Romanian)

There are 919 NUM lemmas (5%), 992 NUM types (3%) and 5539 NUM tokens (3%). Out of 17 observed tags, NUM ranks 5th in number of lemmas, 5th in number of types and 12th in number of tokens.
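As an illustration of how lemma, type and token counts like these can be derived, here is a minimal sketch that reads CoNLL-U text and counts distinct lemmas, distinct word forms and tokens for one tag. The function name `pos_counts` and the sample sentences are hypothetical, and the sketch assumes that types are compared case-sensitively; the script that produced the figures above may differ.

```python
# Tiny hypothetical CoNLL-U fragment (not taken from UD_Romanian).
SAMPLE = """\
# sent_id = demo-1
1\tdouă\tdoi\tNUM\t_\t_\t2\tnummod\t_\t_
2\tcase\tcasă\tNOUN\t_\t_\t0\troot\t_\t_

# sent_id = demo-2
1\tdoi\tdoi\tNUM\t_\t_\t2\tnummod\t_\t_
2\tani\tan\tNOUN\t_\t_\t0\troot\t_\t_
"""


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges such as "1-2"
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: case-sensitive comparison
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "NUM"))
```

In the sample, the two NUM tokens share the lemma "doi" but have different forms, so the lemma count is lower than the type count.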

The 10 most frequent NUM lemmas: doi, 1, 2, prim, 3, trei, 4, 5, 6, patru

The 10 most frequent NUM types: 1, 2, 3, două, 4, trei, 5, 6, primul, doi

The most frequent ambiguous lemmas (7 listed): prim (NUM 249, ADJ 2), întâi (NUM 16, ADV 12), dintâi (NUM 10, ADV 2), zero (NUM 8, NOUN 3), X (NOUN 17, NUM 4), xi (NUM 1, NOUN 1), iv (NUM 1, NOUN 1)

The 10 most frequent ambiguous types: I (NUM 58, NOUN 10, PRON 5), primele (NUM 30, NOUN 1), iv (NOUN 1, NUM 1), nouă (ADJ 26, NUM 9, PRON 1), dintâi (NUM 10, ADV 2), V (NOUN 12, NUM 9), zero (NUM 7, NOUN 3), VI (NUM 6, PRON 1), primei (NUM 6, NOUN 1), primelor (NUM 5, NOUN 1)
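Ambiguity here means that the same lemma or word form is observed with more than one tag. A minimal sketch of that check follows, assuming the input is a sequence of (item, tag) pairs; the function name `ambiguous_items` and the sample pairs are hypothetical and not part of any UD tool.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs):
    """Map each item seen with more than one tag to its per-tag counts."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    # Keep only items that occur with at least two different tags.
    return {item: tags for item, tags in by_item.items() if len(tags) > 1}


# Hypothetical observations, for illustration only.
observations = [("nouă", "ADJ"), ("nouă", "NUM"), ("nouă", "ADJ"), ("trei", "NUM")]
for item, tags in ambiguous_items(observations).items():
    print(item, dict(tags))
```

Passing (lemma, tag) pairs gives ambiguous lemmas; passing (form, tag) pairs gives ambiguous types.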

Morphology

The form / lemma ratio of NUM is 1.079434 (the average of all parts of speech is 1.826835).
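The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given under Treebank Statistics. The short check below assumes that this is how the figure was computed; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# 992 NUM types and 919 NUM lemmas, as reported for UD_Romanian.
print(f"{form_lemma_ratio(992, 919):.6f}")
```

A ratio close to 1 means that most NUM lemmas occur in a single form, which fits the large share of digit numerals in the treebank.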

The highest number of forms (12) was observed with the lemma “prim”: prim, prim-, prima, prime, primei, primele, primelor, primii, primilor, primul, primului, primă.

The second-highest number of forms (10) was observed with the lemma “ultim”: ultim, ultima, ultime, ultimei, ultimele, ultimelor, ultimii, ultimilor, ultimul, ultimului.

The third-highest number of forms (6) was observed with the lemma “doi”: II, doi, doilea, doua, două, secund.

NUM occurs with 7 features: NumType (5524; 100% of instances), Number (5509; 99% of instances), NumForm (5476; 99% of instances), Gender (901; 16% of instances), Case (456; 8% of instances), Definite (453; 8% of instances), PronType (48; 1% of instances)

NUM occurs with 14 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, NumForm=Digit, NumForm=Roman, NumForm=Word, NumType=Card, NumType=Ord, Number=Plur, Number=Sing, PronType=Tot

NUM occurs with 35 feature combinations. The most frequent feature combination is Number=Plur|NumForm=Digit|NumType=Card (3959 tokens). Examples: 1, 2, 3, 4, 5, 6, 7, 8, 2004, 10
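The three kinds of counts above (features, feature-value pairs, feature combinations) can all be read off the FEATS column of a CoNLL-U file. The following is a minimal sketch, assuming FEATS strings of the form `Name=Value|Name=Value` with `_` for no features, and treating a multi-value entry such as `Case=Acc,Nom` as one pair, as the list above does. The helper names and the sample values are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(feats_values):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos


# Hypothetical FEATS values of three NUM tokens, for illustration only.
SAMPLE_FEATS = [
    "Number=Plur|NumForm=Digit|NumType=Card",
    "Number=Plur|NumForm=Digit|NumType=Card",
    "Case=Acc,Nom|Gender=Masc|NumForm=Word|NumType=Ord|Number=Sing",
]
features, pairs, combos = feature_stats(SAMPLE_FEATS)
print(combos.most_common(1))
```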

Relations

NUM nodes are attached to their parents using 27 different relations: nummod (2907; 52% of instances), nmod (762; 14% of instances), parataxis (742; 13% of instances), amod (292; 5% of instances), conj (288; 5% of instances), nmod:tmod (174; 3% of instances), goeswith (83; 1% of instances), nsubj (69; 1% of instances), compound (51; 1% of instances), dobj (29; 1% of instances), mwe (26; 0% of instances), root (22; 0% of instances), nmod:pmod (19; 0% of instances), appos (17; 0% of instances), nsubjpass (12; 0% of instances), remnant (9; 0% of instances), xcomp (8; 0% of instances), punct (7; 0% of instances), advcl (5; 0% of instances), acl (4; 0% of instances), dep (4; 0% of instances), advmod (2; 0% of instances), ccomp (2; 0% of instances), name (2; 0% of instances), csubj (1; 0% of instances), foreign (1; 0% of instances), iobj (1; 0% of instances)

Parents of NUM nodes belong to 15 different parts of speech: NOUN (3592; 65% of instances), VERB (1095; 20% of instances), NUM (476; 9% of instances), PROPN (82; 1% of instances), ADV (77; 1% of instances), ADJ (63; 1% of instances), SYM (45; 1% of instances), ADP (22; 0% of instances), ROOT (22; 0% of instances), PRON (17; 0% of instances), PUNCT (16; 0% of instances), DET (15; 0% of instances), X (8; 0% of instances), AUX (7; 0% of instances), CONJ (2; 0% of instances)

3169 (57%) NUM nodes are leaves.

1042 (19%) NUM nodes have one child.

1062 (19%) NUM nodes have two children.

266 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 16.
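The child-degree figures above (leaves, one child, two children, three or more) can be obtained by counting, for every NUM node, how many tokens in the same sentence name it as their head. A minimal sketch follows, assuming each sentence is given as a list of (id, upos, head) tuples; the function name `child_degree_histogram` and the sample sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NUM"):
    """Map child count -> number of nodes with the given tag."""
    hist = Counter()
    for sent in sentences:
        # How many tokens point to each id as their head.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1  # 0 for leaves
    return hist


# Hypothetical sentences as (id, upos, head) tuples; head 0 is the root.
SENTENCES = [
    [(1, "NUM", 2), (2, "NOUN", 0)],                   # NUM is a leaf
    [(1, "ADP", 2), (2, "NUM", 0), (3, "PUNCT", 2)],   # NUM has two children
]
print(dict(child_degree_histogram(SENTENCES)))
```

As a consistency check on the reported figures, the four groups sum to the total number of NUM tokens: 3169 + 1042 + 1062 + 266 = 5539.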

Children of NUM nodes are attached using 24 different relations: punct (1824; 43% of instances), case (853; 20% of instances), det (334; 8% of instances), conj (294; 7% of instances), advmod (218; 5% of instances), cc (171; 4% of instances), nmod (133; 3% of instances), goeswith (81; 2% of instances), nummod (66; 2% of instances), compound (57; 1% of instances), cop (30; 1% of instances), nsubj (28; 1% of instances), amod (26; 1% of instances), appos (26; 1% of instances), acl (22; 1% of instances), mark (16; 0% of instances), mwe (16; 0% of instances), dep (11; 0% of instances), remnant (9; 0% of instances), parataxis (8; 0% of instances), advcl (6; 0% of instances), aux (6; 0% of instances), nmod:pmod (2; 0% of instances), neg (1; 0% of instances)

Children of NUM nodes belong to 16 different parts of speech: PUNCT (1779; 42% of instances), ADP (868; 20% of instances), NUM (476; 11% of instances), DET (388; 9% of instances), CONJ (184; 4% of instances), NOUN (169; 4% of instances), ADV (157; 4% of instances), SYM (66; 2% of instances), VERB (62; 1% of instances), PRON (30; 1% of instances), ADJ (21; 0% of instances), PROPN (14; 0% of instances), SCONJ (13; 0% of instances), AUX (6; 0% of instances), X (3; 0% of instances), PART (2; 0% of instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]