This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ro/pos issue tracker

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Romanian)

There are 919 NUM lemmas (5%), 992 NUM types (3%) and 5539 NUM tokens (3%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: doi, 1, 2, prim, 3, trei, 4, 5, 6, patru

The 10 most frequent NUM types: 1, 2, 3, două, 4, trei, 5, 6, primul, doi

The 10 most frequent ambiguous lemmas: prim (NUM 249, ADJ 2), întâi (NUM 16, ADV 12), dintâi (NUM 10, ADV 2), zero (NUM 8, NOUN 3), X (NOUN 17, NUM 4), xi (NUM 1, NOUN 1), iv (NUM 1, NOUN 1)

The 10 most frequent ambiguous types: I (NUM 58, NOUN 10, PRON 5), primele (NUM 30, NOUN 1), iv (NOUN 1, NUM 1), nouă (ADJ 26, NUM 9, PRON 1), dintâi (NUM 10, ADV 2), V (NOUN 12, NUM 9), zero (NUM 7, NOUN 3), VI (NUM 6, PRON 1), primei (NUM 6, NOUN 1), primelor (NUM 5, NOUN 1)

Morphology

The form / lemma ratio of NUM is 1.079434 (the average of all parts of speech is 1.826835).

The 1st highest number of forms (12) was observed with the lemma “prim”: prim, prim-, prima, prime, primei, primele, primelor, primii, primilor, primul, primului, primă.

The 2nd highest number of forms (10) was observed with the lemma “ultim”: ultim, ultima, ultime, ultimei, ultimele, ultimelor, ultimii, ultimilor, ultimul, ultimului.

The 3rd highest number of forms (6) was observed with the lemma “doi”: II, doi, doilea, doua, două, secund.

NUM occurs with 7 features: ro-feat/NumType (5524; 100% instances), ro-feat/Number (5509; 99% instances), ro-feat/NumForm (5476; 99% instances), ro-feat/Gender (901; 16% instances), ro-feat/Case (456; 8% instances), ro-feat/Definite (453; 8% instances), ro-feat/PronType (48; 1% instances)

NUM occurs with 14 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, NumForm=Digit, NumForm=Roman, NumForm=Word, NumType=Card, NumType=Ord, Number=Plur, Number=Sing, PronType=Tot

NUM occurs with 35 feature combinations. The most frequent feature combination is Number=Plur|NumForm=Digit|NumType=Card (3959 tokens). Examples: 1, 2, 3, 4, 5, 6, 7, 8, 2004, 10

Relations

NUM nodes are attached to their parents using 27 different relations: ro-dep/nummod (2907; 52% instances), ro-dep/nmod (762; 14% instances), ro-dep/parataxis (742; 13% instances), ro-dep/amod (292; 5% instances), ro-dep/conj (288; 5% instances), ro-dep/nmod:tmod (174; 3% instances), ro-dep/goeswith (83; 1% instances), ro-dep/nsubj (69; 1% instances), ro-dep/compound (51; 1% instances), ro-dep/dobj (29; 1% instances), ro-dep/mwe (26; 0% instances), ro-dep/root (22; 0% instances), ro-dep/nmod:pmod (19; 0% instances), ro-dep/appos (17; 0% instances), ro-dep/nsubjpass (12; 0% instances), ro-dep/remnant (9; 0% instances), ro-dep/xcomp (8; 0% instances), ro-dep/punct (7; 0% instances), ro-dep/advcl (5; 0% instances), ro-dep/acl (4; 0% instances), ro-dep/dep (4; 0% instances), ro-dep/advmod (2; 0% instances), ro-dep/ccomp (2; 0% instances), ro-dep/name (2; 0% instances), ro-dep/csubj (1; 0% instances), ro-dep/foreign (1; 0% instances), ro-dep/iobj (1; 0% instances)

Parents of NUM nodes belong to 15 different parts of speech: NOUN (3592; 65% instances), VERB (1095; 20% instances), NUM (476; 9% instances), PROPN (82; 1% instances), ADV (77; 1% instances), ADJ (63; 1% instances), SYM (45; 1% instances), ADP (22; 0% instances), ROOT (22; 0% instances), PRON (17; 0% instances), PUNCT (16; 0% instances), DET (15; 0% instances), X (8; 0% instances), AUX (7; 0% instances), CONJ (2; 0% instances)

3169 (57%) NUM nodes are leaves.

1042 (19%) NUM nodes have one child.

1062 (19%) NUM nodes have two children.

266 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 16.

Children of NUM nodes are attached using 24 different relations: ro-dep/punct (1824; 43% instances), ro-dep/case (853; 20% instances), ro-dep/det (334; 8% instances), ro-dep/conj (294; 7% instances), ro-dep/advmod (218; 5% instances), ro-dep/cc (171; 4% instances), ro-dep/nmod (133; 3% instances), ro-dep/goeswith (81; 2% instances), ro-dep/nummod (66; 2% instances), ro-dep/compound (57; 1% instances), ro-dep/cop (30; 1% instances), ro-dep/nsubj (28; 1% instances), ro-dep/amod (26; 1% instances), ro-dep/appos (26; 1% instances), ro-dep/acl (22; 1% instances), ro-dep/mark (16; 0% instances), ro-dep/mwe (16; 0% instances), ro-dep/dep (11; 0% instances), ro-dep/remnant (9; 0% instances), ro-dep/parataxis (8; 0% instances), ro-dep/advcl (6; 0% instances), ro-dep/aux (6; 0% instances), ro-dep/nmod:pmod (2; 0% instances), ro-dep/neg (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: PUNCT (1779; 42% instances), ADP (868; 20% instances), NUM (476; 11% instances), DET (388; 9% instances), CONJ (184; 4% instances), NOUN (169; 4% instances), ADV (157; 4% instances), SYM (66; 2% instances), VERB (62; 1% instances), PRON (30; 1% instances), ADJ (21; 0% instances), PROPN (14; 0% instances), SCONJ (13; 0% instances), AUX (6; 0% instances), X (3; 0% instances), PART (2; 0% instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]