
This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-ArT: POS Tags: NUM

There are 4 NUM lemmas (2%), 5 NUM types (2%) and 6 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: un, doi, trei, unsprăyinģiţĺi

The 10 most frequent NUM types: ună, doľi, nă, treiľi, unsprăyinģiţĺi

The 10 most frequent ambiguous lemmas: un (DET 8, NUM 3)

The 10 most frequent ambiguous types: (DET 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.250000 (the average of all parts of speech is 1.341667).
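The ratio above can be reproduced from the counts reported earlier on this page. The following is a minimal sketch, assuming the ratio is the number of distinct NUM word forms (types) divided by the number of distinct NUM lemmas; the function name `form_lemma_ratio` is hypothetical and is not taken from the tool that generated these statistics.

```python
# Counts as reported on this page: 5 distinct NUM types (word forms)
# and 4 distinct NUM lemmas.
NUM_TYPES = 5
NUM_LEMMAS = 4


def form_lemma_ratio(n_forms, n_lemmas):
    """Return distinct forms per distinct lemma (hypothetical helper)."""
    if n_lemmas == 0:
        raise ValueError("cannot compute a ratio with zero lemmas")
    return n_forms / n_lemmas


ratio = form_lemma_ratio(NUM_TYPES, NUM_LEMMAS)
print(f"{ratio:.6f}")  # 1.250000
```

Under this assumption, the value agrees with the per-lemma breakdown: "un" contributes two forms and each of the other three lemmas contributes one, giving 5 forms over 4 lemmas.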

The 1st highest number of forms (2) was observed with the lemma “un”: nă, ună.

The 2nd highest number of forms (1) was observed with the lemma “doi”: doľi.

The 3rd highest number of forms (1) was observed with the lemma “trei”: treiľi.

NUM occurs with 6 features: NumForm (6; 100% instances), NumType (6; 100% instances), Number (6; 100% instances), Case (5; 83% instances), Gender (5; 83% instances), Definite (4; 67% instances)

NUM occurs with 8 feature-value pairs: Case=Acc,Nom, Definite=Def, Gender=Fem, Gender=Masc, NumForm=Word, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 4 feature combinations. The most frequent feature combination is Case=Acc,Nom|Definite=Def|Gender=Fem|Number=Sing|NumForm=Word|NumType=Card (2 tokens). Examples: ună
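A feature combination such as the one above is the content of the FEATS column in a CoNLL-U file: pairs of the form Feature=Value joined by "|", with an underscore standing for "no features". The sketch below splits such a string into a dictionary; `parse_feats` is a hypothetical helper written for illustration, not part of any UD tool. It keeps a multi-valued entry such as Case=Acc,Nom as a single value, matching how this page counts it as one feature-value pair.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    Hypothetical helper: multi-valued features (e.g. "Acc,Nom") are
    kept as one string rather than split on the comma.
    """
    if feats == "_":  # underscore marks an empty FEATS column
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Case=Acc,Nom|Definite=Def|Gender=Fem|Number=Sing|NumForm=Word|NumType=Card"
features = parse_feats(combo)
print(len(features))      # 6
print(features["Case"])   # Acc,Nom
```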

Relations

NUM nodes are attached to their parents using 2 different relations: nummod (5; 83% instances), amod (1; 17% instances)
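The percentages in these relation summaries follow directly from the counts. As an illustration, the sketch below rebuilds the line above from a list of relation labels. The list is reconstructed from the reported counts (5 nummod, 1 amod), not read from the treebank itself, and `relation_shares` is a hypothetical helper name. Rounding to the nearest whole percent is an assumption that happens to reproduce the figures shown.

```python
from collections import Counter


def relation_shares(deprels):
    """Return (relation, count, rounded percent) tuples, most frequent first."""
    counts = Counter(deprels)
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Reconstructed from the counts on this page, not from the actual CoNLL-U data.
observed = ["nummod"] * 5 + ["amod"]

for rel, n, pct in relation_shares(observed):
    print(f"{rel} ({n}; {pct}% instances)")
```

The same calculation applies to the parent and child part-of-speech distributions listed in this section.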

Parents of NUM nodes belong to 3 different parts of speech: NOUN (3; 50% instances), VERB (2; 33% instances), PRON (1; 17% instances)

3 (50%) NUM nodes are leaves.

3 (50%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 2 different relations: case (2; 67% instances), advmod (1; 33% instances)

Children of NUM nodes belong to 2 different parts of speech: ADP (2; 67% instances), ADV (1; 33% instances)