home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-ArT: POS Tags: NOUN

There are 60 NOUN lemmas (25%), 67 NOUN types (21%) and 79 NOUN tokens (14%). Out of 14 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: feată, casă, fiĉor, amiră, an, arap, gardu, lamńe, lucru, mer

The 10 most frequent NOUN types: casă, feata, fiĉorlu, ańi, gardu, lamńa, lucru, oară, ќiro, Araplu

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.116667 (the average of all parts of speech is 1.341667).

The 1st highest number of forms (3) was observed with the lemma “feată”: feata, feată, featăľei.

The 2nd highest number of forms (2) was observed with the lemma “amiră”: amirălu, amirălui.

The 3rd highest number of forms (2) was observed with the lemma “arap”: Araplu, arap.

NOUN occurs with 4 features: Definite (78; 99% instances), Gender (78; 99% instances), Number (78; 99% instances), Case (63; 80% instances)

NOUN occurs with 9 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Case=Voc, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

NOUN occurs with 13 feature combinations. The most frequent feature combination is Case=Acc,Nom|Definite=Ind|Gender=Fem|Number=Sing (19 tokens). Examples: casă, oară, apă, broască, coadă, căloari, cătuşe, feată, harao, hărýie

Relations

NOUN nodes are attached to their parents using 10 different relations: nsubj (25; 32% instances), obl (21; 27% instances), obj (16; 20% instances), nmod (7; 9% instances), conj (2; 3% instances), iobj (2; 3% instances), vocative (2; 3% instances), xcomp (2; 3% instances), fixed (1; 1% instances), obl:pmod (1; 1% instances)

Parents of NOUN nodes belong to 4 different parts of speech: VERB (69; 87% instances), NOUN (8; 10% instances), ADJ (1; 1% instances), DET (1; 1% instances)

31 (39%) NOUN nodes are leaves.

30 (38%) NOUN nodes have one child.

13 (16%) NOUN nodes have two children.

5 (6%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 3.

Children of NOUN nodes are attached using 11 different relations: case (27; 38% instances), det (14; 20% instances), amod (7; 10% instances), nmod (7; 10% instances), punct (7; 10% instances), acl (3; 4% instances), nummod (2; 3% instances), advmod (1; 1% instances), cc (1; 1% instances), discourse (1; 1% instances), nsubj (1; 1% instances)

Children of NOUN nodes belong to 10 different parts of speech: ADP (27; 38% instances), DET (14; 20% instances), NOUN (8; 11% instances), PUNCT (7; 10% instances), ADJ (6; 8% instances), NUM (3; 4% instances), VERB (3; 4% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), INTJ (1; 1% instances)