Treebank Statistics: UD_Romanian-ArT: POS Tags: NOUN
There are 60 NOUN lemmas (25%), 67 NOUN types (21%) and 79 NOUN tokens (14%).
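The three counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function name `count_upos` is hypothetical, and it assumes that a "type" is a distinct, case-sensitive word form. The toy rows reuse word forms listed on this page, but the sentence itself, the placeholder verb `x` and the HEAD/DEPREL values are invented for illustration.

```python
def count_upos(conllu_text, upos="NOUN"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return len(lemmas), len(types), tokens


# Toy data for illustration only; HEAD and DEPREL are dummy values.
TOY_ROWS = [
    ("1", "feata", "feată", "NOUN"),
    ("2", "x", "x", "VERB"),
    ("3", "casă", "casă", "NOUN"),
    ("4", "feată", "feată", "NOUN"),
]
TOY_CONLLU = "# text = toy sentence\n" + "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", "0", "dep", "_", "_"])
    for i, form, lemma, tag in TOY_ROWS
)

print(count_upos(TOY_CONLLU))  # (2, 3, 3)
```

Run on the full treebank, a function like this would be expected to return the lemma, type and token counts reported above, provided the page uses the same definition of "type".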
Out of 14 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
The 10 most frequent NOUN lemmas: feată, casă, fiĉor, amiră, an, arap, gardu, lamńe, lucru, mer
The 10 most frequent NOUN types: casă, feata, fiĉorlu, ańi, gardu, lamńa, lucru, oară, ќiro, Araplu
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN is 1.116667 (the average of all parts of speech is 1.341667).
The 1st highest number of forms (3) was observed with the lemma “feată”: feata, feată, featăľei.
The 2nd highest number of forms (2) was observed with the lemma “amiră”: amirălu, amirălui.
The 3rd highest number of forms (2) was observed with the lemma “arap”: Araplu, arap.
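The reported ratio is consistent with dividing the 67 NOUN types by the 60 NOUN lemmas. The sketch below shows one way to derive both the ratio and the ranking of lemmas by number of forms from (lemma, form) pairs. The helper names `forms_by_lemma` and `form_lemma_ratio` are hypothetical, and the input contains only the three lemmas listed above, so the sample ratio differs from the treebank-wide figure.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Group distinct word forms under their lemma."""
    forms = defaultdict(set)
    for lemma, form in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(forms):
    """Average number of distinct forms per lemma."""
    n_forms = sum(len(s) for s in forms.values())
    return n_forms / len(forms)


# The (lemma, form) pairs listed on this page for the three top lemmas.
PAIRS = [
    ("feată", "feata"), ("feată", "feată"), ("feată", "featăľei"),
    ("amiră", "amirălu"), ("amiră", "amirălui"),
    ("arap", "Araplu"), ("arap", "arap"),
]

forms = forms_by_lemma(PAIRS)
# Rank by number of forms (descending); break ties alphabetically.
ranked = sorted(forms, key=lambda lemma: (-len(forms[lemma]), lemma))
print(ranked)               # ['feată', 'amiră', 'arap']
print(round(67 / 60, 6))    # 1.116667, matching the reported NOUN ratio
```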
NOUN occurs with 4 features: Definite (78; 99% instances), Gender (78; 99% instances), Number (78; 99% instances), Case (63; 80% instances)
NOUN occurs with 9 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Case=Voc, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing
NOUN occurs with 13 feature combinations.
The most frequent feature combination is Case=Acc,Nom|Definite=Ind|Gender=Fem|Number=Sing (19 tokens).
Examples: casă, oară, apă, broască, coadă, căloari, cătuşe, feată, harao, hărýie
Relations
NOUN nodes are attached to their parents using 10 different relations: nsubj (25; 32% instances), obl (21; 27% instances), obj (16; 20% instances), nmod (7; 9% instances), conj (2; 3% instances), iobj (2; 3% instances), vocative (2; 3% instances), xcomp (2; 3% instances), fixed (1; 1% instances), obl:pmod (1; 1% instances)
Parents of NOUN nodes belong to 4 different parts of speech: VERB (69; 87% instances), NOUN (8; 10% instances), ADJ (1; 1% instances), DET (1; 1% instances)
31 (39%) NOUN nodes are leaves.
30 (38%) NOUN nodes have one child.
13 (16%) NOUN nodes have two children.
5 (6%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 3.
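The child-degree figures above can be computed from the HEAD column of each sentence. The sketch below is illustrative only: the function name `child_degree_histogram` and the toy sentence are hypothetical. It also checks the reported figures for internal consistency. Because the highest degree is 3, the "three or more" bucket contains only nodes with exactly three children, so the buckets imply 30·1 + 13·2 + 5·3 = 71 children in total, spread over 79 NOUN nodes.

```python
from collections import Counter


def child_degree_histogram(heads, is_target):
    """Map child count -> number of target nodes with that many children.

    heads[i] is the 1-based head index of token i+1 (0 for the root);
    is_target[i] says whether token i+1 has the tag of interest.
    """
    n_children = Counter(h for h in heads if h != 0)
    hist = Counter()
    for idx, flag in enumerate(is_target, start=1):
        if flag:
            hist[n_children[idx]] += 1  # missing keys count as 0 children
    return hist


# Toy sentence: NOUN(1) -> VERB(2, root) <- NOUN(3) <- ADP(4)
toy_heads = [2, 0, 2, 3]
toy_is_noun = [True, False, True, False]
toy_hist = child_degree_histogram(toy_heads, toy_is_noun)
print(dict(toy_hist))  # {0: 1, 1: 1}

# Figures reported on this page (degree -> number of NOUN nodes).
reported = {0: 31, 1: 30, 2: 13, 3: 5}
total_nodes = sum(reported.values())
total_children = sum(degree * n for degree, n in reported.items())
print(total_nodes, total_children)  # 79 71
```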
Children of NOUN nodes are attached using 11 different relations: case (27; 38% instances), det (14; 20% instances), amod (7; 10% instances), nmod (7; 10% instances), punct (7; 10% instances), acl (3; 4% instances), nummod (2; 3% instances), advmod (1; 1% instances), cc (1; 1% instances), discourse (1; 1% instances), nsubj (1; 1% instances)
Children of NOUN nodes belong to 10 different parts of speech: ADP (27; 38% instances), DET (14; 20% instances), NOUN (8; 11% instances), PUNCT (7; 10% instances), ADJ (6; 8% instances), NUM (3; 4% instances), VERB (3; 4% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), INTJ (1; 1% instances)