home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-TueCL: POS Tags: PROPN

There are 58 PROPN lemmas (5%), 62 PROPN types (4%) and 72 PROPN tokens (2%). Out of 17 observed tags, the rank of PROPN is: 6 in number of lemmas, 7 in number of types and 14 in number of tokens.

The 10 most frequent PROPN lemmas: România, Mirela, Vaida, Diletta, Franța, Irinel, Maria, @KlausIohannis, @Utilizator_x3, ALEXANDRA

The 10 most frequent PROPN types: România, Mirela, Vaida, Irinel, Maria, @KlausIohannis, @Utilizator_x3, ALEXANDRA, Africa, Alex

The 10 most frequent ambiguous lemmas: domn (NOUN 2, PROPN 1), mare (ADJ 4, PROPN 1)

The 10 most frequent ambiguous types: Doamne (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.068966 (the average of all parts of speech is 1.367279).

The 1st highest number of forms (3) was observed with the lemma “România”: Ro, România, romania.

The 2nd highest number of forms (2) was observed with the lemma “Diletta”: Diletta, Dilettei.

The 3rd highest number of forms (2) was observed with the lemma “Franța”: Franta, Franța.

PROPN occurs with 6 features: Number (7; 10% instances), Case (6; 8% instances), Definite (6; 8% instances), Foreign (6; 8% instances), Gender (6; 8% instances), Typo (3; 4% instances)

PROPN occurs with 10 feature-value pairs: Case=Acc,Nom, Case=Dat,Gen, Case=Voc, Definite=Def, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Typo=Yes

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (56 tokens). Examples: România, Mirela, Vaida, Irinel, Maria, @KlausIohannis, @Utilizator_x3, ALEXANDRA, Africa, Alex

Relations

PROPN nodes are attached to their parents using 11 different relations: flat (18; 25% instances), obl (15; 21% instances), nmod (14; 19% instances), nsubj (9; 13% instances), obj (4; 6% instances), vocative (4; 6% instances), appos (2; 3% instances), nsubj:pass (2; 3% instances), obl:agent (2; 3% instances), conj (1; 1% instances), root (1; 1% instances)

Parents of PROPN nodes belong to 7 different parts of speech: VERB (31; 43% instances), PROPN (19; 26% instances), NOUN (16; 22% instances), ADJ (2; 3% instances), PRON (2; 3% instances), AUX (1; 1% instances), (1; 1% instances)

27 (38%) PROPN nodes are leaves.

29 (40%) PROPN nodes have one child.

14 (19%) PROPN nodes have two children.

2 (3%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 11 different relations: case (30; 44% instances), flat (24; 35% instances), advmod (2; 3% instances), appos (2; 3% instances), conj (2; 3% instances), det (2; 3% instances), punct (2; 3% instances), cc (1; 1% instances), cop (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances)

Children of PROPN nodes belong to 8 different parts of speech: ADP (30; 44% instances), PROPN (19; 28% instances), NOUN (11; 16% instances), ADV (2; 3% instances), DET (2; 3% instances), PUNCT (2; 3% instances), AUX (1; 1% instances), CCONJ (1; 1% instances)