PROPN: proper noun
This document is a placeholder for the language-specific documentation
for PROPN.
Treebank Statistics (UD_Romanian)
There are 319 PROPN lemmas (8%), 319 PROPN types (7%) and 462 PROPN tokens (4%).
Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 9 in number of tokens.
The 10 most frequent PROPN lemmas: România, României, București, Iași, Nicolina, Moldova, Moldovei, Roman, Arghezi, Copou
The 10 most frequent PROPN types: România, României, București, Iași, Nicolina, Moldova, Moldovei, Roman, Arghezi, Copou
The 10 most frequent ambiguous lemmas: Aurel (PROPN 1, ADJ 1), CV (NOUN 1, PROPN 1), Galata (NOUN 2, PROPN 1), durere (NOUN 1, PROPN 1), nou (ADJ 7, PROPN 1), om (NOUN 16, PROPN 1), problemă (NOUN 1, PROPN 1), scrimă (NOUN 2, PROPN 1), vechi (ADJ 6, PROPN 1), Șorogari (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types: Aurel (PROPN 1, ADJ 1), CV (NOUN 1, PROPN 1), Galata (NOUN 2, PROPN 1), nou (ADJ 4, PROPN 1), oamenilor (NOUN 4, PROPN 1), vechi (ADJ 6, PROPN 1), Șorogari (NOUN 1, PROPN 1)
- Aurel
- PROPN 1: cele efectuate de Pimen R. Constantinescu din Canti de Leopardi ori acelea realizate de Aurel Tita din Stéphane Mallarmé și din Paul Valéry .
- ADJ 1: Aviația militară română a luat ființă în anul 1938 datorită colaborării societății civile cu Ministerul de Război , iar primul avion militar de concepție și construcție românească , proiectat de inginerul aviator Aurel Vlaicu și realizat la Arsenalul Armatei , a zburat la 1938 iunie 1938 .
- CV
- NOUN 1: 1.1 . CV -ul Europass se elaborează pornind de_la formatul european comun pentru curriculum vitae ( CV ) propus prin Recomandarea 2002/236/CE .
- PROPN 1: 1.1 . CV -ul Europass se elaborează pornind de_la formatul european comun pentru curriculum vitae ( CV ) propus prin Recomandarea 2002/236/CE .
- Galata
- NOUN 2: Principalele coline sunt Copou , Cetățuia , Tătărași și Galata .
- PROPN 1: Prin extinderea lui , Iașul este legendara urbe a celor 1938 coline Cetățuia , Galata , Copou , Bucium-Păun , Șorogari , Repedea și Breazu , cu altitudini variind între 1938 m în Lunca Bahluiului și 1938 m pe Dealul Păun și Dealul Repedea .
- nou
- oamenilor
- vechi
- Șorogari
- NOUN 1: Orașul mai este traversat de râul Nicolina și de pârâul Șorogari ( numit în evul mediu Cacaina , deoarece aici se aruncau gunoaiele ) ; la răsărit de oraș , curge pârâul Ciric , pe care sunt create artificial trei lacuri cu scop de agrement .
- PROPN 1: Prin extinderea lui , Iașul este legendara urbe a celor 1938 coline Cetățuia , Galata , Copou , Bucium-Păun , Șorogari , Repedea și Breazu , cu altitudini variind între 1938 m în Lunca Bahluiului și 1938 m pe Dealul Păun și Dealul Repedea .
Morphology
The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.157060).
The 1st highest number of forms (1) was observed with the lemma “AB4”: AB4.
The 2nd highest number of forms (1) was observed with the lemma “AG36”: AG36.
The 3rd highest number of forms (1) was observed with the lemma “AIEA”: AIEA.
PROPN occurs with 4 features: ro-feat/Definite (54; 12% instances), ro-feat/Gender (54; 12% instances), ro-feat/Number (54; 12% instances), ro-feat/Case (53; 11% instances)
PROPN occurs with 8 feature-value pairs: Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing
PROPN occurs with 9 feature combinations.
The most frequent feature combination is _ (408 tokens).
Examples: România, București, Iași, Nicolina, Moldova, Roman, Arghezi, Copou, Iliescu, Mircea
Relations
PROPN nodes are attached to their parents using 22 different relations: ro-dep/nmod (181; 39% instances), ro-dep/nsubj (78; 17% instances), ro-dep/name (66; 14% instances), ro-dep/conj (56; 12% instances), ro-dep/appos (15; 3% instances), ro-dep/dobj (12; 3% instances), ro-dep/iobj (11; 2% instances), ro-dep/foreign (7; 2% instances), ro-dep/nmod:agent (6; 1% instances), ro-dep/remnant (6; 1% instances), ro-dep/nmod:pmod (5; 1% instances), ro-dep/root (4; 1% instances), ro-dep/amod (3; 1% instances), ro-dep/mwe (3; 1% instances), ro-dep/nsubjpass (2; 0% instances), ro-dep/acl (1; 0% instances), ro-dep/advmod (1; 0% instances), ro-dep/list (1; 0% instances), ro-dep/nmod:tmod (1; 0% instances), ro-dep/parataxis (1; 0% instances), ro-dep/vocative (1; 0% instances), ro-dep/xcomp (1; 0% instances)
Parents of PROPN nodes belong to 12 different parts of speech: NOUN (204; 44% instances), VERB (117; 25% instances), PROPN (110; 24% instances), ADJ (17; 4% instances), NUM (4; 1% instances), ROOT (4; 1% instances), ADP (1; 0% instances), AUX (1; 0% instances), CONJ (1; 0% instances), DET (1; 0% instances), PRON (1; 0% instances), PUNCT (1; 0% instances)
241 (52%) PROPN nodes are leaves.
135 (29%) PROPN nodes have one child.
35 (8%) PROPN nodes have two children.
51 (11%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 15.
Children of PROPN nodes are attached using 19 different relations: ro-dep/punct (97; 22% instances), ro-dep/case (86; 19% instances), ro-dep/name (74; 16% instances), ro-dep/conj (62; 14% instances), ro-dep/det (37; 8% instances), ro-dep/cc (27; 6% instances), ro-dep/nmod (17; 4% instances), ro-dep/appos (10; 2% instances), ro-dep/foreign (8; 2% instances), ro-dep/advmod (5; 1% instances), ro-dep/amod (5; 1% instances), ro-dep/remnant (5; 1% instances), ro-dep/nummod (4; 1% instances), ro-dep/list (3; 1% instances), ro-dep/acl (2; 0% instances), ro-dep/cop (2; 0% instances), ro-dep/nsubj (2; 0% instances), ro-dep/parataxis (2; 0% instances), ro-dep/nmod:tmod (1; 0% instances)
Children of PROPN nodes belong to 14 different parts of speech: PROPN (110; 24% instances), PUNCT (94; 21% instances), ADP (87; 19% instances), NOUN (54; 12% instances), DET (35; 8% instances), CONJ (27; 6% instances), NUM (13; 3% instances), ADJ (9; 2% instances), VERB (6; 1% instances), ADV (5; 1% instances), PRON (3; 1% instances), SYM (3; 1% instances), X (2; 0% instances), AUX (1; 0% instances)
PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]