Treebank Statistics: UD_Latvian-LVTB: POS Tags: PROPN
There are 4398 PROPN
lemmas (18%), 5864 PROPN
types (11%) and 13978 PROPN
tokens (4%).
Out of 17 observed tags, the rank of PROPN
is: 2 in number of lemmas, 4 in number of types and 8 in number of tokens.
The 10 most frequent PROPN
lemmas: Latvija, Rīga, Eiropa, Krievija, ES, Sofija, Saeima, Baltija, Vācija, LETA
The 10 most frequent PROPN
types: Latvijas, Latvijā, Eiropas, Rīgas, ES, Krievijas, LETA, Baltijas, A., Rīgā
The 10 most frequent ambiguous lemmas: v. (PROPN 12, NOUN 1), g. (NOUN 11, PROPN 4), FM (PROPN 10, X 1), V (PROPN 6, NUM 3), BKUS (PROPN 3, NOUN 1), Veho (PROPN 2, X 1), ZC (PROPN 2, NOUN 1), A (SYM 12, PROPN 1, X 1), AB (PROPN 1, X 1), AB.LV (PROPN 1, SYM 1)
The 10 most frequent ambiguous types: M. (PROPN 32, NOUN 1, X 1), D. (PROPN 29, X 1), Satversmes (PROPN 26, NOUN 1), Saules (PROPN 26, NOUN 2), Mēness (PROPN 15, NOUN 2), FM (PROPN 10, X 1), Jūrmalā (PROPN 10, NOUN 1), airBaltic (PROPN 10, X 1), vilnis (NOUN 5, PROPN 2), Zemes (NOUN 10, PROPN 9)
- M.
- PROPN 32: Padomes vārdā — priekšsēdētājs M. Fischer Boel
- NOUN 1: RJA lektors LL. M. ( cilvēktiesību maģistrs ) MĀRTIŅŠ MITS :
- X 1: Seriālā Viņi atgriežas pirmo reizi kā aktieris debitē Džeikoba atveidotājs Lendons Gimenezs ( Landon Gimenez ) , savukārt zināmākie aktieri šajā seriālā būs Mārtija lomas atveidotājs Omars Eps ( Omar Epps ) , kurš Latvijas skatītājiem ir pazīstams pēc Dr. Ērika Formana lomas seriālā Doktors Hauss ( House M. D. ) , Lusillas tēlā iejūtas aktrise Francesa Fišere ( Frances Fisher ) , kura iepriekš ir redzēta arī Latvijā rādītajos seriālos Slepkavība ( The Killing ) un Saikne ( Touch ) , savukārt Megijas lomu spēlē aktrise Devina Kellija ( Devin Kelley ) , kura ir spēlējusi seriālā Slepenie sakari ( Covert affairs ) un Čikāgas kodeks ( Chicago code ) .
- D.
- PROPN 29: Džons D. Rokfellers savas iespējas saskatīja naftā .
- X 1: Seriālā Viņi atgriežas pirmo reizi kā aktieris debitē Džeikoba atveidotājs Lendons Gimenezs ( Landon Gimenez ) , savukārt zināmākie aktieri šajā seriālā būs Mārtija lomas atveidotājs Omars Eps ( Omar Epps ) , kurš Latvijas skatītājiem ir pazīstams pēc Dr. Ērika Formana lomas seriālā Doktors Hauss ( House M. D. ) , Lusillas tēlā iejūtas aktrise Francesa Fišere ( Frances Fisher ) , kura iepriekš ir redzēta arī Latvijā rādītajos seriālos Slepkavība ( The Killing ) un Saikne ( Touch ) , savukārt Megijas lomu spēlē aktrise Devina Kellija ( Devin Kelley ) , kura ir spēlējusi seriālā Slepenie sakari ( Covert affairs ) un Čikāgas kodeks ( Chicago code ) .
- Satversmes
- Saules
- Mēness
- FM
- Jūrmalā
- airBaltic
- vilnis
- Zemes
Morphology
The form / lemma ratio of PROPN
is 1.333333 (the average of all parts of speech is 2.339090).
The 1st highest number of forms (8) was observed with the lemma “Jānis”: JĀNIS, Jāni, Jānim, Jānis, Jāņa, Jāņi, Jāņiem, Jāņu.
The 2nd highest number of forms (6) was observed with the lemma “Eiropa”: EIROPAS, Eiropa, Eiropai, Eiropas, Eiropu, Eiropā.
The 3rd highest number of forms (6) was observed with the lemma “Rīga”: RĪGAS, Rīga, Rīgai, Rīgas, Rīgu, Rīgā.
PROPN
occurs with 5 features: Gender (12061; 86% instances), Case (11899; 85% instances), Number (11899; 85% instances), Abbr (1374; 10% instances), Typo (18; 0% instances)
PROPN
occurs with 13 feature-value pairs: Abbr=Yes
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Ptan
, Number=Sing
, Typo=Yes
PROPN
occurs with 46 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing
(3103 tokens).
Examples: Latvijas, Eiropas, Rīgas, Krievijas, Baltijas, Saeimas, Jelgavas, Liepājas, Bauskas, Lietuvas
Relations
PROPN
nodes are attached to their parents using 22 different relations: nmod (4670; 33% instances), nsubj (2950; 21% instances), flat:name (2186; 16% instances), obl (1422; 10% instances), conj (1151; 8% instances), iobj (416; 3% instances), obj (308; 2% instances), parataxis (264; 2% instances), root (182; 1% instances), appos (103; 1% instances), nsubj:pass (88; 1% instances), discourse (47; 0% instances), vocative (39; 0% instances), acl (37; 0% instances), dep (31; 0% instances), orphan (28; 0% instances), xcomp (22; 0% instances), advcl (18; 0% instances), ccomp (10; 0% instances), flat (4; 0% instances), amod (1; 0% instances), csubj (1; 0% instances)
Parents of PROPN
nodes belong to 16 different parts of speech: NOUN (5097; 36% instances), VERB (5019; 36% instances), PROPN (3323; 24% instances), (182; 1% instances), ADJ (126; 1% instances), ADV (66; 0% instances), X (56; 0% instances), NUM (41; 0% instances), PRON (28; 0% instances), DET (23; 0% instances), AUX (6; 0% instances), INTJ (4; 0% instances), SYM (4; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances), PUNCT (1; 0% instances)
8619 (62%) PROPN
nodes are leaves.
2505 (18%) PROPN
nodes have one child.
1639 (12%) PROPN
nodes have two children.
1215 (9%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 34.
Children of PROPN
nodes are attached using 25 different relations: punct (2626; 25% instances), flat:name (2252; 22% instances), nmod (1727; 17% instances), conj (1207; 12% instances), case (777; 7% instances), cc (568; 5% instances), acl (231; 2% instances), parataxis (213; 2% instances), advmod:emph (155; 1% instances), amod (150; 1% instances), appos (75; 1% instances), orphan (60; 1% instances), det (52; 1% instances), discourse (42; 0% instances), dep (41; 0% instances), advmod (40; 0% instances), nsubj (39; 0% instances), mark (37; 0% instances), cop (36; 0% instances), advcl (20; 0% instances), obl (18; 0% instances), flat (7; 0% instances), nummod (6; 0% instances), iobj (2; 0% instances), aux (1; 0% instances)
Children of PROPN
nodes belong to 17 different parts of speech: PROPN (3323; 32% instances), PUNCT (2626; 25% instances), NOUN (1998; 19% instances), ADP (762; 7% instances), CCONJ (560; 5% instances), VERB (280; 3% instances), PART (169; 2% instances), ADJ (147; 1% instances), X (105; 1% instances), NUM (99; 1% instances), ADV (91; 1% instances), DET (69; 1% instances), SYM (56; 1% instances), SCONJ (42; 0% instances), AUX (37; 0% instances), PRON (17; 0% instances), INTJ (1; 0% instances)