home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: PROPN

There are 559 PROPN lemmas (6%), 702 PROPN types (4%) and 1593 PROPN tokens (2%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 11 in number of tokens.

The 10 most frequent PROPN lemmas: Lietuva, Europa, Vilnius, Kaunas, Šengenas, Kalėdos, Marcinkevičienė, Glaveckas, Mažuolis, Rusija

The 10 most frequent PROPN types: Lietuvos, Europos, Lietuvoje, Kauno, Vilniaus, Lietuva, Šengeno, Lietuvai, Vilnius, Kalėdų

The 10 most frequent ambiguous lemmas: Kalėdos (PROPN 14, NOUN 1), Iglesias (PROPN 1, X 1), klausimas (NOUN 63, PROPN 1)

The 10 most frequent ambiguous types: Kalėdų (PROPN 10, NOUN 1), Gintaras (PROPN 2, NOUN 1), Greta (PROPN 2, ADP 1), Iglesias (PROPN 1, X 1), Seimo (NOUN 28, PROPN 1), Vidutinė (ADJ 1, PROPN 1), klausimai (NOUN 8, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.255814 (the average of all parts of speech is 2.065341).

The 1st highest number of forms (8) was observed with the lemma “Lietuva”: LIETUVA, LIETUVOJE, LIETUVOS, Lietuva, Lietuvai, Lietuvoje, Lietuvos, Lietuvą.

The 2nd highest number of forms (5) was observed with the lemma “Europa”: EUROPOS, Europa, Europoje, Europos, Europą.

The 3rd highest number of forms (5) was observed with the lemma “Glaveckas”: GLAVECKAS, Glaveckas, Glavecko, Glavecku, Glavecką.

PROPN occurs with 3 features: Gender (1574; 99% instances), Number (1573; 99% instances), Case (1572; 99% instances)

PROPN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 22 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing (522 tokens). Examples: Lietuvos, Europos, LIETUVOS, EUROPOS, Marcinkevičienės, Rusijos, Baltijos, Trejybės, Latvijos, Prancūzijos

Relations

PROPN nodes are attached to their parents using 14 different relations: nmod (883; 55% instances), obl (146; 9% instances), nsubj (141; 9% instances), conj (126; 8% instances), flat (110; 7% instances), obl:arg (79; 5% instances), root (73; 5% instances), obj (20; 1% instances), parataxis (9; 1% instances), nsubj:pass (2; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: NOUN (976; 61% instances), VERB (264; 17% instances), PROPN (245; 15% instances), (73; 5% instances), ADJ (15; 1% instances), X (12; 1% instances), ADV (3; 0% instances), PRON (3; 0% instances), DET (1; 0% instances), PART (1; 0% instances)

1057 (66%) PROPN nodes are leaves.

312 (20%) PROPN nodes have one child.

140 (9%) PROPN nodes have two children.

84 (5%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 8.

Children of PROPN nodes are attached using 21 different relations: nmod (314; 35% instances), punct (230; 26% instances), conj (142; 16% instances), case (59; 7% instances), cc (53; 6% instances), advmod:emph (18; 2% instances), det (14; 2% instances), acl (13; 1% instances), amod (9; 1% instances), nsubj (8; 1% instances), appos (7; 1% instances), acl:relcl (6; 1% instances), mark (6; 1% instances), obl (5; 1% instances), cop (3; 0% instances), flat (3; 0% instances), obl:arg (3; 0% instances), advmod (2; 0% instances), parataxis (2; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (245; 27% instances), PUNCT (230; 26% instances), X (146; 16% instances), NOUN (75; 8% instances), ADP (59; 7% instances), CCONJ (53; 6% instances), VERB (22; 2% instances), PART (18; 2% instances), DET (14; 2% instances), PRON (11; 1% instances), ADJ (10; 1% instances), SCONJ (6; 1% instances), NUM (5; 1% instances), AUX (3; 0% instances), ADV (2; 0% instances)