
This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-GC: POS Tags: PROPN

There are 3306 PROPN lemmas (22%), 3606 PROPN types (17%) and 6125 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.
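As a rough illustration of how counts like these could be reproduced, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. The sample sentence and the function name `count_tag` are hypothetical and not taken from the treebank; the sketch also assumes forms are compared exactly as written, which may differ from how the page normalizes case.

```python
# Hypothetical one-sentence CoNLL-U sample (not from UD_Icelandic-GC).
SAMPLE = """\
1\tJón\tJón\tPROPN\t_\tCase=Nom|Gender=Masc\t2\tnsubj\t_\t_
2\tbýr\tbúa\tVERB\t_\t_\t0\troot\t_\t_
3\tá\tá\tADP\t_\t_\t4\tcase\t_\t_
4\tÍslandi\tÍsland\tPROPN\t_\tCase=Dat|Gender=Neut\t2\tobl\t_\t_
5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def count_tag(conllu: str, tag: str) -> tuple[int, int, int]:
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == tag:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


propn_counts = count_tag(SAMPLE, "PROPN")
print(propn_counts)  # (2, 2, 2)
```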

The 10 most frequent PROPN lemmas: ísland, Trump, Bandaríkin, Reykjavík, Jón, þór, Guðmundur, Ólafur, Alþingi, Katrín

The 10 most frequent PROPN types: Trump, Íslands, Íslandi, þór, Jón, Reykjavík, Ísland, Bandaríkjunum, Guðmundur, Bandaríkjanna

The 10 most frequent ambiguous lemmas: ísland (NOUN 3, PROPN 2), Bandaríkin (PROPN 46, NOUN 12), Reykjavík (PROPN 40, NOUN 11), Alþingi (PROPN 19, NOUN 9), Sjálfstæðisflokkur (PROPN 17, NOUN 6), 2 (NUM 25, PROPN 16), Bretland (PROPN 16, NOUN 7), Facebook (PROPN 15, NOUN 5), Noregur (PROPN 15, NOUN 6), Framsóknarflokkur (PROPN 14, NOUN 1)

The 10 most frequent ambiguous types: Íslands (PROPN 53, NOUN 24), Íslandi (PROPN 30, NOUN 25), Reykjavík (PROPN 25, NOUN 4), Ísland (PROPN 25, NOUN 9), Bandaríkjunum (PROPN 24, NOUN 5), Bandaríkjanna (PROPN 19, NOUN 6), 2 (NUM 25, PROPN 16), Facebook (PROPN 15, NOUN 5), Reykjavíkur (PROPN 15, NOUN 7), RÚV (PROPN 15, NOUN 11)

Morphology

The form / lemma ratio of PROPN is 1.090744 (the average of all parts of speech is 1.434754).
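The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given above (3606 / 3306). A minimal check, assuming that is how the figure is derived (the function name is illustrative only):

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the PROPN summary on this page.
propn_ratio = form_lemma_ratio(3606, 3306)
print(f"{propn_ratio:.6f}")  # 1.090744
```

A value close to 1 means most proper-noun lemmas are attested in only one form, which fits the lower ratio compared with the all-tag average.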

The 1st highest number of forms (5) was observed with the lemma “Framsóknarflokkur”: Framsóknarflokknum, Framsóknarflokks, Framsóknarflokksins, Framsóknarflokkur, Framsóknarflokkurinn.

The 2nd highest number of forms (5) was observed with the lemma “Seðlabanki”: Seðlabanka, Seðlabankans, Seðlabankanum, Seðlabanki, Seðlabankinn.

The 3rd highest number of forms (5) was observed with the lemma “Sjálfstæðisflokkur”: Sjálfstæðisflokknum, Sjálfstæðisflokks, Sjálfstæðisflokksins, Sjálfstæðisflokkur, Sjálfstæðisflokkurinn.

PROPN occurs with 4 features: Case (5158; 84% instances), Gender (4964; 81% instances), Number (2570; 42% instances), Definite (361; 6% instances)

PROPN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 71 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc (1149 tokens). Examples: Trump, þór, Jón, Guðmundur, Ólafur, Sigurður, Björn, Páll, Bjarni, Guðmundsson
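The feature statistics can be derived from the FEATS column of CoNLL-U, where pairs are written as `Name=Value` and joined with `|`. The sketch below uses a small hypothetical list of FEATS strings rather than treebank data, and it assumes an empty FEATS value (`_`) is counted as its own combination, which the page does not confirm.

```python
from collections import Counter


def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats == "_":
        return {}  # no morphological features annotated
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values for four PROPN tokens.
feats_column = [
    "Case=Nom|Gender=Masc",
    "Case=Dat|Gender=Neut",
    "Case=Nom|Gender=Masc",
    "_",
]

feature_counts = Counter()  # how many tokens carry each feature
combo_counts = Counter()    # how many tokens carry each full combination
for feats in feats_column:
    feature_counts.update(parse_feats(feats).keys())
    combo_counts[feats] += 1

print(feature_counts["Case"])       # 3
print(combo_counts.most_common(1))  # [('Case=Nom|Gender=Masc', 2)]
```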

Relations

PROPN nodes are attached to their parents using 14 different relations: nsubj (1522; 25% instances), obl (1492; 24% instances), flat (1465; 24% instances), nmod:poss (793; 13% instances), conj (479; 8% instances), flat:name (131; 2% instances), obj (128; 2% instances), root (43; 1% instances), iobj (27; 0% instances), advcl (23; 0% instances), dep (9; 0% instances), acl:relcl (6; 0% instances), xcomp (6; 0% instances), ccomp (1; 0% instances)
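As a consistency check, the relation counts listed above sum to the 6125 PROPN tokens, and the percentages match each count divided by that total. The sketch assumes the page rounds to the nearest whole percent.

```python
# Counts copied from the list of parent relations above.
relations = {
    "nsubj": 1522, "obl": 1492, "flat": 1465, "nmod:poss": 793,
    "conj": 479, "flat:name": 131, "obj": 128, "root": 43,
    "iobj": 27, "advcl": 23, "dep": 9, "acl:relcl": 6,
    "xcomp": 6, "ccomp": 1,
}

total = sum(relations.values())
# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)            # 6125
print(shares["nsubj"])  # 25
print(shares["root"])   # 1
```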

Parents of PROPN nodes belong to 11 different parts of speech: PROPN (2124; 35% instances), VERB (2062; 34% instances), NOUN (1746; 29% instances), ADJ (88; 1% instances), [root, no parent] (43; 1% instances), ADV (26; 0% instances), PRON (21; 0% instances), NUM (7; 0% instances), ADP (3; 0% instances), CCONJ (3; 0% instances), SCONJ (2; 0% instances)

2981 (49%) PROPN nodes are leaves.

1874 (31%) PROPN nodes have one child.

730 (12%) PROPN nodes have two children.

540 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 18.
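The child-degree figures above (leaves, one child, two children, three or more) can be obtained by counting how often each node ID appears in the HEAD column. The sketch below works on hypothetical (ID, UPOS, HEAD) triples for a single sentence, not on treebank data; the function name is illustrative.

```python
from collections import Counter

# Hypothetical sentence as (ID, UPOS, HEAD) triples; HEAD 0 is the root.
tokens = [
    (1, "PROPN", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PROPN", 2),
    (5, "PROPN", 4),
    (6, "PUNCT", 2),
]


def degree_histogram(sentence, tag):
    """Count nodes of `tag` by number of children, capping at 3 ('three or more')."""
    children = Counter(head for _, _, head in sentence)
    hist = Counter()
    for node_id, upos, _ in sentence:
        if upos == tag:
            hist[min(children[node_id], 3)] += 1
    return hist


hist = degree_histogram(tokens, "PROPN")
print(hist[0], hist[1], hist[2], hist[3])  # 2 0 1 0

# The four bucket counts reported on this page add up to all PROPN tokens.
page_total = 2981 + 1874 + 730 + 540
print(page_total)  # 6125
```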

Children of PROPN nodes are attached using 23 different relations: flat (1464; 27% instances), case (1308; 24% instances), punct (684; 12% instances), obl (600; 11% instances), conj (499; 9% instances), cc (280; 5% instances), flat:name (131; 2% instances), acl:relcl (115; 2% instances), advmod (94; 2% instances), nmod:poss (70; 1% instances), amod (52; 1% instances), nmod (51; 1% instances), nsubj (34; 1% instances), nummod (31; 1% instances), cop (19; 0% instances), mark (18; 0% instances), xcomp (17; 0% instances), det (5; 0% instances), advcl (4; 0% instances), dep (4; 0% instances), ccomp (3; 0% instances), obj (3; 0% instances), flat:foreign (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (2124; 39% instances), ADP (1306; 24% instances), PUNCT (684; 12% instances), NOUN (626; 11% instances), CCONJ (288; 5% instances), VERB (138; 3% instances), ADV (101; 2% instances), PRON (80; 1% instances), ADJ (63; 1% instances), NUM (36; 1% instances), AUX (19; 0% instances), SCONJ (12; 0% instances), DET (5; 0% instances), SYM (3; 0% instances), X (2; 0% instances)