Treebank Statistics: UD_Icelandic-GC: POS Tags: PROPN
There are 3306 PROPN lemmas (22%), 3606 PROPN types (17%) and 6125 PROPN tokens (6%).
Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.
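Counts and ranks of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names `upos_counts` and `rank` are hypothetical, the sample sentence is made up for illustration, and it assumes that types are distinct word forms compared case-sensitively.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: case-sensitive word forms
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by tuple index (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

# Made-up sample sentence, for illustration only.
SAMPLE = "\n".join([
    "# text = Jón býr á Íslandi",
    "1\tJón\tJón\tPROPN\t_\tCase=Nom|Gender=Masc\t2\tnsubj\t_\t_",
    "2\tbýr\tbúa\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tá\tá\tADP\t_\t_\t4\tcase\t_\t_",
    "4\tÍslandi\tÍsland\tPROPN\t_\tCase=Dat|Gender=Neut\t2\tobl\t_\t_",
    "",
])

counts = upos_counts(SAMPLE)
propn_rank_by_tokens = rank(counts, "PROPN", 2)
```

Tags with equal counts keep their first-seen order here; how the original statistics break ties is not stated on this page.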
The 10 most frequent PROPN lemmas: ísland, Trump, Bandaríkin, Reykjavík, Jón, þór, Guðmundur, Ólafur, Alþingi, Katrín
The 10 most frequent PROPN types: Trump, Íslands, Íslandi, þór, Jón, Reykjavík, Ísland, Bandaríkjunum, Guðmundur, Bandaríkjanna
The 10 most frequent ambiguous lemmas: ísland (NOUN 3, PROPN 2), Bandaríkin (PROPN 46, NOUN 12), Reykjavík (PROPN 40, NOUN 11), Alþingi (PROPN 19, NOUN 9), Sjálfstæðisflokkur (PROPN 17, NOUN 6), 2 (NUM 25, PROPN 16), Bretland (PROPN 16, NOUN 7), Facebook (PROPN 15, NOUN 5), Noregur (PROPN 15, NOUN 6), Framsóknarflokkur (PROPN 14, NOUN 1)
The 10 most frequent ambiguous types: Íslands (PROPN 53, NOUN 24), Íslandi (PROPN 30, NOUN 25), Reykjavík (PROPN 25, NOUN 4), Ísland (PROPN 25, NOUN 9), Bandaríkjunum (PROPN 24, NOUN 5), Bandaríkjanna (PROPN 19, NOUN 6), 2 (NUM 25, PROPN 16), Facebook (PROPN 15, NOUN 5), Reykjavíkur (PROPN 15, NOUN 7), RÚV (PROPN 15, NOUN 11)
Morphology
The form / lemma ratio of PROPN is 1.090744 (the average of all parts of speech is 1.434754).
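The reported ratio is consistent with dividing the number of PROPN types (3606) by the number of PROPN lemmas (3306) given at the top of this page. That reading is an inference from the figures, not a documented definition; a quick check:

```python
# Figures taken from the counts reported at the top of this page.
propn_types = 3606
propn_lemmas = 3306

# Assumption: "form / lemma ratio" means distinct forms per distinct lemma.
ratio = propn_types / propn_lemmas
ratio_6dp = f"{ratio:.6f}"
print(ratio_6dp)  # prints 1.090744
```

A value this close to 1 means most proper-noun lemmas appear in a single form, compared with about 1.43 forms per lemma across all parts of speech.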
The 1st highest number of forms (5) was observed with the lemma “Framsóknarflokkur”: Framsóknarflokknum, Framsóknarflokks, Framsóknarflokksins, Framsóknarflokkur, Framsóknarflokkurinn.
The 2nd highest number of forms (5) was observed with the lemma “Seðlabanki”: Seðlabanka, Seðlabankans, Seðlabankanum, Seðlabanki, Seðlabankinn.
The 3rd highest number of forms (5) was observed with the lemma “Sjálfstæðisflokkur”: Sjálfstæðisflokknum, Sjálfstæðisflokks, Sjálfstæðisflokksins, Sjálfstæðisflokkur, Sjálfstæðisflokkurinn.
PROPN occurs with 4 features: Case (5158; 84% instances), Gender (4964; 81% instances), Number (2570; 42% instances), Definite (361; 6% instances)
PROPN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
PROPN occurs with 71 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc (1149 tokens).
Examples: Trump, þór, Jón, Guðmundur, Ólafur, Sigurður, Björn, Páll, Bjarni, Guðmundsson
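A feature combination such as Case=Nom|Gender=Masc is the content of the FEATS column in CoNLL-U: feature-value pairs joined by `|`, or `_` when the token has no features. A minimal sketch of splitting it into individual features follows; the helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}  # "_" marks a token without morphological features
    return dict(pair.split("=", 1) for pair in feats.split("|"))

most_frequent = parse_feats("Case=Nom|Gender=Masc")
no_features = parse_feats("_")
```

Counting the keys of these dicts over all PROPN tokens would give per-feature totals like those listed above, and counting the unsplit strings would give the feature combinations.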
Relations
PROPN nodes are attached to their parents using 14 different relations: nsubj (1522; 25% instances), obl (1492; 24% instances), flat (1465; 24% instances), nmod:poss (793; 13% instances), conj (479; 8% instances), flat:name (131; 2% instances), obj (128; 2% instances), root (43; 1% instances), iobj (27; 0% instances), advcl (23; 0% instances), dep (9; 0% instances), acl:relcl (6; 0% instances), xcomp (6; 0% instances), ccomp (1; 0% instances)
Parents of PROPN nodes belong to 11 different parts of speech: PROPN (2124; 35% instances), VERB (2062; 34% instances), NOUN (1746; 29% instances), ADJ (88; 1% instances), no part of speech, i.e. the artificial root (43; 1% instances), ADV (26; 0% instances), PRON (21; 0% instances), NUM (7; 0% instances), ADP (3; 0% instances), CCONJ (3; 0% instances), SCONJ (2; 0% instances)
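Both tallies above, incoming relation and parent part of speech, can be collected in one pass over CoNLL-U data. This is a sketch under stated assumptions rather than the generator's actual code: `attachment_stats` is a hypothetical name, the sample sentence is made up, and a head of 0 (the artificial root) is recorded with an empty part of speech.

```python
from collections import Counter

def attachment_stats(conllu_text, upos="PROPN"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    deprels, parent_pos = Counter(), Counter()
    sentence = {}  # token id -> (upos, head id, deprel)

    def flush():
        for tok_upos, head, deprel in sentence.values():
            if tok_upos != upos:
                continue
            deprels[deprel] += 1
            # Head "0" is the artificial root, which has no part of speech.
            parent_pos[sentence[head][0] if head in sentence else ""] += 1
        sentence.clear()

    # The trailing "" guarantees the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if not line:
            flush()
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        sentence[cols[0]] = (cols[3], cols[6], cols[7])
    return deprels, parent_pos

# Made-up sample sentence, for illustration only.
SAMPLE = "\n".join([
    "1\tJón\tJón\tPROPN\t_\tCase=Nom|Gender=Masc\t2\tnsubj\t_\t_",
    "2\tbýr\tbúa\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tá\tá\tADP\t_\t_\t4\tcase\t_\t_",
    "4\tÍslandi\tÍsland\tPROPN\t_\tCase=Dat|Gender=Neut\t2\tobl\t_\t_",
])

deprels, parent_pos = attachment_stats(SAMPLE)
```

Each PROPN node contributes exactly one relation and one parent, so both tallies sum to the token count; the figures above each add up to 6125.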
2981 (49%) PROPN nodes are leaves.
1874 (31%) PROPN nodes have one child.
730 (12%) PROPN nodes have two children.
540 (9%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 18.
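The four child-degree groups partition the PROPN tokens, so their counts should add up to the 6125 tokens reported at the top of this page, and the percentages should follow from that total. A short arithmetic check using only the figures above:

```python
# Counts of PROPN nodes by number of children, as reported above.
degree_counts = {
    "leaves": 2981,
    "one child": 1874,
    "two children": 730,
    "three or more children": 540,
}

total = sum(degree_counts.values())

# Share of each group in whole percent, rounded to the nearest integer.
shares = {name: round(100 * n / total) for name, n in degree_counts.items()}

for name, n in degree_counts.items():
    print(f"{name}: {n} ({shares[name]}%)")
print("total:", total)
```

Because each share is rounded separately, the percentages sum to 101 rather than 100.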
Children of PROPN nodes are attached using 23 different relations: flat (1464; 27% instances), case (1308; 24% instances), punct (684; 12% instances), obl (600; 11% instances), conj (499; 9% instances), cc (280; 5% instances), flat:name (131; 2% instances), acl:relcl (115; 2% instances), advmod (94; 2% instances), nmod:poss (70; 1% instances), amod (52; 1% instances), nmod (51; 1% instances), nsubj (34; 1% instances), nummod (31; 1% instances), cop (19; 0% instances), mark (18; 0% instances), xcomp (17; 0% instances), det (5; 0% instances), advcl (4; 0% instances), dep (4; 0% instances), ccomp (3; 0% instances), obj (3; 0% instances), flat:foreign (1; 0% instances)
Children of PROPN nodes belong to 15 different parts of speech: PROPN (2124; 39% instances), ADP (1306; 24% instances), PUNCT (684; 12% instances), NOUN (626; 11% instances), CCONJ (288; 5% instances), VERB (138; 3% instances), ADV (101; 2% instances), PRON (80; 1% instances), ADJ (63; 1% instances), NUM (36; 1% instances), AUX (19; 0% instances), SCONJ (12; 0% instances), DET (5; 0% instances), SYM (3; 0% instances), X (2; 0% instances)