home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Faroese-OFT: POS Tags: PROPN

There are 428 PROPN lemmas (17%), 455 PROPN types (14%) and 805 PROPN tokens (8%). Out of 15 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: Føroyar, Noreg, Danmark, Kanada, Niðurlond, Amerika, Grønland, Russland, Tórshavn, Frakland

The 10 most frequent PROPN types: Føroyum, Føroya, Føroyar, Noregi, Danmark, Kanada, Amerika, Kina, Fraklandi, Italia

The 10 most frequent ambiguous lemmas: søga (NOUN 4, PROPN 1)

The 10 most frequent ambiguous types: Evropa (NOUN 4, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.063084 (the average of all parts of speech is 1.289602).

The 1st highest number of forms (3) was observed with the lemma “Føroyar”: Føroya, Føroyar, Føroyum.

The 2nd highest number of forms (3) was observed with the lemma “Russland”: Russland, Russlandi, Russlands.

The 3rd highest number of forms (3) was observed with the lemma “Týskland”: Týskland, Týsklandi, Týsklands.

PROPN occurs with 4 features: Case (805; 100% instances), Number (802; 100% instances), Gender (329; 41% instances), Definite (248; 31% instances)

PROPN occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 37 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (346 tokens). Examples: Kanada, Amerika, Italia, Nigeria, Asia, Jackson, Norra, Nuuk, Oslo, Virginia

Relations

PROPN nodes are attached to their parents using 14 different relations: nsubj (210; 26% instances), root (183; 23% instances), obl (160; 20% instances), nmod (106; 13% instances), nmod:poss (44; 5% instances), flat (40; 5% instances), conj (34; 4% instances), appos (7; 1% instances), compound (7; 1% instances), dep (6; 1% instances), xcomp (3; 0% instances), obj (2; 0% instances), parataxis (2; 0% instances), orphan (1; 0% instances)

Parents of PROPN nodes belong to 7 different parts of speech: NOUN (422; 52% instances), (183; 23% instances), VERB (109; 14% instances), PROPN (73; 9% instances), ADJ (15; 2% instances), NUM (2; 0% instances), PRON (1; 0% instances)

276 (34%) PROPN nodes are leaves.

289 (36%) PROPN nodes have one child.

106 (13%) PROPN nodes have two children.

134 (17%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 20 different relations: case (333; 33% instances), punct (203; 20% instances), nsubj (118; 12% instances), cop (117; 12% instances), parataxis (73; 7% instances), flat (40; 4% instances), conj (39; 4% instances), cc (33; 3% instances), obl (10; 1% instances), advmod (7; 1% instances), compound (7; 1% instances), nmod (7; 1% instances), appos (6; 1% instances), expl (3; 0% instances), acl (2; 0% instances), aux (2; 0% instances), dep (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), amod (1; 0% instances)

Children of PROPN nodes belong to 10 different parts of speech: ADP (333; 33% instances), NOUN (218; 22% instances), PUNCT (203; 20% instances), VERB (121; 12% instances), PROPN (73; 7% instances), CCONJ (33; 3% instances), PRON (11; 1% instances), ADV (6; 1% instances), NUM (5; 0% instances), ADJ (2; 0% instances)