Treebank Statistics: UD_Livvi-KKPP: POS Tags: PROPN
There are 40 PROPN
lemmas (7%), 47 PROPN
types (6%) and 81 PROPN
tokens (5%).
Out of 14 observed tags, the rank of PROPN
is: 5 in number of lemmas, 6 in number of types and 7 in number of tokens.
The 10 most frequent PROPN
lemmas: Tver, Karjal, Anus, Mustonen, Peter, Petroskoi, Suomi, Karjala, Tapio, Anuksenlinna
The 10 most frequent PROPN
types: Tverin, Anuksen, Karjalan, Mustonen, Peter, Petroskoin, Suomes, Tapio, Anuksenlinnas, Karjalah
The 10 most frequent ambiguous lemmas: Karjal (PROPN 6, NOUN 2), Karelija (PROPN 1, X 1)
The 10 most frequent ambiguous types: Karelija (PROPN 1, X 1)
- Karelija
- PROPN 1: Festivualin hantuzis pietäh kaksi kilbua nuorih niškoi : Moja Karelija -videos’užietoin kilbu da Karelija – eto mi -karjalan , vepsän da suomen kieldy maltajien kilbu .
- X 1: Festivualin hantuzis pietäh kaksi kilbua nuorih niškoi : Moja Karelija -videos’užietoin kilbu da Karelija – eto mi -karjalan , vepsän da suomen kieldy maltajien kilbu .
Morphology
The form / lemma ratio of PROPN
is 1.175000 (the average of all parts of speech is 1.335034).
The 1st highest number of forms (3) was observed with the lemma “Karjal”: Karjalah, Karjalan, Karjalas.
The 2nd highest number of forms (3) was observed with the lemma “Peter”: Peter, Peteran, Peteras.
The 3rd highest number of forms (3) was observed with the lemma “Petroskoi”: Petroskoin, Petroskois, Petroskoispäi.
PROPN
occurs with 3 features: Case (78; 96% instances), Number (78; 96% instances), Clitic (1; 1% instances)
PROPN
occurs with 12 feature-value pairs: Case=Ade
, Case=All
, Case=Ela
, Case=Gen
, Case=Ill
, Case=Ine
, Case=Nom
, Case=Par
, Case=Tra
, Clitic=Gi
, Number=Plur
, Number=Sing
PROPN
occurs with 12 feature combinations.
The most frequent feature combination is Case=Gen|Number=Sing
(28 tokens).
Examples: Tverin, Karjalan, Anuksen, Petroskoin, Tuuksen, Koverin, Lihoslavl’an, Mägriän, Periodikan, Peteran
Relations
PROPN
nodes are attached to their parents using 10 different relations: nmod:poss (28; 35% instances), obl (18; 22% instances), conj (10; 12% instances), flat:name (9; 11% instances), nsubj (9; 11% instances), appos (3; 4% instances), nsubj:cop (1; 1% instances), obj (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)
Parents of PROPN
nodes belong to 4 different parts of speech: NOUN (31; 38% instances), VERB (25; 31% instances), PROPN (23; 28% instances), ADJ (2; 2% instances)
48 (59%) PROPN
nodes are leaves.
21 (26%) PROPN
nodes have one child.
9 (11%) PROPN
nodes have two children.
3 (4%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 4.
Children of PROPN
nodes are attached using 11 different relations: conj (10; 20% instances), flat:name (10; 20% instances), punct (9; 18% instances), cc (8; 16% instances), nmod:poss (5; 10% instances), orphan (2; 4% instances), amod (1; 2% instances), appos (1; 2% instances), nmod (1; 2% instances), obj (1; 2% instances), obl (1; 2% instances)
Children of PROPN
nodes belong to 5 different parts of speech: PROPN (23; 47% instances), PUNCT (9; 18% instances), CCONJ (8; 16% instances), NOUN (8; 16% instances), ADJ (1; 2% instances)