
This page pertains to UD version 2.

Treebank Statistics: UD_Buryat-BDT: POS Tags: PROPN

There are 371 PROPN lemmas (14%), 441 PROPN types (10%) and 710 PROPN tokens (7%). Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Байгал, Булад, Улаан-Үдэ, Цыпелма, Энэдхэг, Баяр, Матти, Дандар, Хойто, Баянгаза

The 10 most frequent PROPN types: Байгал, Булад, Цыпелма, Баяр, Энэдхэг, Дандар, Матти, Хойто, Екатерина, Магдан

The 10 most frequent ambiguous lemmas: зандин (PROPN 6, NOUN 1), _ (PART 9, VERB 9, ADJ 7, NOUN 7, ADV 4, PROPN 4, PRON 3, DET 2, ADP 1, AUX 1)

The 10 most frequent ambiguous types: Баянгаза (PROPN 5, NOUN 1), Зүүн (PROPN 4, ADJ 1), Буладай (PROPN 2, NOUN 1), Гүрэнэй (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.188679 (the average of all parts of speech is 1.635385).
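The reported ratio is consistent with dividing the number of PROPN types (441) by the number of PROPN lemmas (371) given at the top of this page. The sketch below reproduces that arithmetic. The function name is hypothetical, and reading the ratio as types over lemmas is an assumption inferred from the numbers, not a documented definition.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas


# Counts taken from the PROPN summary on this page.
propn_ratio = form_lemma_ratio(441, 371)
print(f"{propn_ratio:.6f}")
```

A ratio close to 1 means that most proper-noun lemmas were seen in only one form, which is lower than the treebank-wide average quoted above.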

The highest number of forms (5) was observed with the lemma “Цыпелма”: Цыпелма, Цыпелмаае, Цыпелмагай, Цыпелмада, Цыпелмае.

The 2nd highest number of forms (4) was observed with the lemma “_” (an unspecified lemma): Абагал, Кондратьев, Кондратьевай, Хатареев.

The 3rd highest number of forms (4) was observed with the lemma “Лыгденов”: Лыгденов, Лыгденовтэ, Лыгденовтэнэй, Лыгденовэй.

PROPN occurs with 6 features: Case (701; 99% instances), Gender (380; 54% instances), Number (10; 1% instances), Reflex (8; 1% instances), Number[psor] (2; 0% instances), Person[psor] (2; 0% instances)

PROPN occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Com, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number[psor]=Sing, Person[psor]=3, Reflex=Yes

PROPN occurs with 26 feature combinations. The most frequent feature combination is Case=Nom (194 tokens). Examples: Байгал, Энэдхэг, Хойто, Булад, Улаан-Үдэ, Баянгаза, Цыпелма, Баяр, Зүдхэлиин, Зүүн

Relations

PROPN nodes are attached to their parents using 13 different relations: flat (201; 28% instances), nmod (177; 25% instances), nsubj (91; 13% instances), compound (89; 13% instances), conj (50; 7% instances), appos (38; 5% instances), nmod:own (20; 3% instances), list (12; 2% instances), root (11; 2% instances), obj (10; 1% instances), vocative (6; 1% instances), orphan (3; 0% instances), iobj (2; 0% instances)

Parents of PROPN nodes belong to 7 different parts of speech: NOUN (286; 40% instances), PROPN (257; 36% instances), VERB (132; 19% instances), ADJ (20; 3% instances), no part of speech, i.e. the artificial root (11; 2% instances), ADP (3; 0% instances), PUNCT (1; 0% instances)
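Both distributions above (incoming relation and part of speech of the parent) can be tallied from the UPOS, HEAD and DEPREL columns of a CoNLL-U file. The sketch below shows one way to do it on a toy fragment. The helper names, the "(root)" label and the sample tokens are invented for illustration and are not the script or data behind this page.

```python
from collections import Counter


def read_sentences(conllu):
    """Yield sentences as lists of (id, upos, head, deprel) tuples."""
    sentence = []
    for line in conllu.splitlines():
        if not line.strip():          # blank line ends a sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):      # sentence-level comment
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():     # skip multiword ranges and empty nodes
            continue
        sentence.append((int(cols[0]), cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def propn_attachment_stats(conllu):
    """Count relations of PROPN nodes and the UPOS of their parents."""
    relations, parent_pos = Counter(), Counter()
    for sentence in read_sentences(conllu):
        upos_by_id = {tid: upos for tid, upos, _, _ in sentence}
        for _, upos, head, deprel in sentence:
            if upos != "PROPN":
                continue
            relations[deprel] += 1
            # HEAD 0 is the artificial root, which has no part of speech.
            parent_pos[upos_by_id.get(head, "(root)")] += 1
    return relations, parent_pos


# Toy fragment with placeholder forms (hypothetical, not from the treebank).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tX1\tX1\tPROPN\t_\tCase=Gen\t3\tnmod\t_\t_",
    "2\tX2\tX2\tPROPN\t_\tCase=Gen\t1\tflat\t_\t_",
    "3\tw3\tw3\tNOUN\t_\tCase=Nom\t4\tnsubj\t_\t_",
    "4\tw4\tw4\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\tX5\tX5\tPROPN\t_\tCase=Nom\t0\troot\t_\t_",
    "",
])

relations, parent_pos = propn_attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_pos.most_common())
```

On the figures reported above, the relation counts sum to 710, the number of PROPN tokens, and the 11 parents without a part of speech match the 11 PROPN nodes attached as root.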

332 (47%) PROPN nodes are leaves.

299 (42%) PROPN nodes have one child.

60 (8%) PROPN nodes have two children.

19 (3%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.
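The child-degree figures above amount to counting, for each PROPN node, how many tokens name it as their HEAD, and then bucketing the results. The sketch below illustrates this on one toy sentence and re-derives the rounded percentages from the counts reported on this page. The function names and the toy sentence are hypothetical.

```python
from collections import Counter


def propn_child_degrees(rows):
    """rows: (id, upos, head) tuples of one sentence; returns degrees of PROPN nodes."""
    n_children = Counter(head for _, _, head in rows)
    return [n_children[tid] for tid, upos, _ in rows if upos == "PROPN"]


def bucket(degrees):
    """Group child degrees into the four classes used on this page."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    return Counter(labels.get(d, "three or more children") for d in degrees)


# Toy sentence (hypothetical): token 2 depends on token 1, both are PROPN.
toy_rows = [(1, "PROPN", 3), (2, "PROPN", 1), (3, "NOUN", 4), (4, "VERB", 0), (5, "PUNCT", 4)]
toy_buckets = bucket(propn_child_degrees(toy_rows))
print(toy_buckets)

# Counts reported above; the shares are relative to all PROPN tokens.
reported = {"leaf": 332, "one child": 299, "two children": 60, "three or more children": 19}
total = sum(reported.values())
shares = {label: round(100 * count / total) for label, count in reported.items()}
print(total, shares)
```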

Children of PROPN nodes are attached using 19 different relations: flat (219; 44% instances), punct (101; 20% instances), conj (50; 10% instances), compound (22; 4% instances), nmod (21; 4% instances), cc (19; 4% instances), appos (14; 3% instances), list (12; 2% instances), acl (10; 2% instances), amod (6; 1% instances), discourse (5; 1% instances), orphan (3; 1% instances), advmod (2; 0% instances), case (2; 0% instances), det (2; 0% instances), nsubj (2; 0% instances), cop (1; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 10 different parts of speech: PROPN (257; 52% instances), PUNCT (102; 21% instances), NOUN (77; 16% instances), CCONJ (19; 4% instances), VERB (14; 3% instances), ADJ (10; 2% instances), ADV (5; 1% instances), PRON (5; 1% instances), ADP (3; 1% instances), NUM (1; 0% instances)