home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Komi_Permyak-UH: POS Tags: NOUN

There are 100 NOUN lemmas (30%), 122 NOUN types (31%) and 142 NOUN tokens (21%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: ай, дор, лун, йӧр, вон, керку, ой, ӧшын, гижӧт, лог

The 10 most frequent NOUN types: айӧ, йӧрсӧ, айся, дорас, дорын, луныс, машина, ойнас, олісьыс, ордчӧн

The 10 most frequent ambiguous lemmas: ордчӧн (ADV 2, NOUN 2), Митя (NOUN 1, PROPN 1), кыдз (ADV 3, NOUN 1), том (ADJ 1, NOUN 1)

The 10 most frequent ambiguous types: ордчӧн (NOUN 2, ADV 1), Митя (NOUN 1, PROPN 1), дынӧ (ADP 2, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.220000 (the average of all parts of speech is 1.186186).

The 1st highest number of forms (5) was observed with the lemma “ай”: ай, айся, айыт, айытся, айӧ.

The 2nd highest number of forms (4) was observed with the lemma “лун”: лун, лунас, луныс, лунӧ.

The 3rd highest number of forms (3) was observed with the lemma “вон”: воннэз, воныс, вонӧ.

NOUN occurs with 7 features: Number (133; 94% instances), Case (129; 91% instances), Number[psor] (45; 32% instances), Person[psor] (45; 32% instances), Animacy (14; 10% instances), Derivation (8; 6% instances), NameType (1; 1% instances)

NOUN occurs with 23 feature-value pairs: Animacy=Anim, Animacy=Hum, Animacy=Inan, Case=Acc, Case=Car, Case=Comp, Case=Dat, Case=Egr, Case=Ela, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Prl, Derivation=Dimin, Derivation=ProprietiveMod, NameType=Giv, Number=Plur, Number=Sing, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 40 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (43 tokens). Examples: машина, охота, Гырка, Дядя, Митя, Морт, Ныв, Челядь, ай, берег

Relations

NOUN nodes are attached to their parents using 18 different relations: nsubj (31; 22% instances), obj (23; 16% instances), obl:lmod (17; 12% instances), nmod (15; 11% instances), conj (14; 10% instances), obl (12; 8% instances), obl:tmod (6; 4% instances), advcl (5; 4% instances), appos (4; 3% instances), orphan (3; 2% instances), case (2; 1% instances), flat:name (2; 1% instances), nsubj:cop (2; 1% instances), root (2; 1% instances), amod (1; 1% instances), dislocated (1; 1% instances), nmod:lmod (1; 1% instances), vocative (1; 1% instances)

Parents of NOUN nodes belong to 8 different parts of speech: VERB (81; 57% instances), NOUN (28; 20% instances), ADJ (18; 13% instances), PROPN (6; 4% instances), PRON (4; 3% instances), ADV (2; 1% instances), (2; 1% instances), NUM (1; 1% instances)

63 (44%) NOUN nodes are leaves.

48 (34%) NOUN nodes have one child.

20 (14%) NOUN nodes have two children.

11 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.

Children of NOUN nodes are attached using 23 different relations: nmod (31; 23% instances), punct (20; 15% instances), amod (15; 11% instances), det (9; 7% instances), cc (8; 6% instances), conj (8; 6% instances), case (7; 5% instances), advmod (5; 4% instances), nummod (5; 4% instances), acl (4; 3% instances), advcl (3; 2% instances), acl:relcl (2; 2% instances), list (2; 2% instances), nmod:poss (2; 2% instances), obl (2; 2% instances), orphan (2; 2% instances), aux:neg (1; 1% instances), cop (1; 1% instances), flat:name (1; 1% instances), mark (1; 1% instances), nmod:lmod (1; 1% instances), nsubj (1; 1% instances), obl:tmod (1; 1% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (28; 21% instances), PUNCT (20; 15% instances), PRON (19; 14% instances), ADJ (15; 11% instances), VERB (11; 8% instances), CCONJ (8; 6% instances), PROPN (8; 6% instances), ADP (5; 4% instances), ADV (5; 4% instances), NUM (5; 4% instances), DET (4; 3% instances), AUX (2; 2% instances), PART (1; 1% instances), SCONJ (1; 1% instances)