
This page pertains to UD version 2.

Treebank Statistics: UD_Nepali-BK: POS Tags: NOUN

There are 79 NOUN lemmas (29%), 100 NOUN types (28%) and 161 NOUN tokens (20%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: कर्तव्य, मान्छे, घ्यू, आगो, भुत, खोटो, ठाउँ, कथन, कुरा, गोठ

The 10 most frequent NOUN types: घ्यू, कर्तव्य, मान्छे, भुत, खोटो, थकाई, आगो, कर्तव्यको, कुरा, गोठालो

The most frequent ambiguous lemmas (only 3 observed): धारा (NOUN 3, PROPN 1), पानी (NOUN 3, PROPN 1), भोलिपल्ट (ADV 1, NOUN 1)

The most frequent ambiguous types (only 2 observed): धारा (NOUN 3, PROPN 1), भोलिपल्ट (ADV 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.265823 (the average of all parts of speech is 1.329630).
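The ratio above is consistent with dividing the 100 NOUN types by the 79 NOUN lemmas (100 / 79 ≈ 1.265823). The following is a minimal sketch of that calculation, assuming the ratio is simply distinct forms over distinct lemmas; the function name `form_lemma_ratio` is hypothetical, and the sample pairs cover only the three lemmas whose forms are listed on this page, not the whole treebank.

```python
def form_lemma_ratio(pairs):
    """Distinct word forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    Assumption: this mirrors how the page derives its ratio.
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Forms listed on this page for three NOUN lemmas (a subset, for illustration).
SAMPLE_PAIRS = [
    ("कर्तव्य", "कर्तव्य"), ("कर्तव्यका", "कर्तव्य"), ("कर्तव्यको", "कर्तव्य"),
    ("कर्तव्यबारे", "कर्तव्य"), ("कर्तव्यलाई", "कर्तव्य"),
    ("कथन", "कथन"), ("कथनअनुसार", "कथन"), ("कथनभित्र", "कथन"), ("कथनमा", "कथन"),
    ("आगो", "आगो"), ("आगोमा", "आगो"), ("आगोले", "आगो"),
]

sample_ratio = form_lemma_ratio(SAMPLE_PAIRS)  # 12 forms over 3 lemmas
page_ratio = round(100 / 79, 6)                # types over lemmas, as reported
```

The sample ratio is much higher than the treebank-wide figure because it contains only the three lemmas with the most forms.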

The highest number of forms (5) was observed with the lemma “कर्तव्य”: कर्तव्य, कर्तव्यका, कर्तव्यको, कर्तव्यबारे, कर्तव्यलाई.

The 2nd highest number of forms (4) was observed with the lemma “कथन”: कथन, कथनअनुसार, कथनभित्र, कथनमा.

The 3rd highest number of forms (3) was observed with the lemma “आगो”: आगो, आगोमा, आगोले.

NOUN occurs with 3 features: Number (160; 99% instances), Case (159; 99% instances), Gender (157; 98% instances)

NOUN occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Erg, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 15 feature combinations. The most frequent feature combination is Case=Nom|Gender=Neut|Number=Sing (56 tokens). Examples: मान्छे, भुत, कर्तव्य, गोठालो, दिन, धारा, कुरा, ठाउँ, भुत्ला, भुत्लै
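A feature combination such as Case=Nom|Gender=Neut|Number=Sing is the content of the FEATS column in CoNLL-U: feature-value pairs joined by `|`, with `_` standing for an empty set. As a rough sketch, the string can be split into a dictionary as shown below; the helper name `parse_feats` is hypothetical and is not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    An underscore (or an empty string) means the token has no features.
    """
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = parse_feats("Case=Nom|Gender=Neut|Number=Sing")
no_features = parse_feats("_")
```

Counting tokens per feature, per feature-value pair, or per full combination then reduces to tallying the keys, the items, or the original string respectively.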

Relations

NOUN nodes are attached to their parents using 16 different relations: obj (41; 25% instances), obl (33; 20% instances), nsubj (29; 18% instances), nmod:poss (18; 11% instances), appos (8; 5% instances), compound (6; 4% instances), iobj (5; 3% instances), xcomp (4; 2% instances), conj (3; 2% instances), discourse (3; 2% instances), nmod (3; 2% instances), ccomp (2; 1% instances), dislocated (2; 1% instances), reparandum (2; 1% instances), acl (1; 1% instances), parataxis (1; 1% instances)
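The percentages in the list above appear to be each relation's count divided by the 161 NOUN tokens, rounded to a whole percent. The sketch below recomputes them from the counts given on this page, under that assumption; `percent_shares` is a hypothetical helper name, and the rounding rule used by the page generator is not stated here.

```python
def percent_shares(counts):
    """Map each label to its share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Incoming relations of NOUN nodes, copied from the list above.
NOUN_RELATIONS = {
    "obj": 41, "obl": 33, "nsubj": 29, "nmod:poss": 18, "appos": 8,
    "compound": 6, "iobj": 5, "xcomp": 4, "conj": 3, "discourse": 3,
    "nmod": 3, "ccomp": 2, "dislocated": 2, "reparandum": 2,
    "acl": 1, "parataxis": 1,
}

total_nouns = sum(NOUN_RELATIONS.values())
shares = percent_shares(NOUN_RELATIONS)
```

The counts sum to the 161 NOUN tokens reported at the top of the page, which is a quick consistency check on the list.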

Parents of NOUN nodes belong to 6 different parts of speech: VERB (111; 69% instances), NOUN (42; 26% instances), ADJ (3; 2% instances), AUX (2; 1% instances), PROPN (2; 1% instances), ADP (1; 1% instances)

61 (38%) NOUN nodes are leaves.

62 (39%) NOUN nodes have one child.

22 (14%) NOUN nodes have two children.

16 (10%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.
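The child-degree figures above (leaves, one child, two children, and so on) can be derived by counting, for each NOUN node, how many tokens name it in the HEAD column. The following is a minimal sketch for a single CoNLL-U sentence; the function name `noun_child_degrees` is hypothetical and the sample sentence is a made-up toy, not data from UD_Nepali-BK.

```python
from collections import Counter


def noun_child_degrees(conllu_sentence, upos="NOUN"):
    """Return a Counter mapping child degree -> number of `upos` nodes.

    Handles one sentence at a time, because token IDs restart per sentence.
    Multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1") are skipped.
    """
    rows = []
    for line in conllu_sentence.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue
        rows.append(cols)
    # Column 7 (index 6) is HEAD; tally how often each ID is used as a head.
    children = Counter(cols[6] for cols in rows)
    # Column 4 (index 3) is UPOS; a missing key in `children` counts as 0.
    return Counter(children[cols[0]] for cols in rows if cols[3] == upos)


# Toy sentence for illustration only (not from the treebank).
SAMPLE = "\n".join([
    "# text = the dog saw cats",
    "1\tthe\tthe\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tdog\tdog\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tsaw\tsee\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tcats\tcat\tNOUN\t_\t_\t3\tobj\t_\t_",
])

degrees = noun_child_degrees(SAMPLE)
```

Summing the per-sentence counters over a whole treebank file would give a distribution of the kind reported above, with the largest key being the highest child degree.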

Children of NOUN nodes are attached using 22 different relations: det (35; 21% instances), nmod:poss (22; 13% instances), amod (20; 12% instances), discourse (16; 10% instances), acl (14; 9% instances), punct (13; 8% instances), appos (7; 4% instances), cc (6; 4% instances), compound (6; 4% instances), nmod (5; 3% instances), nummod (4; 2% instances), conj (3; 2% instances), advmod (2; 1% instances), case (2; 1% instances), reparandum (2; 1% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), aux (1; 1% instances), cop (1; 1% instances), dislocated (1; 1% instances), obl (1; 1% instances), parataxis (1; 1% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (42; 26% instances), DET (29; 18% instances), ADJ (21; 13% instances), VERB (17; 10% instances), PART (13; 8% instances), PUNCT (13; 8% instances), PRON (9; 5% instances), CCONJ (6; 4% instances), NUM (4; 2% instances), PROPN (4; 2% instances), ADP (2; 1% instances), ADV (2; 1% instances), AUX (2; 1% instances)