home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Assamese-AiW: POS Tags: NOUN

There are 184 NOUN lemmas (46%), 227 NOUN types (43%) and 278 NOUN tokens (32%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: তল, পূজা, শহা, কথা, পাছ, কাম, নাম, মানুহ, কমিটি, কিতাপ

The 10 most frequent NOUN types: পূজা, তললৈ, পাছে, কথা, কাম, ঠাইখনৰ, পৃথিৱীৰ, ব্যৱস্থা, ভিতৰত, শহাটো

The 10 most frequent ambiguous lemmas: দেখা (NOUN 1, PART 1), পৰিস্কাৰ (ADJ 1, NOUN 1), বহা (VERB 3, NOUN 1), বিষয় (ADP 1, NOUN 1)

The 10 most frequent ambiguous types: বিষয়ে (ADP 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.233696 (the average of all parts of speech is 1.317618).

The 1st highest number of forms (5) was observed with the lemma “তল”: তলখনত, তলত, তললৈ, তললৈকে, তলৰ.

The 2nd highest number of forms (4) was observed with the lemma “কথা”: কথা, কথাখিনি, কথাবোৰ, কথাহে.

The 3rd highest number of forms (3) was observed with the lemma “কিতাপ”: কিতাপখনলৈ, কিতাপখনৰ, কিতাপৰ.

NOUN occurs with 4 features: Number (251; 90% instances), Case (234; 84% instances), Definite (37; 13% instances), Gender (2; 1% instances)

NOUN occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Erg, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Gender=Fem, Number=Plur, Number=Sing

NOUN occurs with 32 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (67 tokens). Examples: পূজা, কথা, নীতি, পলম, ব্যৱহাৰ, অক্ষাংশ, অনুগ্রহ, আইদেউ, আগমণ, আমনি

Relations

NOUN nodes are attached to their parents using 18 different relations: obl (68; 24% instances), obj (49; 18% instances), nmod (37; 13% instances), nsubj (37; 13% instances), compound:lvc (20; 7% instances), nmod:poss (19; 7% instances), compound (18; 6% instances), conj (13; 5% instances), root (4; 1% instances), ccomp (3; 1% instances), compound:redup (2; 1% instances), parataxis (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), iobj (1; 0% instances), nsubj:pass (1; 0% instances), obl:lmod (1; 0% instances), vocative (1; 0% instances)

Parents of NOUN nodes belong to 6 different parts of speech: VERB (175; 63% instances), NOUN (90; 32% instances), ADJ (7; 3% instances), (4; 1% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)

112 (40%) NOUN nodes are leaves.

105 (38%) NOUN nodes have one child.

42 (15%) NOUN nodes have two children.

19 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 7.

Children of NOUN nodes are attached using 21 different relations: nmod (38; 15% instances), amod (32; 12% instances), nmod:poss (29; 11% instances), det (25; 10% instances), punct (21; 8% instances), compound (19; 7% instances), acl (14; 5% instances), nummod (14; 5% instances), conj (13; 5% instances), case (11; 4% instances), cc (9; 3% instances), advmod (7; 3% instances), discourse (7; 3% instances), nsubj (6; 2% instances), mark (5; 2% instances), appos (2; 1% instances), compound:redup (2; 1% instances), obj (2; 1% instances), advcl (1; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (90; 35% instances), ADJ (33; 13% instances), DET (23; 9% instances), PUNCT (21; 8% instances), PRON (18; 7% instances), VERB (17; 7% instances), NUM (15; 6% instances), ADP (12; 5% instances), CCONJ (9; 3% instances), ADV (7; 3% instances), PART (6; 2% instances), SCONJ (4; 2% instances), PROPN (2; 1% instances), AUX (1; 0% instances), INTJ (1; 0% instances)