Treebank Statistics: UD_Bengali-BRU: POS Tags: NOUN
There are 35 NOUN
lemmas (30%), 38 NOUN
types (25%) and 62 NOUN
tokens (19%).
Out of 14 observed tags, the rank of NOUN
is: 1 in number of lemmas, 2 in number of types and 3 in number of tokens.
The 10 most frequent NOUN
lemmas: বাবা, নাম, গান, মা, গল্প, পতাকা, হাত, কার্টুন, ক্লাস, দেশ
The 10 most frequent NOUN
types: নাম, গান, বাবা, মা, হাত, কার্টুন, ক্লাসে, গল্প, দেশের, পতাকা
The 10 most frequent ambiguous lemmas: আজ (ADV 3, NOUN 1), পড়া (VERB 4, NOUN 1), মজা (ADJ 1, NOUN 1)
The 10 most frequent ambiguous types: আজ (ADV 3, NOUN 1), মজার (ADJ 1, NOUN 1)
- আজ
- মজার
Morphology
The form / lemma ratio of NOUN
is 1.085714 (the average of all parts of speech is 1.290598).
The 1st highest number of forms (2) was observed with the lemma “গল্প”: গল্প, গল্পটি.
The 2nd highest number of forms (2) was observed with the lemma “পতাকা”: পতাকা, পতাকার.
The 3rd highest number of forms (2) was observed with the lemma “বাবা”: বাবা, বাবার.
NOUN
occurs with 3 features: Case (62; 100% instances), Number (62; 100% instances), VerbForm (2; 3% instances)
NOUN
occurs with 6 feature-value pairs: Case=Gen
, Case=Loc
, Case=Nom
, Number=Plur
, Number=Sing
, VerbForm=Vnoun
NOUN
occurs with 5 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing
(44 tokens).
Examples: গান, নাম, বাবা, মা, কার্টুন, গল্প, পতাকা, বই, রং, শেষ
Relations
NOUN
nodes are attached to their parents using 11 different relations: obj (23; 37% instances), nsubj (10; 16% instances), root (8; 13% instances), obl (6; 10% instances), compound:lvc (3; 5% instances), nmod (3; 5% instances), compound (2; 3% instances), iobj (2; 3% instances), nmod:poss (2; 3% instances), vocative (2; 3% instances), acl (1; 2% instances)
Parents of NOUN
nodes belong to 6 different parts of speech: VERB (42; 68% instances), (8; 13% instances), NOUN (7; 11% instances), ADJ (3; 5% instances), NUM (1; 2% instances), PROPN (1; 2% instances)
33 (53%) NOUN
nodes are leaves.
16 (26%) NOUN
nodes have one child.
3 (5%) NOUN
nodes have two children.
10 (16%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 4.
Children of NOUN
nodes are attached using 10 different relations: nmod:poss (16; 30% instances), det (10; 19% instances), punct (10; 19% instances), amod (6; 11% instances), nmod (4; 8% instances), advmod (2; 4% instances), nsubj (2; 4% instances), acl (1; 2% instances), case (1; 2% instances), compound (1; 2% instances)
Children of NOUN
nodes belong to 8 different parts of speech: PRON (15; 28% instances), DET (10; 19% instances), PUNCT (10; 19% instances), ADJ (7; 13% instances), NOUN (7; 13% instances), ADV (2; 4% instances), ADP (1; 2% instances), VERB (1; 2% instances)