
This page pertains to UD version 2.

Treebank Statistics: UD_Kadiweu-Unicamp: POS Tags: NOUN

There are 36 NOUN lemmas (44%), 48 NOUN types (49%) and 111 NOUN tokens (35%). Out of 11 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: Geladi, binie, watece, naigi, ooligi, wetiga, idi, weiigi, eyodi, iwa

The 10 most frequent NOUN types: iGeladi, libinienigi, liGeladi, niwatece, looligi, naigi, eyodi, weiigi, wetiGa, niganaGacanajo

The 10 most frequent ambiguous lemmas: napioi (ADJ 3, NOUN 2, PRON 1)

The 10 most frequent ambiguous types: napioi (ADJ 3, NOUN 2)

Morphology

The form / lemma ratio of NOUN is 1.333333 (the average of all parts of speech is 1.209877).
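The ratio above appears to be the number of distinct NOUN forms (types) divided by the number of distinct NOUN lemmas: 48 / 36 = 1.333333. A minimal Python sketch of that calculation follows. The function name `form_lemma_ratio` is hypothetical, and the sketch assumes forms are compared case-sensitively, which this page does not state. The sample pairs are only the forms quoted on this page for the three lemmas with multiple forms, so the sample ratio differs from the treebank-wide figure.

```python
def form_lemma_ratio(pairs):
    """Return distinct forms / distinct lemmas for (form, lemma) pairs of one POS tag."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Only the NOUN forms quoted on this page, not the full treebank.
noun_pairs = [
    ("libiniena", "binie"),
    ("libinienaGa", "binie"),
    ("libinienigi", "binie"),
    ("libinienigipi", "binie"),
    ("iGeladi", "Geladi"),
    ("liGeladi", "Geladi"),
    ("lidGegi", "idi"),
    ("lidi", "idi"),
]

sample_ratio = form_lemma_ratio(noun_pairs)  # 8 forms over 3 lemmas
page_ratio = 48 / 36                         # type and lemma counts reported on this page
print(f"{page_ratio:.6f}")
```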

The highest number of forms (4) was observed with the lemma “binie”: libiniena, libinienaGa, libinienigi, libinienigipi.

The 2nd highest number of forms (2) was observed with the lemma “Geladi”: iGeladi, liGeladi.

The 3rd highest number of forms (2) was observed with the lemma “idi”: lidGegi, lidi.

NOUN occurs with 5 features: Number (105; 95% instances), Gender (102; 92% instances), Person[psor] (70; 63% instances), Degree (16; 14% instances), Number[psor] (12; 11% instances)

NOUN occurs with 10 feature-value pairs: Degree=Dim, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Number=Plur, Number=Sing, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 21 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|Person[psor]=3 (30 tokens). Examples: liGeladi, looligi, nioladi, LotaGa, lidGegi, liwigo, loojedi, lidi, liwenigi, lodajo

Relations

NOUN nodes are attached to their parents using 7 different relations: nsubj (51; 46% instances), root (23; 21% instances), obj (18; 16% instances), nmod:poss (10; 9% instances), acl:relcl (4; 4% instances), dislocated (3; 3% instances), advcl (2; 2% instances)
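The percentages in the relation list can be reproduced from the raw counts, assuming each percentage is the count divided by the total number of NOUN tokens (111) and rounded to the nearest integer. The sketch below is illustrative only: `relation_shares` is a hypothetical helper, and the counts are copied from the line above rather than read from the treebank files.

```python
from collections import Counter

def relation_shares(counts):
    """Return (relation, count, rounded percent) triples, most frequent first."""
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]

# Counts of incoming relations of NOUN nodes, as listed on this page.
noun_deprels = Counter({
    "nsubj": 51, "root": 23, "obj": 18, "nmod:poss": 10,
    "acl:relcl": 4, "dislocated": 3, "advcl": 2,
})

shares = relation_shares(noun_deprels)
for rel, n, pct in shares:
    print(f"{rel} ({n}; {pct}% instances)")
```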

Parents of NOUN nodes belong to 4 different categories, counting the artificial root: VERB (51; 46% instances), NOUN (31; 28% instances), no parent, i.e. root nodes (23; 21% instances), ADJ (6; 5% instances)

42 (38%) NOUN nodes are leaves.

36 (32%) NOUN nodes have one child.

25 (23%) NOUN nodes have two children.

8 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.
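The child-degree figures above (leaves, one child, two children, three or more) can be derived by counting how often each token ID occurs as a head. The following Python sketch shows one way to do this. The function name and the `(id, head, upos)` tuple layout are assumptions made for illustration, and the toy sentence is invented to exercise the code; it is not taken from the treebank.

```python
from collections import Counter

def child_degree_histogram(sentence, upos="NOUN"):
    """Map child count -> number of nodes with that count, for nodes tagged `upos`.

    `sentence` is a list of (id, head, upos) tuples; head 0 marks the root.
    """
    n_children = Counter(head for _, head, _ in sentence)
    return Counter(n_children[tok_id] for tok_id, _, tag in sentence if tag == upos)

# Invented toy tree: DET -> NOUN -> VERB (root) <- NOUN, PUNCT
toy = [
    (1, 2, "DET"),
    (2, 3, "NOUN"),
    (3, 0, "VERB"),
    (4, 3, "NOUN"),
    (5, 3, "PUNCT"),
]

hist = child_degree_histogram(toy)
print(dict(hist))      # child count -> number of NOUN nodes
print(max(hist))       # highest child degree among NOUN nodes in the toy tree
```

As a consistency check on the figures reported here, the four child-degree groups add up to the NOUN token total: 42 + 36 + 25 + 8 = 111.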

Children of NOUN nodes are attached using 8 different relations: det (34; 30% instances), punct (23; 21% instances), nsubj (18; 16% instances), nmod:poss (15; 13% instances), acl:relcl (10; 9% instances), mark (10; 9% instances), advcl (1; 1% instances), advmod (1; 1% instances)

Children of NOUN nodes belong to 9 different parts of speech: DET (32; 29% instances), NOUN (31; 28% instances), PUNCT (23; 21% instances), SCONJ (10; 9% instances), PRON (5; 4% instances), PROPN (4; 4% instances), ADJ (3; 3% instances), VERB (3; 3% instances), PART (1; 1% instances)