This page pertains to UD version 2.

Treebank Statistics: UD_Madi-Jarawara: POS Tags: NOUN

There are 11 NOUN lemmas (23%), 11 NOUN types (17%) and 16 NOUN tokens (14%). Out of 8 observed tags, the rank of NOUN is: 2 in number of lemmas, 3 in number of types and 4 in number of tokens.
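The three counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that lemmas and types are counted as distinct, case-sensitive strings in the LEMMA and FORM columns. The function name `upos_counts`, the helper `_row` and the toy input are hypothetical, and the script that generated this page may differ in details.

```python
from collections import defaultdict


def upos_counts(conllu_text):
    """Return {UPOS: (distinct lemmas, distinct forms, tokens)} for CoNLL-U text."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, "#" lines are comments
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def _row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (hypothetical helper)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy input for illustration only; it is not a sentence from the treebank,
# and "v1" is a placeholder verb.
SAMPLE = "\n".join([
    "# toy sentence",
    _row(1, "abe", "abe", "NOUN", 3, "obj"),
    _row(2, "yama", "yama", "NOUN", 3, "nsubj"),
    _row(3, "v1", "v1", "VERB", 0, "root"),
    _row(4, "abe", "abe", "NOUN", 3, "obj"),
])

counts = upos_counts(SAMPLE)
```

In the toy input, NOUN has 2 lemmas, 2 types and 3 tokens, because the form "abe" occurs twice.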

The 10 most frequent NOUN lemmas: _, abe, yama, yobe, abise, howe, ini, ino, kabikana, taha

The 10 most frequent NOUN types: Taha, Yobe, abe, ini, yama, Aba, Abise, Howe, Kabikana, ino

The 10 most frequent ambiguous lemmas: _ (NOUN 3, VERB 3, PRON 2, PUNCT 1, X 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.382979).
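As a worked example, the ratio can be read as the number of distinct forms divided by the number of distinct lemmas. This reading is an assumption, but it agrees with the figures on this page (11 NOUN types and 11 NOUN lemmas). The function name and the toy pairs below are hypothetical.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma). Return distinct forms / distinct lemmas."""
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Made-up pairs for illustration: 3 distinct forms over 2 distinct lemmas.
toy_ratio = form_lemma_ratio([
    ("form_a", "lemma_1"),
    ("form_b", "lemma_1"),
    ("form_c", "lemma_2"),
])

# Figures reported on this page for NOUN: 11 types and 11 lemmas.
reported_ratio = 11 / 11
reported_text = f"{reported_ratio:.6f}"
```

With the reported counts the formatted result is 1.000000, matching the value given above.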

The 1st highest number of forms (3) was observed with the lemma “_”: Aba, Taha, ini.

The 2nd highest number of forms (1) was observed with the lemma “abe”: abe.

The 3rd highest number of forms (1) was observed with the lemma “abise”: Abise.

NOUN occurs with 1 feature: Gender (3; 19% instances)

NOUN occurs with 2 feature-value pairs: Gender=Fem, Gender=Masc

NOUN occurs with 3 feature combinations. The most frequent feature combination is _ (13 tokens). Examples: Taha, Yobe, abe, yama, Aba, Abise, Howe, Kabikana, wawasi

Relations

NOUN nodes are attached to their parents using 7 different relations: obj (6; 38% instances), nsubj (3; 19% instances), nmod (2; 13% instances), root (2; 13% instances), compound (1; 6% instances), obl (1; 6% instances), vocative (1; 6% instances)
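The percentages above follow from the counts and the 16 NOUN tokens. The sketch below recomputes them, assuming shares are rounded half up to whole percent; that assumption is consistent with every figure in the list, but the rounding rule itself is not stated on this page. The names `percent_half_up` and `NOUN_RELATIONS` are hypothetical.

```python
def percent_half_up(count, total):
    """Share of count in total as a whole percent, rounding halves up (integer arithmetic)."""
    return (200 * count + total) // (2 * total)


# Relation counts for NOUN nodes as listed on this page.
NOUN_RELATIONS = {
    "obj": 6,
    "nsubj": 3,
    "nmod": 2,
    "root": 2,
    "compound": 1,
    "obl": 1,
    "vocative": 1,
}

total = sum(NOUN_RELATIONS.values())
shares = {rel: percent_half_up(n, total) for rel, n in NOUN_RELATIONS.items()}
```

Because each share is rounded independently, the seven percentages add up to 101 rather than 100.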

Parents of NOUN nodes belong to 3 different parts of speech: VERB (11; 69% instances), NOUN (3; 19% instances), no part of speech (2; 13% instances; these are the 2 NOUN nodes attached as root)

10 (63%) NOUN nodes are leaves.

4 (25%) NOUN nodes have one child.

2 (13%) NOUN nodes have two children.

The highest child degree of a NOUN node is 2.
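The child-degree figures above can be derived from the HEAD column alone. The following is a minimal sketch under that assumption; the function name `child_degree_histogram` and the toy sentence are hypothetical and do not come from the treebank.

```python
from collections import Counter


def child_degree_histogram(tokens, upos="NOUN"):
    """tokens: list of (id, upos, head) for one sentence.

    Return a Counter mapping number of children -> number of nodes
    with the given UPOS tag that have that many children.
    """
    # Each token contributes one child to its head.
    n_children = Counter(head for _, _, head in tokens)
    # Nodes that never appear as a head get degree 0 (leaves).
    return Counter(n_children[tid] for tid, tag, _ in tokens if tag == upos)


# Toy sentence: token 3 is the root verb; tokens 1 and 4 are nouns with one child each.
toy_sentence = [
    (1, "NOUN", 3),
    (2, "NOUN", 1),
    (3, "VERB", 0),
    (4, "NOUN", 3),
    (5, "PUNCT", 4),
]

hist = child_degree_histogram(toy_sentence)
highest_degree = max(hist)
```

For the figures on this page, 4 nodes with one child and 2 nodes with two children give 8 children in total, which matches the sum of the child relation counts listed below.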

Children of NOUN nodes are attached using 5 different relations: amod (2; 25% instances), nmod (2; 25% instances), punct (2; 25% instances), compound (1; 13% instances), dep (1; 13% instances)

Children of NOUN nodes belong to 4 different parts of speech: NOUN (3; 38% instances), PUNCT (2; 25% instances), VERB (2; 25% instances), X (1; 13% instances)