This page pertains to UD version 2.

Treebank Statistics: UD_Apurina-UFPA: POS Tags: NOUN

There are 152 NOUN lemmas (46%), 177 NOUN types (43%) and 320 NOUN tokens (29%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: awapukutxi, ximaky, iãtã, awinhi, sytu, tiwitxi, awiri, keku, kumyrype, maky

The 10 most frequent NOUN types: ximaky, iãtã, awinhi, awiri, aapuku, kumyrype, maky, yky, ywãtãa, kumyry

The 10 most frequent ambiguous lemmas: awinhi (NOUN 8, VERB 2), apiku (NOUN 3, ADV 1), nhipukury (NOUN 3, VERB 1), _ (PUNCT 9, ADV 1, NOUN 1, PROPN 1), nhipuku (NOUN 1, VERB 1), were (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: awinhi (NOUN 8, VERB 2), nhipukury (NOUN 4, VERB 2), apikumunhi (NOUN 3, ADV 1), awinhinã (NOUN 1, VERB 1), nere (NOUN 1, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.164474 (the average of all parts of speech is 1.264438).
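The reported ratio matches the number of NOUN types divided by the number of NOUN lemmas given earlier on this page (177 / 152). The following is a minimal sketch of that calculation, not the script that generated this page. It assumes that a "form" is a distinct, case-sensitive word form; the function name form_lemma_ratio and the toy input are illustrative only.

```python
def form_lemma_ratio(pairs):
    """Return (#distinct forms) / (#distinct lemmas) for (form, lemma) pairs.

    Assumption: forms are compared as-is (case-sensitive), as suggested by
    the separate listing of "Ykikiute" and "kikiute" on this page.
    """
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    if not lemmas:
        raise ValueError("no tokens given")
    return len(forms) / len(lemmas)


# Toy input: a few forms this page lists for the lemmas "keku" and "kikiu".
# The repeated ("kikiu", "kikiu") pair shows that token frequency is ignored.
toy = [
    ("keku", "keku"), ("kekutxi", "keku"), ("ukeku", "keku"),
    ("kikiu", "kikiu"), ("kikiute", "kikiu"), ("kikiu", "kikiu"),
]
toy_ratio = form_lemma_ratio(toy)  # 5 distinct forms over 2 lemmas

# Cross-check against the totals reported on this page.
reported_ratio = round(177 / 152, 6)
```

The same function applied to every (form, lemma) pair in the treebank, regardless of tag, would presumably give the all-tags average quoted above, although the exact averaging method is not stated on this page.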

The 1st highest number of forms (5) was observed with the lemma “awapukutxi”: aapuku, aapukumunhi, aapukutxi, aapukutxiã, ũaapuku.

The 2nd highest number of forms (5) was observed with the lemma “keku”: keku, kekutxi, ukeku, ukieku, ykeku.

The 3rd highest number of forms (3) was observed with the lemma “kikiu”: Ykikiute, kikiu, kikiute.

NOUN occurs with 12 features: Gender (162; 51% instances), Case (145; 45% instances), Possessed (135; 42% instances), Number (123; 38% instances), Number[psor] (50; 16% instances), Person[psor] (38; 12% instances), Gender[psor] (33; 10% instances), Gender[subj] (3; 1% instances), Number[subj] (3; 1% instances), Person[subj] (3; 1% instances), VerbType (3; 1% instances), VerbForm (2; 1% instances)

NOUN occurs with 22 feature-value pairs: Case=Com, Case=Dat, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender[psor]=Fem, Gender[psor]=Masc, Gender[subj]=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=3, Possessed=No, Possessed=Yes, VerbForm=Vnoun, VerbType=Vido

NOUN occurs with 59 feature combinations. The most frequent feature combination is _ (127 tokens). Examples: iãtã, awinhi, ywãtãa, sytu, ũimiakury, kumyry, kumyrype, ũtanyry, aiku, atãkary

Relations

NOUN nodes are attached to their parents using 19 different relations: obj (100; 31% instances), nsubj (87; 27% instances), obl (31; 10% instances), conj (24; 8% instances), nmod (20; 6% instances), obl:lmod (15; 5% instances), root (11; 3% instances), nsubj:cop (9; 3% instances), nmod:poss (6; 2% instances), compound (4; 1% instances), obl:tmod (4; 1% instances), list (2; 1% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), dislocated (1; 0% instances), obj:agent (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
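Counts of this kind can be reproduced from the treebank's CoNLL-U file by tallying the DEPREL column for every word whose UPOS is NOUN. The sketch below assumes the standard ten-column, tab-separated CoNLL-U layout; the function name deprel_counts and the placeholder sentence (w1, w2, w3) are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def deprel_counts(conllu_text, upos="NOUN"):
    """Count DEPREL values of all words carrying the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment line
        cols = line.split("\t")
        # Keep only regular word lines: skip multiword-token ranges ("1-2")
        # and empty nodes ("1.1"), whose IDs are not plain integers.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:       # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts


# Placeholder sentence with invented tokens, for illustration only.
sample = (
    "# text = placeholder\n"
    "1\tw1\tl1\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tw2\tl2\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tw3\tl3\tNOUN\t_\t_\t2\tobj\t_\t_\n"
)
sample_counts = deprel_counts(sample)

# The percentages on this page are shares of the 320 NOUN tokens,
# e.g. obj: 100 of 320.
obj_share = round(100 * 100 / 320)
```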

Parents of NOUN nodes fall into 7 categories (6 parts of speech, plus root nodes, which have no parent): VERB (235; 73% instances), NOUN (54; 17% instances), no parent (11; 3% instances), PRON (8; 3% instances), ADV (7; 2% instances), ADJ (3; 1% instances), PROPN (2; 1% instances)

177 (55%) NOUN nodes are leaves.

108 (34%) NOUN nodes have one child.

26 (8%) NOUN nodes have two children.

9 (3%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.
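The child-degree figures above sum to the 320 NOUN tokens (177 + 108 + 26 + 9). The following is a minimal sketch of how such a distribution could be computed, assuming each sentence is available as a list of (id, head, upos) triples with head 0 marking the root. The function name child_degree_distribution and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_distribution(sentences, upos="NOUN"):
    """Map child count -> number of nodes with that UPOS having that many children."""
    degrees = Counter()
    for sent in sentences:
        # Number of children per node = how often its id appears as a head.
        children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag == upos:
                degrees[children[tok_id]] += 1  # missing key counts as 0
    return degrees


# Toy sentence (invented structure): token 2 is a NOUN with one child,
# token 4 is a NOUN with no children (a leaf).
toy_sentence = [
    (1, 2, "DET"),
    (2, 3, "NOUN"),
    (3, 0, "VERB"),
    (4, 3, "NOUN"),
    (5, 3, "PUNCT"),
]
toy_degrees = child_degree_distribution([toy_sentence])

# Cross-check of the figures reported on this page.
reported = {"leaves": 177, "one": 108, "two": 26, "three_plus": 9}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
```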

Children of NOUN nodes are attached using 20 different relations: det (36; 19% instances), punct (35; 18% instances), nmod (26; 14% instances), conj (21; 11% instances), nummod (13; 7% instances), acl (8; 4% instances), acl:relcl (7; 4% instances), nmod:poss (7; 4% instances), advmod (6; 3% instances), compound (5; 3% instances), nsubj:cop (5; 3% instances), aux (4; 2% instances), nsubj (4; 2% instances), cc (3; 2% instances), cop (3; 2% instances), advcl (2; 1% instances), case (2; 1% instances), list (2; 1% instances), amod (1; 1% instances), discourse (1; 1% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (54; 28% instances), PUNCT (35; 18% instances), PRON (28; 15% instances), VERB (24; 13% instances), DET (14; 7% instances), NUM (13; 7% instances), AUX (7; 4% instances), ADV (6; 3% instances), ADJ (2; 1% instances), ADP (2; 1% instances), CCONJ (2; 1% instances), PROPN (2; 1% instances), INTJ (1; 1% instances), PART (1; 1% instances)