Treebank Statistics: UD_Apurina-UFPA: POS Tags: NOUN
There are 145 NOUN
lemmas (48%), 170 NOUN
types (46%) and 296 NOUN
tokens (30%).
Out of 16 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: awapukutxi, iãtã, awinhi, sytu, tiwitxi, ximaky, awiri, keku, maky, ywãtãa
The 10 most frequent NOUN
types: iãtã, awinhi, ximaky, awiri, aapuku, maky, yky, ywãtãa, kyky, sytu
The 10 most frequent ambiguous lemmas: awinhi (NOUN 8, VERB 2), apiku (NOUN 3, ADV 1), nhipukury (NOUN 3, VERB 1), _ (PUNCT 9, ADV 1, NOUN 1, PROPN 1, VERB 1), nhipuku (NOUN 1, VERB 1), were (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: awinhi (NOUN 8, VERB 2), nhipukury (NOUN 4, VERB 2), apikumunhi (NOUN 3, ADV 1), awinhinã (NOUN 1, VERB 1)
- awinhi
- nhipukury
- apikumunhi
- awinhinã
Morphology
The form / lemma ratio of NOUN
is 1.172414 (the average of all parts of speech is 1.222951).
The 1st highest number of forms (5) was observed with the lemma “awapukutxi”: aapuku, aapukumunhi, aapukutxi, aapukutxiã, ũaapuku.
The 2nd highest number of forms (5) was observed with the lemma “keku”: keku, kekutxi, ukeku, ukieku, ykeku.
The 3rd highest number of forms (3) was observed with the lemma “kikiu”: Ykikiute, kikiu, kikiute.
NOUN
occurs with 12 features: Gender (144; 49% instances), Case (127; 43% instances), Possessed (120; 41% instances), Number (107; 36% instances), Number[psor] (48; 16% instances), Person[psor] (36; 12% instances), Gender[psor] (31; 10% instances), Gender[subj] (2; 1% instances), Number[subj] (2; 1% instances), Person[subj] (2; 1% instances), VerbForm (2; 1% instances), VerbType (2; 1% instances)
NOUN
occurs with 22 feature-value pairs: Case=Com
, Case=Dat
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender[psor]=Fem
, Gender[psor]=Masc
, Gender[subj]=Masc
, Number=Plur
, Number=Sing
, Number[psor]=Plur
, Number[psor]=Sing
, Number[subj]=Sing
, Person[psor]=1
, Person[psor]=2
, Person[psor]=3
, Person[subj]=3
, Possessed=No
, Possessed=Yes
, VerbForm=Vnoun
, VerbType=Vido
NOUN
occurs with 55 feature combinations.
The most frequent feature combination is _
(124 tokens).
Examples: iãtã, awinhi, ywãtãa, sytu, ũimiakury, kumyry, ũtanyry, aiku, atãkary, iumyary
Relations
NOUN
nodes are attached to their parents using 19 different relations: nsubj (85; 29% instances), obj (83; 28% instances), conj (24; 8% instances), obl:lmod (24; 8% instances), nmod (20; 7% instances), obl (15; 5% instances), root (10; 3% instances), nsubj:cop (9; 3% instances), obl:tmod (8; 3% instances), nmod:poss (6; 2% instances), compound (3; 1% instances), list (2; 1% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), dislocated (1; 0% instances), obj:agent (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Parents of NOUN
nodes belong to 7 different parts of speech: VERB (213; 72% instances), NOUN (53; 18% instances), (10; 3% instances), PRON (8; 3% instances), ADV (7; 2% instances), ADJ (3; 1% instances), PROPN (2; 1% instances)
158 (53%) NOUN
nodes are leaves.
104 (35%) NOUN
nodes have one child.
25 (8%) NOUN
nodes have two children.
9 (3%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 5.
Children of NOUN
nodes are attached using 19 different relations: det (36; 19% instances), punct (34; 18% instances), nmod (26; 14% instances), conj (20; 11% instances), nummod (13; 7% instances), acl (8; 4% instances), acl:relcl (7; 4% instances), nmod:poss (7; 4% instances), advmod (6; 3% instances), cc (5; 3% instances), nsubj:cop (5; 3% instances), nsubj (4; 2% instances), compound (3; 2% instances), cop (3; 2% instances), aux (2; 1% instances), case (2; 1% instances), list (2; 1% instances), advcl (1; 1% instances), discourse (1; 1% instances)
Children of NOUN
nodes belong to 14 different parts of speech: NOUN (53; 29% instances), PUNCT (34; 18% instances), PRON (28; 15% instances), VERB (21; 11% instances), DET (14; 8% instances), NUM (13; 7% instances), ADV (6; 3% instances), AUX (5; 3% instances), CCONJ (4; 2% instances), ADP (2; 1% instances), PROPN (2; 1% instances), ADJ (1; 1% instances), INTJ (1; 1% instances), PART (1; 1% instances)