Treebank Statistics: UD_Xavante-XDT: POS Tags: NOUN
There are 109 NOUN lemmas (38%), 147 NOUN types (40%) and 344 NOUN tokens (22%).
Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.
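Counts of this kind can be derived from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (FORM, LEMMA and UPOS in columns 2 to 4). The rows are a toy illustration built from forms listed on this page, not actual treebank sentences, and the function name `upos_counts` is hypothetical. Treating types as case-sensitive is also an assumption.

```python
from collections import defaultdict

# Toy CoNLL-U rows (illustrative only, not real treebank sentences).
ROWS = [
    ["1", "timama", "mama", "NOUN", "_", "_", "3", "nsubj", "_", "_"],
    ["2", "wamama", "mama", "NOUN", "_", "_", "3", "obj", "_", "_"],
    ["3", "romhuri", "romhuri", "VERB", "_", "_", "0", "root", "_", "_"],
    ["4", "timama", "mama", "NOUN", "_", "_", "3", "obl", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


counts = upos_counts(SAMPLE)
print(counts["NOUN"])  # (lemmas, types, tokens) for the toy rows
```

Ranking the tags then amounts to sorting the per-tag tuples by the component of interest.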
The 10 most frequent NOUN lemmas: marĩ, pi’õ, aibö, ‘watébrémi, a’uwẽ, wapté, buru, ba’õtõre, mama, höimanadzé
The 10 most frequent NOUN types: marĩ, aibö, ‘watébrémi, pi’õ, a’uwẽ, buru, wapté, ba’õtõ, Mare, bötö
The 10 most frequent ambiguous lemmas: na (ADP 31, NOUN 8, X 1), mreme (NOUN 4, VERB 1), romhuri (VERB 30, NOUN 2), wẽ (NOUN 2, VERB 2, ADV 1), hö (NOUN 1, X 1), höiwahö (ADV 2, NOUN 1), mro (NOUN 1, VERB 1), rowatsu’u (VERB 3, NOUN 1), to (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: tete (NOUN 2, AUX 1), Höiwahö (ADV 2, NOUN 1), hö (NOUN 1, X 1), romhuri (VERB 16, NOUN 1), wẽ (VERB 2, ADV 1, NOUN 1)
- tete
- Höiwahö
- hö
- romhuri
- wẽ
  - VERB 2: Ö wa ĩĩsima wẽ
  - ADV 1: Romhuri aba ! Romhuri dza’ra wa’aba ! Aiwa’a tõ ! Atsõhui’wa wa’wa ma wi romhuri dza’ra wa’aba wẽ da .
  - NOUN 1: Taha parimhã rowahutu’wa te dama tinha : “ Ãne wamhã , ma’ãpé , ai’repudu , awa’awi hã , wama ãma ĩtsahu na marĩ na te aima ĩrowatsu’u dza’ra aba na ahömhö hã , wẽ uptabi na wa te waihu’u dza’ra da ! “
Morphology
The form / lemma ratio of NOUN is 1.348624 (the average of all parts of speech is 1.291228).
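The NOUN ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas reported above. A short check, assuming that is how the figure was obtained:

```python
noun_types = 147   # NOUN types reported on this page
noun_lemmas = 109  # NOUN lemmas reported on this page

ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.348624
```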
The 1st highest number of forms (5) was observed with the lemma “mama”: Aimama, timama, wamama, ĩmama, ĩĩmama.
The 2nd highest number of forms (5) was observed with the lemma “tsa”: datsa, tsa, watsa, watsai, ĩtsa.
The 3rd highest number of forms (4) was observed with the lemma “’ra”: ‘ra, ti’ra, wa’ra, ĩ’ra.
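The ranking above can be reproduced by grouping distinct forms under their lemma and sorting by group size. This sketch uses only the forms listed in the three lines above. Breaking ties alphabetically by lemma is an assumption, since the page does not state its tie-breaking rule.

```python
# Distinct forms per lemma, copied from the three lines above.
forms_by_lemma = {
    "mama": {"Aimama", "timama", "wamama", "ĩmama", "ĩĩmama"},
    "tsa": {"datsa", "tsa", "watsa", "watsai", "ĩtsa"},
    "’ra": {"‘ra", "ti’ra", "wa’ra", "ĩ’ra"},
}

# Sort by descending number of forms, then by lemma (assumed tie-break).
ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))

for rank, (lemma, forms) in enumerate(ranked, start=1):
    print(rank, lemma, len(forms), sorted(forms))
```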
NOUN occurs with 7 features: Person (67; 19% instances), Number (19; 6% instances), Gnq (6; 2% instances), Reflex (5; 1% instances), Degree (3; 1% instances), Case (1; 0% instances), Polarity (1; 0% instances)
NOUN occurs with 10 feature-value pairs: Case=Ins, Degree=Dim, Gnq=Yes, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Reflex=Yes
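In CoNLL-U, feature-value pairs are stored in the FEATS column as `Name=Value` items joined by `|`, with `_` standing for an empty feature set. The sketch below shows one way to split such a string. The helper name `parse_feats` is hypothetical, and the example string combines pairs from the list above purely for illustration; it is not claimed to be an attested combination in the treebank.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict; "_" means no features."""
    if not feats or feats == "_":
        return {}
    # Each item has the shape Name=Value; items are separated by "|".
    return dict(item.split("=", 1) for item in feats.split("|"))


print(parse_feats("Number=Sing|Person=3"))  # illustrative combination
print(parse_feats("_"))                     # empty feature set
```

Counting distinct keys over all NOUN tokens would give the number of features, counting distinct items the number of feature-value pairs, and counting distinct whole strings the number of feature combinations.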
NOUN occurs with 13 feature combinations.
The most frequent feature combination is _ (266 tokens).
Examples: marĩ, aibö, ‘watébrémi, pi’õ, a’uwẽ, buru, wapté, ba’õtõ, Mare, bötö
Relations
NOUN nodes are attached to their parents using 14 different relations: nsubj (104; 30% instances), obj (65; 19% instances), obl (57; 17% instances), nmod (37; 11% instances), root (22; 6% instances), dislocated (19; 6% instances), parataxis (14; 4% instances), vocative (8; 2% instances), advcl (7; 2% instances), conj (6; 2% instances), iobj (2; 1% instances), acl (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances)
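The relation counts sum to the 344 NOUN tokens, and the percentages follow from dividing each count by that total. A small check, assuming the page rounds to the nearest whole percent:

```python
# Counts of incoming relations for NOUN nodes, copied from the line above.
parent_relations = {
    "nsubj": 104, "obj": 65, "obl": 57, "nmod": 37, "root": 22,
    "dislocated": 19, "parataxis": 14, "vocative": 8, "advcl": 7,
    "conj": 6, "iobj": 2, "acl": 1, "case": 1, "ccomp": 1,
}

total = sum(parent_relations.values())
# Whole-percent share of each relation (rounding rule is an assumption).
percent = {rel: round(100 * n / total) for rel, n in parent_relations.items()}

print(total)
print(percent["nsubj"], percent["obj"], percent["iobj"], percent["acl"])
```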
Parents of NOUN nodes belong to 5 different parts of speech: VERB (256; 74% instances), NOUN (63; 18% instances), [no parent: root] (22; 6% instances), ADV (2; 1% instances), ADP (1; 0% instances)
142 (41%) NOUN nodes are leaves.
112 (33%) NOUN nodes have one child.
55 (16%) NOUN nodes have two children.
35 (10%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 7.
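The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes it for a toy tree given as (ID, HEAD, UPOS) triples. The tree is invented for illustration and the function name `child_degrees` is hypothetical.

```python
from collections import Counter

# Toy dependency tree as (ID, HEAD, UPOS); HEAD 0 marks the root.
# Illustrative only, not a sentence from the treebank.
toy_tree = [
    (1, 2, "DET"),
    (2, 4, "NOUN"),
    (3, 2, "ADP"),
    (4, 0, "VERB"),
    (5, 4, "NOUN"),
]


def child_degrees(tokens, upos="NOUN"):
    """Return the number of children of every node tagged `upos`."""
    n_children = Counter(head for _, head, _ in tokens)
    # Counter returns 0 for nodes that never occur as a head (leaves).
    return [n_children[tok_id] for tok_id, _, tag in tokens if tag == upos]


degrees = child_degrees(toy_tree)
print(degrees)           # one degree per NOUN node
print(Counter(degrees))  # histogram: degree -> number of NOUN nodes
```

Applied to the whole treebank, the histogram would yield the leaf, one-child, two-child and three-or-more buckets listed above, which together account for all 344 NOUN tokens (142 + 112 + 55 + 35).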
Children of NOUN nodes are attached using 17 different relations: case (79; 22% instances), det (61; 17% instances), dep (57; 16% instances), nmod (41; 12% instances), punct (39; 11% instances), nsubj (13; 4% instances), advmod (12; 3% instances), discourse (12; 3% instances), parataxis (9; 3% instances), mark (8; 2% instances), conj (6; 2% instances), obl (5; 1% instances), advcl (3; 1% instances), dislocated (3; 1% instances), nummod (3; 1% instances), acl (1; 0% instances), obj (1; 0% instances)
Children of NOUN nodes belong to 12 different parts of speech: ADP (82; 23% instances), NOUN (63; 18% instances), DET (61; 17% instances), PART (51; 14% instances), PUNCT (39; 11% instances), X (18; 5% instances), ADV (12; 3% instances), SCONJ (9; 3% instances), VERB (9; 3% instances), PRON (5; 1% instances), NUM (3; 1% instances), INTJ (1; 0% instances)