home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Xavante-XDT: POS Tags: NOUN

There are 16 NOUN lemmas (26%), 22 NOUN types (29%) and 30 NOUN tokens (25%). Out of 10 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: _, wahiʔrata, ö, aiʔuté, höjmanazé, mama, marĩ, pire, potoʔwa, ro

The 10 most frequent NOUN types: wahiʔrata, aiʔuté, Ö, ĩʔrãzani, Warazu, Wasi, abazi, dañoʔre, höjmanazé, ipire

The 10 most frequent ambiguous lemmas: _ (NOUN 10, PART 9, VERB 5, ADP 4, PRON 4, ADV 2, AUX 2), wa (PRON 5, NOUN 1)

The 10 most frequent ambiguous types: wa (PRON 6, NOUN 1, PART 1)

Morphology

The form / lemma ratio of NOUN is 1.375000 (the average of all parts of speech is 1.262295).

The 1st highest number of forms (9) was observed with the lemma “_”: abazi, aiʔuté, dañoʔre, pese, tiʔa, wahiʔrata, ñimiromhuri, ĩʔrãzani, ʔwatébrémi.

The 2nd highest number of forms (1) was observed with the lemma “aiʔuté”: Aiʔuté.

The 3rd highest number of forms (1) was observed with the lemma “höjmanazé”: höjmanazé.

NOUN occurs with 3 features: Nmzr (3; 10% instances), Person (2; 7% instances), Number (1; 3% instances)

NOUN occurs with 4 feature-value pairs: Nmzr=Yes, Number=Plur, Person=1, Person=3

NOUN occurs with 4 feature combinations. The most frequent feature combination is _ (26 tokens). Examples: wahiʔrata, aiʔuté, Ö, Warazu, Wasi, abazi, dañoʔre, höjmanazé, marĩ, pese

Relations

NOUN nodes are attached to their parents using 6 different relations: obj (12; 40% instances), nsubj (8; 27% instances), obl (4; 13% instances), root (4; 13% instances), dep (1; 3% instances), nmod (1; 3% instances)

Parents of NOUN nodes belong to 4 different parts of speech: VERB (19; 63% instances), NOUN (6; 20% instances), (4; 13% instances), ADV (1; 3% instances)

14 (47%) NOUN nodes are leaves.

7 (23%) NOUN nodes have one child.

7 (23%) NOUN nodes have two children.

2 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 7.

Children of NOUN nodes are attached using 9 different relations: dep (10; 32% instances), discourse (5; 16% instances), nmod (4; 13% instances), case (3; 10% instances), nsubj (3; 10% instances), det (2; 6% instances), obj (2; 6% instances), advmod (1; 3% instances), punct (1; 3% instances)

Children of NOUN nodes belong to 7 different parts of speech: PART (19; 61% instances), NOUN (6; 19% instances), DET (2; 6% instances), ADP (1; 3% instances), PRON (1; 3% instances), PROPN (1; 3% instances), PUNCT (1; 3% instances)