
This page pertains to UD version 2.

Treebank Statistics: UD_Xavante-XDT: POS Tags: PART

There are 32 PART lemmas (7%), 35 PART types (6%) and 322 PART tokens (14%). Out of 15 observed tags, the rank of PART is: 6 in number of lemmas, 6 in number of types and 2 in number of tokens.
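
Counts of this kind can be derived from a CoNLL-U file in a few lines. The following is a minimal sketch, not the script that generated this page. It assumes the standard ten-column CoNLL-U layout (FORM in column 2, LEMMA in column 3, UPOS in column 4) and case-sensitive counting. The sample rows are a made-up toy input rather than sentences from UD_Xavante-XDT, and the function name `pos_counts` is hypothetical.

```python
from collections import Counter

# Toy CoNLL-U fragment (illustrative only, NOT taken from the treebank).
SAMPLE = "\n".join([
    "# text = toy example",
    "1\thã\thã\tPART\t_\t_\t2\tdep\t_\t_",
    "2\ttoy\ttoy\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tnorĩhã\tnorĩ\tPART\t_\t_\t2\tdep\t_\t_",
    "4\tnorĩ\tnorĩ\tPART\t_\t_\t2\tdep\t_\t_",
    "",
])

def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms = Counter(), Counter()
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms[cols[1]] += 1
            lemmas[cols[2]] += 1
    return len(lemmas), len(forms), sum(forms.values())

n_lemmas, n_types, n_tokens = pos_counts(SAMPLE, "PART")
print(n_lemmas, n_types, n_tokens)
```

On the toy input, the two forms norĩ and norĩhã share one lemma, which is why the number of types can exceed the number of lemmas.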

The 10 most frequent PART lemmas: za’ra, hã, e, ma, norĩ, za, õ, aba, tô, _

The 10 most frequent PART types: hã, za’ra, e, ma, za, õ, norĩ, tô, aba, norĩhã

The 10 most frequent ambiguous lemmas: (DET 61, PART 49, PRON 4), ma (PART 32, ADP 20, AUX 12, PRON 2, SCONJ 2), norĩ (PART 27, NOUN 7, X 3, PRON 2), õ (PART 22, DET 6, ADV 4, PRON 1), (PART 11, ADV 1), _ (PART 7, PUNCT 6, ADV 4, NOUN 4, VERB 4, PRON 2), ‘re (PART 6, ADP 3, X 2), te (AUX 188, PART 5, ADP 3, PRON 1, X 1), zaʔra (PART 4, X 2), re (PART 3, ADP 1)

The 10 most frequent ambiguous types: (DET 61, PART 51, PRON 5), ma (PART 31, ADP 13, AUX 12, PRON 2), õ (PART 22, ADV 4, DET 4), norĩ (PART 19, NOUN 7, X 3, PRON 2), (PART 11, ADV 1), ‘re (PART 8, ADP 3, X 2), te (AUX 185, PART 5, ADP 3, PRON 1, X 1), zaʔra (PART 4, X 2), (PART 2, ADV 1), di (X 23, AUX 19, PART 2)

Morphology

The form / lemma ratio of PART is 1.093750 (the average of all parts of speech is 1.229787).
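
The reported ratio is consistent with dividing the number of PART types by the number of PART lemmas given earlier on this page (35 and 32). A short check, assuming that this is how the figure is computed; the variable names are illustrative:

```python
part_types = 35   # distinct PART word forms reported on this page
part_lemmas = 32  # distinct PART lemmas reported on this page

# Form / lemma ratio: average number of distinct forms per lemma.
ratio = part_types / part_lemmas
print(f"{ratio:.6f}")  # 1.093750
```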

The highest number of forms (6) was observed with the lemma “_”: ‘re, dzahuré, hã, norĩ, za, za’ra.

The second-highest number of forms (2) was observed with the lemma “aba”: aba, wa’aba.

The third-highest number of forms (2) was observed with the lemma “norĩ”: norĩ, norĩhã.

PART occurs with 12 features: Number (106; 33% instances), Emph (36; 11% instances), Tense (34; 11% instances), Htp (32; 10% instances), Int (32; 10% instances), Aspect (28; 9% instances), Fact (9; 3% instances), Polarity (7; 2% instances), Degree (3; 1% instances), Person (2; 1% instances), Mood (1; 0% instances), NumType (1; 0% instances)

PART occurs with 19 feature-value pairs: Aspect=Iter, Aspect=Perf, Aspect=Prog, Aspect=Prosp, Degree=Dim, Emph=Yes, Fact=Yes, Htp=Yes, Int=Yes, Mood=Sub, NumType=Dist, Number=Coll, Number=Dual, Number=Plur, Person=1, Person=2, Polarity=Neg, Tense=Imp, Tense=Past

PART occurs with 22 feature combinations. The most frequent feature combination is _ (72 tokens). Examples: hã, õ, ‘re, e, te, bö, di, tõ, bété, ni

Relations

PART nodes are attached to their parents using 5 different relations: dep (276; 86% instances), discourse (41; 13% instances), nsubj (3; 1% instances), nmod (1; 0% instances), obj (1; 0% instances)
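
The percentages are each count's share of all 322 PART tokens, rounded to a whole number, which is why a relation seen once is shown as 0%. A small sketch of that arithmetic, using the counts listed above; the rounding method is an assumption that happens to reproduce the published figures:

```python
# Relation counts for PART nodes, copied from the list above.
relation_counts = {"dep": 276, "discourse": 41, "nsubj": 3, "nmod": 1, "obj": 1}

total = sum(relation_counts.values())

# Share of each relation, rounded to the nearest whole percent.
percent = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel}: {n} ({percent[rel]}%)")
```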

Parents of PART nodes belong to 7 different parts of speech: VERB (211; 66% instances), NOUN (75; 23% instances), PRON (19; 6% instances), ADV (11; 3% instances), DET (4; 1% instances), ADP (1; 0% instances), X (1; 0% instances)

321 (100%) PART nodes are leaves.

1 (0%) PART nodes have one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 relation: dep (1; 100% instances)

Children of PART nodes belong to 1 part of speech: X (1; 100% instances)
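
Leaf counts and child degrees follow from the HEAD column: a node's child degree is the number of other nodes in the same sentence that name it as their head. The sketch below shows the idea on a single toy sentence given as (ID, UPOS, HEAD) triples. The data is invented for illustration, and `child_degrees` is a hypothetical helper, not part of any UD tool.

```python
from collections import Counter

# Toy sentence as (ID, UPOS, HEAD) triples; illustrative only, not from the treebank.
ROWS = [(1, "PART", 2), (2, "VERB", 0), (3, "PART", 2), (4, "X", 3)]

def child_degrees(rows, upos):
    """Map each node ID carrying the given UPOS to its number of children."""
    # Count how many nodes point at each head ID within this one sentence.
    children = Counter(head for _, _, head in rows)
    return {node_id: children[node_id] for node_id, tag, _ in rows if tag == upos}

degrees = child_degrees(ROWS, "PART")
leaves = sum(1 for d in degrees.values() if d == 0)  # nodes with no children
max_degree = max(degrees.values())                   # highest child degree
print(degrees, leaves, max_degree)
```

For a whole treebank, the same computation would be run per sentence and the results aggregated, since node IDs restart at 1 in every sentence.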