
This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: PART

There are 15 PART lemmas (1%), 17 PART types (1%) and 34 PART tokens (0%). Out of 16 observed tags, the rank of PART is: 10 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent PART lemmas: dy, _, ma, rema, kae, uw, bokwa, bokware, ca, dyji

The 10 most frequent PART types: dy, ma, rema, U, Uw, bokware, ca, kaegae, ty, Uwy

The 10 most frequent ambiguous lemmas: dy (X 12, PART 9), _ (NOUN 201, VERB 142, ADV 84, PUNCT 64, X 56, ADP 44, PRON 42, PROPN 36, DET 10, PART 6, SCONJ 6, CCONJ 2, ADJ 1), ma (SCONJ 4, PART 3, CCONJ 1, X 1), rema (AUX 5, PART 3, CCONJ 2, NOUN 1), kae (ADP 38, PART 2, ADV 1, NOUN 1), uw (INTJ 7, ADV 2, PART 2, X 1), bokwa (ADV 3, VERB 2, NOUN 1, PART 1, X 1), bokware (AUX 10, PART 1), ca (ADV 6, INTJ 6, PART 1, X 1), dyji (X 14, ADP 3, PART 1, SCONJ 1)

The 10 most frequent ambiguous types: dy (PART 8, X 8), ma (SCONJ 4, PART 3, CCONJ 2, X 1), rema (AUX 5, CCONJ 3, PART 3), U (INTJ 3, PART 2), Uw (INTJ 7, ADV 2, PART 2, X 1), bokware (AUX 10, PART 2, VERB 1, X 1), ca (ADV 3, INTJ 2, PART 1, X 1), ty (ADV 24, PART 2, X 1), Uwy (ADV 2, PART 1), dyji (X 15, ADP 4, PART 1, SCONJ 1)

Morphology

The form / lemma ratio of PART is 1.133333 (the average of all parts of speech is 1.661916).
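The reported ratio is consistent with dividing the 17 PART types by the 15 PART lemmas given above. A minimal sketch of that arithmetic, assuming this is how the ratio is defined (the function name `form_lemma_ratio` is hypothetical and not part of any UD tool):

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Distinct word forms (types) per distinct lemma.

    Assumption: the ratio on this page is types / lemmas.
    """
    if n_lemmas == 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts taken from this page: 17 PART types, 15 PART lemmas.
print(f"{form_lemma_ratio(17, 15):.6f}")
```

A value close to 1 means that most PART lemmas occur in a single form.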

The 1st highest number of forms (5) was observed with the lemma “_”: U, boroie, ca, ty, ure.

The 2nd highest number of forms (2) was observed with the lemma “dy”: dy, ty.

The 3rd highest number of forms (1) was observed with the lemma “bokwa”: bokware.

PART occurs with 8 features: Polarity (5; 15% instances), Mood (3; 9% instances), Foc (2; 6% instances), Int (2; 6% instances), Nomzr (2; 6% instances), Speech (2; 6% instances), Number (1; 3% instances), Person (1; 3% instances)

PART occurs with 8 feature-value pairs: Foc=Yes, Int=Yes, Mood=Ind, Nomzr=Rel, Number=Plur, Person=3, Polarity=Neg, Speech=Ind

PART occurs with 8 feature combinations. The most frequent feature combination is _ (24 tokens). Examples: dy, ma, rema, U, Uw, ca, ty, Uwy, boroie, dyji

Relations

PART nodes are attached to their parents using 8 different relations: dep (15; 44% instances), mark (6; 18% instances), ccomp (4; 12% instances), discourse (3; 9% instances), advmod (2; 6% instances), root (2; 6% instances), cc (1; 3% instances), parataxis (1; 3% instances)
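The percentages above follow from the raw counts divided by the 34 PART tokens. A short sketch that reproduces them, assuming rounding to the nearest whole percent (the names `PART_RELATIONS` and `relation_percentages` are illustrative only):

```python
from collections import Counter

# Relation counts for PART nodes, copied from the line above.
PART_RELATIONS = Counter({
    "dep": 15, "mark": 6, "ccomp": 4, "discourse": 3,
    "advmod": 2, "root": 2, "cc": 1, "parataxis": 1,
})


def relation_percentages(counts: Counter) -> dict:
    """Map each relation to its share of all instances, in whole percent."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.most_common()}


for rel, pct in relation_percentages(PART_RELATIONS).items():
    print(f"{rel}: {PART_RELATIONS[rel]}; {pct}% instances")
```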

Parents of PART nodes belong to 5 different parts of speech: VERB (22; 65% instances), NOUN (7; 21% instances), PRON (2; 6% instances), no part of speech, i.e. the artificial root (2; 6% instances), ADV (1; 3% instances)

27 (79%) PART nodes are leaves.

7 (21%) PART nodes have one child.

The highest child degree of a PART node is 1.
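Leaf and child-degree counts like these can be derived from the HEAD column of a CoNLL-U file. The following is a rough sketch under stated assumptions, not the script that generated this page: `child_degrees` is a hypothetical helper, and `TOY` is an invented placeholder sentence, not data from UD_Bororo-BDT.

```python
from collections import Counter


def child_degrees(conllu_text: str, upos: str = "PART") -> Counter:
    """Count how many nodes with the given UPOS have 0, 1, 2, ... children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if not cols[0].isdigit():
                continue  # skip multiword ranges (1-2) and empty nodes (1.1)
            rows.append(cols)
        # Column 7 (index 6) is HEAD: count how often each ID is a head.
        children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:  # column 4 (index 3) is UPOS
                degrees[children[cols[0]]] += 1
    return degrees


# Invented toy sentence with placeholder forms (not real treebank data).
TOY = "\n".join([
    "# text = placeholder",
    "1\ta\ta\tVERB\t_\t_\t0\troot\t_\t_",
    "2\tb\tb\tPART\t_\t_\t1\tdep\t_\t_",
    "3\tc\tc\tPART\t_\t_\t1\tccomp\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])

degrees = child_degrees(TOY)
print("leaves:", degrees[0], "max child degree:", max(degrees))
```

In the toy sentence one PART node is a leaf and the other has a single punctuation child, mirroring the kind of pattern reported above.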

Children of PART nodes are attached using 2 different relations: punct (4; 57% instances), nsubj (3; 43% instances)

Children of PART nodes belong to 3 different parts of speech: PUNCT (4; 57% instances), PRON (2; 29% instances), NOUN (1; 14% instances)