
This page pertains to UD version 2.

Treebank Statistics: UD_Tagalog-TRG: POS Tags: PART

There are 6 PART lemmas (3%), 7 PART types (3%) and 19 PART tokens (3%). Out of 13 observed tags, the rank of PART is: 8 in number of lemmas, 8 in number of types and 9 in number of tokens.

All 6 PART lemmas, most frequent first: hindi, ba, daw, ano, kaya, sana

All 7 PART types, most frequent first: hindi, daw, ba, ano, bang, kaya, sana

The 2 ambiguous lemmas, most frequent first: hindi (PART 6, INTJ 1), ano (VERB 2, ADJ 1, PART 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PART is 1.166667 (the average of all parts of speech is 1.247253).
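The ratio above can be reproduced from the lists on this page. The sketch below assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, which matches the reported figures (7 types over 6 lemmas). The name `form_lemma_ratio` and the dictionary layout are illustrative, not part of any UD tool.

```python
# Forms observed for each PART lemma, as listed on this page:
# "ba" has two forms (ba, bang); every other lemma has one.
forms_by_lemma = {
    "hindi": {"hindi"},
    "ba": {"ba", "bang"},
    "daw": {"daw"},
    "ano": {"ano"},
    "kaya": {"kaya"},
    "sana": {"sana"},
}


def form_lemma_ratio(forms_by_lemma):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


ratio = form_lemma_ratio(forms_by_lemma)
print(f"{ratio:.6f}")  # 1.166667
```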

The highest number of forms (2) was observed with the lemma “ba”: ba, bang.

The 2nd highest number of forms (1) was observed with the lemma “ano”: ano.

The 3rd highest number of forms (1) was observed with the lemma “daw”: daw.

PART occurs with 3 features: PartType (13; 68% instances), Polarity (6; 32% instances), Link (1; 5% instances)
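As a quick check on the percentages, each feature count can be divided by the 19 PART tokens reported above. This is a minimal sketch that assumes the percentages are rounded to the nearest integer; `share` is a hypothetical helper name.

```python
PART_TOKENS = 19  # total PART tokens reported on this page

# Number of PART tokens carrying each feature, as reported above.
feature_counts = {"PartType": 13, "Polarity": 6, "Link": 1}


def share(count, total):
    """Percentage of `total` covered by `count`, rounded to an integer."""
    return round(100 * count / total)


shares = {feat: share(n, PART_TOKENS) for feat, n in feature_counts.items()}
print(shares)  # {'PartType': 68, 'Polarity': 32, 'Link': 5}
```

The three counts sum to 20 while there are only 19 PART tokens, so if each count is a number of tokens, at least one token must carry more than one of these features.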

PART occurs with 5 feature-value pairs: Link=Yes, PartType=Des, PartType=Int, PartType=Nfh, Polarity=Neg

PART occurs with 5 feature combinations. The most frequent feature combination is Polarity=Neg (6 tokens). Examples: hindi

Relations

PART nodes are attached to their parents using 1 relation: advmod (19; 100% instances)

Parents of PART nodes belong to 2 different parts of speech: VERB (16; 84% instances), ADJ (3; 16% instances)
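Counts like these could be gathered from a treebank's CoNLL-U file roughly as follows. This is a sketch only: it relies on the standard ten-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC), the function name `part_attachment_stats` is hypothetical, and the sample rows use placeholder words rather than sentences from UD_Tagalog-TRG.

```python
from collections import Counter


def part_attachment_stats(conllu_text, upos="PART"):
    """Count relations and parent UPOS tags for nodes with the given UPOS."""
    deprels = Counter()
    parent_upos = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = {}
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if not cols[0].isdigit():
                continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
            rows[cols[0]] = cols
        for cols in rows.values():
            if cols[3] != upos:
                continue
            deprels[cols[7]] += 1
            head = cols[6]
            # HEAD 0 is the artificial root, which has no UPOS tag.
            parent_upos[rows[head][3] if head in rows else "ROOT"] += 1
    return deprels, parent_upos


# Placeholder sentence (not real treebank data): two PART nodes,
# one attached to a VERB and one to an ADJ, both via advmod.
sample_rows = [
    ["1", "w1", "l1", "PART", "_", "_", "2", "advmod", "_", "_"],
    ["2", "w2", "l2", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "w3", "l3", "PART", "_", "_", "4", "advmod", "_", "_"],
    ["4", "w4", "l4", "ADJ", "_", "_", "2", "xcomp", "_", "_"],
]
sample = "\n".join("\t".join(row) for row in sample_rows) + "\n"

deprels, parents = part_attachment_stats(sample)
print(dict(deprels))
print(dict(parents))
```

Run over the full treebank file, the first counter would correspond to the relation list and the second to the parent part-of-speech list reported above.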

18 (95%) PART nodes are leaves.

1 (5%) PART node has one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 relation: punct (1; 100% instances)

Children of PART nodes belong to 1 part of speech: PUNCT (1; 100% instances)