
This page pertains to UD version 2.

Treebank Statistics: UD_Tagalog-TRG: POS Tags: NOUN

There are 63 NOUN lemmas (35%), 66 NOUN types (29%) and 159 NOUN tokens (22%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: bata, pagkain, babae, nanay, libro, titser, bahay, bangka, banko, bigas

The 10 most frequent NOUN types: bata, pagkain, babae, nanay, libro, titser, bahay, bangka, banko, bigas

The 10 most frequent ambiguous lemmas: sulat (VERB 5, NOUN 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.047619 (the average of all parts of speech is 1.247253).
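The reported ratio matches the number of NOUN types divided by the number of NOUN lemmas given at the top of this page. That reading is inferred from the figures here, not taken from the documentation of the statistics tool, and the variable names below are illustrative only.

```python
# Counts reported on this page for NOUN in UD_Tagalog-TRG.
noun_types = 66   # distinct NOUN word forms
noun_lemmas = 63  # distinct NOUN lemmas

# Assumption: "form / lemma ratio" means types divided by lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # prints 1.047619
```

A ratio this close to 1 means most NOUN lemmas appear in a single form; the only extra forms listed below are the linker-suffixed variants such as batang.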

The 1st highest number of forms (2) was observed with the lemma “bata”: bata, batang.

The 2nd highest number of forms (2) was observed with the lemma “diyaryo”: diyaryo, diyaryong.

The 3rd highest number of forms (2) was observed with the lemma “lalaki”: lalaki, lalaking.

NOUN occurs with 3 features: Gender (4; 3% instances), Link (3; 2% instances), Foreign (1; 1% instances)

NOUN occurs with 4 feature-value pairs: Foreign=Yes, Gender=Fem, Gender=Masc, Link=Yes

NOUN occurs with 5 feature combinations. The most frequent feature combination is _, that is, no features (151 tokens). Examples: bata, pagkain, babae, nanay, libro, titser, bahay, bangka, banko, bigas

Relations

NOUN nodes are attached to their parents using 12 different relations: nsubj (58; 36% instances), obj (27; 17% instances), obl (20; 13% instances), nsubj:pass (15; 9% instances), root (15; 9% instances), obj:agent (13; 8% instances), nsubj:lfoc (4; 3% instances), iobj:patient (3; 2% instances), compound:redup (1; 1% instances), iobj (1; 1% instances), nsubj:bfoc (1; 1% instances), nsubj:ifoc (1; 1% instances)
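As a sanity check, the percentages above can be recomputed from the raw counts. The sketch below assumes each percentage is the relation count divided by the total number of NOUN tokens, rounded to the nearest integer; the dictionary simply restates the counts from this page.

```python
# Relation counts for NOUN nodes, copied from the list above.
relation_counts = {
    "nsubj": 58, "obj": 27, "obl": 20, "nsubj:pass": 15, "root": 15,
    "obj:agent": 13, "nsubj:lfoc": 4, "iobj:patient": 3,
    "compound:redup": 1, "iobj": 1, "nsubj:bfoc": 1, "nsubj:ifoc": 1,
}

# The counts should add up to the number of NOUN tokens (159).
total = sum(relation_counts.values())

# Assumption: percentages are rounded to the nearest whole number.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

Because each value is rounded independently, the printed percentages need not sum to exactly 100.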

Parents of NOUN nodes belong to 7 different parts of speech: VERB (114; 72% instances), [artificial root, no POS tag] (15; 9% instances), ADJ (13; 8% instances), NOUN (12; 8% instances), PRON (3; 2% instances), ADV (1; 1% instances), PROPN (1; 1% instances)

8 (5%) NOUN nodes are leaves.

118 (74%) NOUN nodes have one child.

26 (16%) NOUN nodes have two children.

7 (4%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.

Children of NOUN nodes are attached using 9 different relations: case (146; 76% instances), punct (16; 8% instances), nsubj (13; 7% instances), det (10; 5% instances), acl:relcl (3; 2% instances), nmod:poss (2; 1% instances), compound:redup (1; 1% instances), csubj (1; 1% instances), nmod (1; 1% instances)

Children of NOUN nodes belong to 6 different parts of speech: ADP (146; 76% instances), PUNCT (16; 8% instances), NOUN (12; 6% instances), DET (9; 5% instances), PRON (5; 3% instances), VERB (5; 3% instances)
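The child statistics above can be cross-checked against each other. The sketch below uses only numbers from this page; splitting the "three or more children" group into nodes with three and nodes with four children is a derivation that relies on the stated maximum degree of 4, not a figure the page reports.

```python
# Child-degree distribution of NOUN nodes, as reported above.
leaves, one_child, two_children, three_or_more = 8, 118, 26, 7
total_nodes = leaves + one_child + two_children + three_or_more  # 159

# Total children, summed over the child relation counts listed above.
child_relation_counts = [146, 16, 13, 10, 3, 2, 1, 1, 1]
total_children = sum(child_relation_counts)  # 193

# Children that must belong to nodes with three or more children.
high_degree_children = total_children - (one_child * 1 + two_children * 2)

# With a maximum degree of 4, every such node has 3 or 4 children:
#   3 * a + 4 * b = high_degree_children  and  a + b = three_or_more
nodes_with_four = high_degree_children - 3 * three_or_more
nodes_with_three = three_or_more - nodes_with_four

print(total_nodes, total_children, nodes_with_three, nodes_with_four)
```

Under these assumptions the figures are mutually consistent: the degree distribution covers all 159 NOUN tokens, and the children counted by relation match those counted by part of speech.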