
This page pertains to UD version 2.

Treebank Statistics: UD_Tagalog-Ugnayan: POS Tags: DET

There are 4 DET lemmas (1%), 4 DET types (1%) and 28 DET tokens (3%). Out of 14 observed tags, the rank of DET is: 11 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: mga, marami, ilan, bawat

The 10 most frequent DET types: mga, marami, ilan, bawat

The 10 most frequent ambiguous lemmas: marami (DET 7, ADJ 3)

The 10 most frequent ambiguous types: marami (DET 3, ADJ 2)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.116129).
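The ratio above can be reproduced with a short sketch. It assumes the ratio is the number of distinct word forms divided by the number of distinct lemmas for the tag; that definition is an assumption, but it is consistent with the 4 types and 4 lemmas reported for DET. The function name `form_lemma_ratio` and the toy token list are hypothetical and are not taken from the treebank tooling.

```python
def form_lemma_ratio(tokens, upos):
    """Return distinct forms / distinct lemmas for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) tuples.
    Assumption: forms are compared exactly as written (case-sensitive).
    """
    forms = {form for form, _, tag in tokens if tag == upos}
    lemmas = {lemma for _, lemma, tag in tokens if tag == upos}
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Illustrative tokens only: the four DET lemmas listed on this page,
# each with a single form, plus one ADJ reading of "marami".
toy_tokens = [
    ("mga", "mga", "DET"),
    ("marami", "marami", "DET"),
    ("ilan", "ilan", "DET"),
    ("bawat", "bawat", "DET"),
    ("mga", "mga", "DET"),        # a repeated token does not add a new form
    ("marami", "marami", "ADJ"),  # ignored when upos == "DET"
]

ratio = form_lemma_ratio(toy_tokens, "DET")
print(f"{ratio:.6f}")
```

Because every DET lemma here has exactly one form, the sketch yields a ratio of 1, matching the figure reported above.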

The 1st highest number of forms (1) was observed with the lemma “bawat”: bawat.

The 2nd highest number of forms (1) was observed with the lemma “ilan”: ilan.

The 3rd highest number of forms (1) was observed with the lemma “marami”: marami.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 2 different relations: det (25; 89% instances), root (3; 11% instances)
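The percentages in this section appear to be each count's share of the total, rounded to the nearest whole percent. The following is a minimal sketch of that arithmetic using the two relation counts above; the helper name `percentage_shares` is hypothetical, and the rounding rule is an assumption.

```python
def percentage_shares(counts):
    """Map each label to its share of the total, as a whole percent."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("counts must not sum to zero")
    return {label: round(100 * n / total) for label, n in counts.items()}


# Relation counts for DET nodes, as reported on this page (28 tokens).
det_relations = {"det": 25, "root": 3}
shares = percentage_shares(det_relations)
print(shares)  # {'det': 89, 'root': 11}
```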

Parents of DET nodes fall into 4 categories, one of which is the empty parent of root-attached nodes: NOUN (23; 82% instances), no parent, i.e. root (3; 11% instances), NUM (1; 4% instances), VERB (1; 4% instances)

17 (61%) DET nodes are leaves.

8 (29%) DET nodes have one child.

0 (0%) DET nodes have two children.

3 (11%) DET nodes have three or more children.

The highest child degree of a DET node is 3.
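The child-degree figures above can be derived from the HEAD column of a CoNLL-U file. The sketch below shows one way to do the bucketing, assuming each sentence has already been reduced to (id, upos, head) tuples with head 0 marking the root. The function name `child_degree_buckets` and the two toy sentences are hypothetical and do not come from the treebank.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="DET"):
    """Bucket nodes tagged `upos` by their number of children.

    `sentences` is a list of sentences; each sentence is a list of
    (id, upos, head) tuples, where head == 0 marks the root.
    Returns (buckets, highest_degree).
    """
    buckets = {"leaf": 0, "one": 0, "two": 0, "three_or_more": 0}
    highest = 0
    for sentence in sentences:
        # Number of tokens pointing at each head id within this sentence.
        children = Counter(head for _, _, head in sentence)
        for tok_id, tag, _ in sentence:
            if tag != upos:
                continue
            degree = children[tok_id]
            highest = max(highest, degree)
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets, highest


# Two made-up sentences: a DET leaf under a NOUN, and a DET root
# with three children.
toy_sentences = [
    [(1, "DET", 2), (2, "NOUN", 0)],
    [(1, "PART", 2), (2, "DET", 0), (3, "VERB", 2), (4, "PUNCT", 2)],
]

buckets, highest = child_degree_buckets(toy_sentences)
print(buckets, highest)
```

On the real treebank the four buckets would sum to the DET token count, as they do above (17 + 8 + 0 + 3 = 28).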

Children of DET nodes are attached using 6 different relations: mark (8; 47% instances), nsubj (3; 18% instances), punct (3; 18% instances), advcl (1; 6% instances), advmod (1; 6% instances), obl (1; 6% instances)

Children of DET nodes belong to 4 different parts of speech: PART (8; 47% instances), VERB (5; 29% instances), PUNCT (3; 18% instances), ADV (1; 6% instances)