home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cebuano-GJA: POS Tags: PROPN

There are 37 PROPN lemmas (8%), 37 PROPN types (7%) and 63 PROPN tokens (5%). Out of 14 observed tags, the rank of PROPN is: 4 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent PROPN lemmas: Tom, Mary, Pedro, Juan, Cebu, Ditang, Maria, Peter, Adot, Alicia

The 10 most frequent PROPN types: Tom, Mary, Pedro, Juan, Cebu, Ditang, Maria, Peter, Adot, Alicia

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.162584).

The 1st highest number of forms (1) was observed with the lemma “Adot”: Adot.

The 2nd highest number of forms (1) was observed with the lemma “Alicia”: Alicia.

The 3rd highest number of forms (1) was observed with the lemma “Ana”: Ana.

PROPN occurs with 1 features: Gender (40; 63% instances)

PROPN occurs with 2 feature-value pairs: Gender=Fem, Gender=Masc

PROPN occurs with 3 feature combinations. The most frequent feature combination is Gender=Masc (27 tokens). Examples: Tom, Pedro, Juan, Adot, Lito, Rolando, Ruben, Tonying, Toto, Undo

Relations

PROPN nodes are attached to their parents using 9 different relations: nsubj (22; 35% instances), obj (16; 25% instances), obl (7; 11% instances), nmod (5; 8% instances), root (5; 8% instances), conj (4; 6% instances), vocative (2; 3% instances), compound (1; 2% instances), flat (1; 2% instances)

Parents of PROPN nodes belong to 7 different parts of speech: VERB (44; 70% instances), PROPN (6; 10% instances), (5; 8% instances), ADJ (3; 5% instances), ADV (2; 3% instances), NOUN (2; 3% instances), PRON (1; 2% instances)

5 (8%) PROPN nodes are leaves.

44 (70%) PROPN nodes have one child.

6 (10%) PROPN nodes have two children.

8 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 15 different relations: case (54; 62% instances), punct (9; 10% instances), nmod (4; 5% instances), cc (3; 3% instances), advmod (2; 2% instances), amod (2; 2% instances), conj (2; 2% instances), mark (2; 2% instances), nsubj (2; 2% instances), orphan (2; 2% instances), appos (1; 1% instances), compound (1; 1% instances), flat (1; 1% instances), nummod (1; 1% instances), obl (1; 1% instances)

Children of PROPN nodes belong to 10 different parts of speech: ADP (54; 62% instances), PUNCT (9; 10% instances), NOUN (7; 8% instances), PROPN (6; 7% instances), CCONJ (3; 3% instances), PART (3; 3% instances), ADJ (2; 2% instances), NUM (1; 1% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)