home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Assamese-AiW: POS Tags: PROPN

There are 5 PROPN lemmas (1%), 8 PROPN types (2%) and 11 PROPN tokens (1%). Out of 15 observed tags, the rank of PROPN is: 13 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent PROPN lemmas: এলিচ, ডিনা, অস্ট্রেলিয়া, দুৰ্গা, নিউজিলেণ্ড

The 10 most frequent PROPN types: এলিচ, এলিচে, অস্ট্রেলিয়া, এলিচো, ডিনাই, ডিনাজনীলৈ, দুৰ্গা, নিউজিলেণ্ড

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.600000 (the average of all parts of speech is 1.317618).

The 1st highest number of forms (3) was observed with the lemma “এলিচ”: এলিচ, এলিচে, এলিচো.

The 2nd highest number of forms (2) was observed with the lemma “ডিনা”: ডিনাই, ডিনাজনীলৈ.

The 3rd highest number of forms (1) was observed with the lemma “অস্ট্রেলিয়া”: অস্ট্রেলিয়া.

PROPN occurs with 3 features: Number (7; 64% instances), Gender (4; 36% instances), Case (3; 27% instances)

PROPN occurs with 4 feature-value pairs: Case=Erg, Case=Nom, Gender=Fem, Number=Sing

PROPN occurs with 7 feature combinations. The most frequent feature combination is _ (3 tokens). Examples: এলিচে, ডিনাই, ডিনাজনীলৈ

Relations

PROPN nodes are attached to their parents using 5 different relations: nsubj (7; 64% instances), appos (1; 9% instances), conj (1; 9% instances), nmod (1; 9% instances), parataxis (1; 9% instances)

Parents of PROPN nodes belong to 3 different parts of speech: VERB (8; 73% instances), NOUN (2; 18% instances), PROPN (1; 9% instances)

9 (82%) PROPN nodes are leaves.

1 (9%) PROPN nodes have one child.

1 (9%) PROPN nodes have two children.

The highest child degree of a PROPN node is 2.

Children of PROPN nodes are attached using 3 different relations: cc (1; 33% instances), conj (1; 33% instances), punct (1; 33% instances)

Children of PROPN nodes belong to 3 different parts of speech: CCONJ (1; 33% instances), PROPN (1; 33% instances), PUNCT (1; 33% instances)