home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tamil-MWTT: POS Tags: PROPN

There are 7 PROPN lemmas (1%), 32 PROPN types (4%) and 315 PROPN tokens (12%). Out of 13 observed tags, the rank of PROPN is: 10 in number of lemmas, 7 in number of types and 4 in number of tokens.

The 10 most frequent PROPN lemmas: குமார், ராஜா, அமெரிக்கா, சென்னை, கமலா, பாண்டிச்சேரி, ராமன்

The 10 most frequent PROPN types: குமார், குமாருக்கு, குமாரை, ராஜா, ராஜாவை, குமாருக்குத், குமாருக்குப், ராஜாவுக்கு, அமெரிகாவுக்கு, குமாருக்குச்

The 10 most frequent ambiguous lemmas: ராஜா (PROPN 21, ADV 1), சென்னை (PROPN 2, NOUN 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 4.571429 (the average of all parts of speech is 1.743028).

The 1st highest number of forms (18) was observed with the lemma “குமார்”: குமாரது, குமாரால், குமாராவது, குமாரிடம், குமாருக்கு, குமாருக்குச், குமாருக்குத், குமாருக்குப், குமாருடையது, குமாரும், குமாரே, குமாரை, குமாரைத், குமாரைப், குமாரோ, குமாரோடு, குமார், குமார்தான்.

The 2nd highest number of forms (8) was observed with the lemma “ராஜா”: ராஜா, ராஜாவிடம், ராஜாவுக்கு, ராஜாவும், ராஜாவை, ராஜாவைத், ராஜாவைப், ராஜாவோ.

The 3rd highest number of forms (2) was observed with the lemma “சென்னை”: சென்னைக்கு, சென்னையில்.

PROPN occurs with 5 features: Case (315; 100% instances), Number (315; 100% instances), Person (315; 100% instances), Polite (39; 12% instances), Gender (2; 1% instances)

PROPN occurs with 11 feature-value pairs: Case=Acc, Case=Com, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Neut, Number=Sing, Person=3, Polite=Form

PROPN occurs with 12 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (202 tokens). Examples: குமார், ராஜா, குமாரும், குமாரோ, குமாராவது, குமாரே, குமார்தான், ராஜாவும், ராஜாவோ

Relations

PROPN nodes are attached to their parents using 11 different relations: nsubj (228; 72% instances), nsubj:nc (38; 12% instances), obj (17; 5% instances), obl (9; 3% instances), root (7; 2% instances), iobj (5; 2% instances), nsubj:pass (4; 1% instances), conj (3; 1% instances), nmod (2; 1% instances), obl:cmpr (1; 0% instances), obl:lmod (1; 0% instances)

Parents of PROPN nodes belong to 6 different parts of speech: VERB (282; 90% instances), NOUN (18; 6% instances), (7; 2% instances), PROPN (4; 1% instances), ADV (2; 1% instances), PRON (2; 1% instances)

299 (95%) PROPN nodes are leaves.

9 (3%) PROPN nodes have one child.

6 (2%) PROPN nodes have two children.

1 (0%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 3.

Children of PROPN nodes are attached using 7 different relations: punct (7; 29% instances), case (6; 25% instances), nsubj (4; 17% instances), conj (3; 13% instances), obl (2; 8% instances), advcl (1; 4% instances), cc (1; 4% instances)

Children of PROPN nodes belong to 7 different parts of speech: PUNCT (7; 29% instances), ADP (6; 25% instances), NOUN (4; 17% instances), PROPN (4; 17% instances), ADV (1; 4% instances), CCONJ (1; 4% instances), PRON (1; 4% instances)