Treebank Statistics: UD_Tamil-MWTT: POS Tags: PROPN
There are 7 PROPN lemmas (1%), 32 PROPN types (4%) and 315 PROPN tokens (12%).
Out of 13 observed tags, the rank of PROPN is: 10 in number of lemmas, 7 in number of types and 4 in number of tokens.
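The lemma, type and token counts above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy sentences are illustrative assumptions, and it assumes types are compared as exact, case-sensitive word forms.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

# Toy input (invented for illustration, not taken from the treebank).
sent1 = [
    ["1", "ராஜா", "ராஜா", "PROPN", "_", "_", "3", "nsubj", "_", "_"],
    ["2", "குமாரை", "குமார்", "PROPN", "_", "_", "3", "obj", "_", "_"],
    ["3", "பார்த்தான்", "பார்", "VERB", "_", "_", "0", "root", "_", "_"],
]
sent2 = [
    ["1", "குமார்", "குமார்", "PROPN", "_", "_", "2", "nsubj", "_", "_"],
    ["2", "வந்தான்", "வா", "VERB", "_", "_", "0", "root", "_", "_"],
]
sample = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in (sent1, sent2)
) + "\n\n"

counts = upos_counts(sample)
```

In the toy input, PROPN has 2 lemmas, 3 types and 3 tokens, because குமார் and குமாரை share one lemma but count as two types.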
The 7 PROPN lemmas, most frequent first: குமார், ராஜா, அமெரிக்கா, சென்னை, கமலா, பாண்டிச்சேரி, ராமன்
The 10 most frequent PROPN types: குமார், குமாருக்கு, குமாரை, ராஜா, ராஜாவை, குமாருக்குத், குமாருக்குப், ராஜாவுக்கு, அமெரிகாவுக்கு, குமாருக்குச்
The 2 ambiguous PROPN lemmas: ராஜா (PROPN 21, ADV 1), சென்னை (PROPN 2, NOUN 1)
The 10 most frequent ambiguous types: none observed.
Morphology
The form / lemma ratio of PROPN is 4.571429 (the average of all parts of speech is 1.743028).
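The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page (32 and 7). A short check, assuming that is how the figure is derived:

```python
n_types = 32   # PROPN types reported on this page
n_lemmas = 7   # PROPN lemmas reported on this page

ratio = n_types / n_lemmas
formatted = f"{ratio:.6f}"  # six decimals, matching the page's precision
print(formatted)
```

The per-lemma form counts agree with this: 18 + 8 + 2 forms for the three lemmas listed below, plus one form each for the remaining four lemmas, gives 32.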
The highest number of forms (18) was observed with the lemma “குமார்”: குமாரது, குமாரால், குமாராவது, குமாரிடம், குமாருக்கு, குமாருக்குச், குமாருக்குத், குமாருக்குப், குமாருடையது, குமாரும், குமாரே, குமாரை, குமாரைத், குமாரைப், குமாரோ, குமாரோடு, குமார், குமார்தான்.
The second-highest number of forms (8) was observed with the lemma “ராஜா”: ராஜா, ராஜாவிடம், ராஜாவுக்கு, ராஜாவும், ராஜாவை, ராஜாவைத், ராஜாவைப், ராஜாவோ.
The third-highest number of forms (2) was observed with the lemma “சென்னை”: சென்னைக்கு, சென்னையில்.
PROPN occurs with 5 features: Case (315; 100% instances), Number (315; 100% instances), Person (315; 100% instances), Polite (39; 12% instances), Gender (2; 1% instances)
PROPN occurs with 11 feature-value pairs: Case=Acc, Case=Com, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Neut, Number=Sing, Person=3, Polite=Form
PROPN occurs with 12 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (202 tokens).
Examples: குமார், ராஜா, குமாரும், குமாரோ, குமாராவது, குமாரே, குமார்தான், ராஜாவும், ராஜாவோ
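The feature, feature-value and feature-combination counts above come from the FEATS column of the CoNLL-U files, where pairs are written as `Name=Value` and joined with `|`. The following is a minimal sketch of that tally. The helper names `parse_feats` and `feature_stats` and the toy FEATS strings are illustrative assumptions, not the code behind this page.

```python
from collections import Counter

def parse_feats(feats):
    """Turn 'Case=Nom|Number=Sing' into {'Case': 'Nom', 'Number': 'Sing'}."""
    if feats == "_":  # CoNLL-U uses "_" for "no features"
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_list):
    """Count features, feature-value pairs and full combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_list:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos

# Toy FEATS values for three PROPN tokens (invented for illustration).
toy_feats = [
    "Case=Nom|Number=Sing|Person=3",
    "Case=Nom|Number=Sing|Person=3",
    "Case=Dat|Number=Sing|Person=3|Polite=Form",
]
features, pairs, combos = feature_stats(toy_feats)
top_combo, top_count = combos.most_common(1)[0]
```

Here `len(features)`, `len(pairs)` and `len(combos)` correspond to the "5 features", "11 feature-value pairs" and "12 feature combinations" figures when run over all PROPN tokens.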
Relations
PROPN nodes are attached to their parents using 11 different relations: nsubj (228; 72% instances), nsubj:nc (37; 12% instances), obj (17; 5% instances), obl (10; 3% instances), root (7; 2% instances), iobj (5; 2% instances), nsubj:pass (4; 1% instances), conj (3; 1% instances), nmod (2; 1% instances), obl:cmpr (1; <1% instances), obl:lmod (1; <1% instances)
Parents of PROPN nodes belong to 5 different parts of speech, and 7 PROPN nodes are roots with no parent: VERB (282; 90% instances), NOUN (18; 6% instances), no parent (root) (7; 2% instances), PROPN (4; 1% instances), ADV (2; 1% instances), PRON (2; 1% instances)
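The relation and parent-POS tallies can be sketched as follows. In CoNLL-U a HEAD value of 0 marks the sentence root, which has no parent and therefore no parent part of speech; the 7 entries without a POS label in the list above presumably correspond to the 7 `root` attachments. The function name `parent_stats`, the tuple layout and the toy sentences are assumptions made for illustration.

```python
from collections import Counter

def parent_stats(sentences, upos="PROPN"):
    """Tally incoming relations and parent POS tags for nodes tagged `upos`.

    Each sentence is a list of (id, head, upos, deprel) tuples.
    """
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        pos_by_id = {node_id: node_upos for node_id, _h, node_upos, _r in sent}
        for _id, head, node_upos, deprel in sent:
            if node_upos != upos:
                continue
            relations[deprel] += 1
            # HEAD 0 is not a real node, so it falls back to "(root)".
            parent_pos[pos_by_id.get(head, "(root)")] += 1
    return relations, parent_pos

# Toy sentences (invented for illustration, not taken from the treebank).
toy = [
    [(1, 3, "PROPN", "nsubj"), (2, 3, "PROPN", "obj"), (3, 0, "VERB", "root")],
    [(1, 0, "PROPN", "root"), (2, 1, "PUNCT", "punct")],
]
relations, parent_pos = parent_stats(toy)
```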
299 (95%) PROPN nodes are leaves.
9 (3%) PROPN nodes have one child.
6 (2%) PROPN nodes have two children.
1 (<1%) PROPN node has three or more children.
The highest child degree of a PROPN node is 3.
Children of PROPN nodes are attached using 7 different relations: punct (7; 29% instances), case (6; 25% instances), nsubj (4; 17% instances), conj (3; 13% instances), obl (2; 8% instances), advcl (1; 4% instances), cc (1; 4% instances)
Children of PROPN nodes belong to 7 different parts of speech: PUNCT (7; 29% instances), ADP (6; 25% instances), NOUN (4; 17% instances), PROPN (4; 17% instances), ADV (1; 4% instances), CCONJ (1; 4% instances), PRON (1; 4% instances)
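The child-degree figures can be derived by counting, for each PROPN node, how many nodes in the same sentence name it as their head. The sketch below shows that count on toy data and then checks that the reported figures are internally consistent. The function name `child_degree_histogram` and the toy sentences are illustrative assumptions.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="PROPN"):
    """Map number-of-children -> number of `upos` nodes with that many children.

    Each sentence is a list of (id, head, upos) tuples.
    """
    hist = Counter()
    for sent in sentences:
        n_children = Counter(head for _id, head, _upos in sent)
        for node_id, _head, node_upos in sent:
            if node_upos == upos:
                hist[n_children[node_id]] += 1  # missing key counts as 0
    return hist

# Toy sentences (invented for illustration, not taken from the treebank).
toy = [
    [(1, 3, "PROPN"), (2, 3, "PROPN"), (3, 0, "VERB")],
    [(1, 0, "PROPN"), (2, 1, "PUNCT")],
]
toy_hist = child_degree_histogram(toy)

# Consistency check on the figures reported above. The single node in the
# "three or more" bucket has exactly 3 children, since the highest degree is 3.
reported = {0: 299, 1: 9, 2: 6, 3: 1}
total_nodes = sum(reported.values())
total_children = sum(degree * n for degree, n in reported.items())
```

With the reported figures, `total_nodes` equals the 315 PROPN tokens and `total_children` equals 24, which matches the sum of the child relation counts and of the child POS counts.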