home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tamil-MWTT: POS Tags: PRON

There are 25 PRON lemmas (5%), 55 PRON types (6%) and 171 PRON tokens (7%). Out of 13 observed tags, the rank of PRON is: 5 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: நான், அவன், தான், இது, நீங்கள், நீ, எல்லோரும், அது, அவள், என்ன

The 10 most frequent PRON types: நான், அவன், தன், எல்லோரும், தன்னை, நீ, என், எங்கள், என்ன, என்னை

The 10 most frequent ambiguous lemmas: தான் (PRON 17, ADV 2, PART 2), எல்லோரும் (PRON 7, NOUN 1), சில (PRON 2, ADJ 1, DET 1), பல (DET 1, PRON 1)

The 10 most frequent ambiguous types: எல்லோரும் (PRON 7, NOUN 1)

Morphology

The form / lemma ratio of PRON is 2.200000 (the average of all parts of speech is 1.743028).

The 1st highest number of forms (10) was observed with the lemma “நான்”: எனக்குப், என், என்னால், என்னிடம், என்னை, என்னையே, நானாக, நானாவது, நானே, நான்.

The 2nd highest number of forms (5) was observed with the lemma “அவன்”: அவனாக, அவனிடம், அவனுக்கு, அவனுடைய, அவன்.

The 3rd highest number of forms (5) was observed with the lemma “இது”: இது, இதை, இதைத், இதைப், இவைகள்.

PRON occurs with 7 features: Number (165; 96% instances), Case (158; 92% instances), Gender (153; 89% instances), Person (140; 82% instances), PronType (48; 28% instances), Animacy (36; 21% instances), Polite (4; 2% instances)

PRON occurs with 19 feature-value pairs: Animacy=Anim, Case=Acc, Case=Ben, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Com, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, PronType=Ind, PronType=Prs

PRON occurs with 41 feature combinations. The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Com|Number=Sing|Person=1|PronType=Prs (27 tokens). Examples: நான்

Relations

PRON nodes are attached to their parents using 9 different relations: nsubj (89; 52% instances), obj (30; 18% instances), nmod:poss (25; 15% instances), nsubj:nc (7; 4% instances), root (7; 4% instances), iobj (5; 3% instances), obl (5; 3% instances), nmod (2; 1% instances), obl:arg (1; 1% instances)

Parents of PRON nodes belong to 6 different parts of speech: VERB (111; 65% instances), NOUN (44; 26% instances), PRON (7; 4% instances), (7; 4% instances), ADV (1; 1% instances), PROPN (1; 1% instances)

159 (93%) PRON nodes are leaves.

5 (3%) PRON nodes have one child.

7 (4%) PRON nodes have two children.

The highest child degree of a PRON node is 2.

Children of PRON nodes are attached using 5 different relations: punct (7; 37% instances), nsubj (6; 32% instances), nmod (3; 16% instances), case (2; 11% instances), acl (1; 5% instances)

Children of PRON nodes belong to 5 different parts of speech: PRON (7; 37% instances), PUNCT (7; 37% instances), ADP (2; 11% instances), PROPN (2; 11% instances), VERB (1; 5% instances)