home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-Vedic: POS Tags: DET

There are 27 DET lemmas (0%), 100 DET types (0%) and 565 DET tokens (0%). Out of 13 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: viśva, sva, bahu, ubhaya, puru, tāvat, yāvat, bhūri, samāna, etāvat

The 10 most frequent DET types: svena, viśva, viśvāḥ, viśvā, bahu, viśve, sve, svayā, viśvam, viśvāni

The 10 most frequent ambiguous lemmas: viśva (ADJ 245, DET 237, PRON 3, NOUN 2), sva (DET 180, ADJ 107, NOUN 14), bahu (ADJ 72, DET 59, PRON 1), ubhaya (ADJ 57, DET 21), puru (ADJ 40, DET 18), tāvat (ADJ 50, ADV 25, DET 11), yāvat (ADJ 75, SCONJ 52, DET 7, ADV 3), bhūri (ADJ 29, DET 6), samāna (ADJ 96, NOUN 9, DET 4), etāvat (ADJ 37, DET 3)

The 10 most frequent ambiguous types: svena (DET 67, ADJ 3), viśva (ADJ 60, DET 50), viśvāḥ (DET 36, ADJ 14), viśvā (DET 35, ADJ 17, PRON 1), bahu (ADJ 35, DET 34, PRON 1), viśve (ADJ 50, DET 26), sve (DET 25, ADJ 5), svayā (DET 23, ADJ 1), viśvam (ADJ 23, DET 22), viśvāni (DET 22, ADJ 13)

Morphology

The form / lemma ratio of DET is 3.703704 (the average of all parts of speech is 2.674382).

The 1st highest number of forms (24) was observed with the lemma “viśva”: _, viśva, viśvaiḥ, viśvam, viśvasmai, viśvasmāt, viśvasya, viśvasyāḥ, viśvaḥ, viśve, viśvebhiḥ, viśvebhyaḥ, viśvena, viśveṣu, viśveṣām, viśvā, viśvābhiḥ, viśvām, viśvān, viśvāni, viśvāsu, viśvāsām, viśvāt, viśvāḥ.

The 2nd highest number of forms (18) was observed with the lemma “sva”: _, sva, svaiḥ, svam, svasya, svayā, svaḥ, sve, svebhiḥ, svena, svā, svām, svāsaḥ, svāt, svāya, svāyai, svāyām, svāḥ.

The 3rd highest number of forms (12) was observed with the lemma “bahu”: _, bahavaḥ, bahave, bahu, bahum, bahunā, bahuḥ, bahvīm, bahvīḥ, bahūn, bahūnām, bahūḥ.

DET occurs with 4 features: Case (442; 78% instances), Gender (442; 78% instances), Number (442; 78% instances), Compound (123; 22% instances)

DET occurs with 14 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Compound=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

DET occurs with 43 feature combinations. The most frequent feature combination is Compound=Yes (123 tokens). Examples: viśva, bahu, sva, _, puru, bhūri, ubhaya, jīva, samāna

Relations

DET nodes are attached to their parents using 1 different relations: det (565; 100% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (526; 93% instances), ADJ (26; 5% instances), PRON (7; 1% instances), ADV (2; 0% instances), NUM (2; 0% instances), VERB (2; 0% instances)

488 (86%) DET nodes are leaves.

75 (13%) DET nodes have one child.

2 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 8 different relations: discourse (66; 84% instances), conj (4; 5% instances), acl:relcl (3; 4% instances), det (2; 3% instances), acl (1; 1% instances), case (1; 1% instances), cc (1; 1% instances), nmod:appos (1; 1% instances)

Children of DET nodes belong to 5 different parts of speech: PART (67; 85% instances), ADJ (5; 6% instances), PRON (3; 4% instances), VERB (3; 4% instances), CCONJ (1; 1% instances)