Treebank Statistics: UD_Sanskrit-Vedic: POS Tags: NUM
There are 28 NUM
lemmas (1%), 70 NUM
types (1%) and 292 NUM
tokens (1%).
Out of 11 observed tags, the rank of NUM
is: 5 in number of lemmas, 6 in number of types and 9 in number of tokens.
The 10 most frequent NUM
lemmas: śata, tri, saptan, dvi, eka, sahasra, aṣṭan, dvādaśan, pañcan, catur
The 10 most frequent NUM
types: śatam, sapta, pañca, dvādaśa, eka, śata, tisraḥ, ekādaśa, aṣṭau, daśa
The 10 most frequent ambiguous lemmas: eka (PRON 28, NUM 22), catur (NUM 13, ADV 2), trayastriṃśat (NOUN 3, NUM 3), pañcāśat (NUM 2, NOUN 1), saptadaśan (NUM 2, ADJ 1), daśama (ADJ 3, NUM 1), ekaviṃśa (ADJ 3, NUM 1), ekaviṃśati (NOUN 5, NUM 1), tris (ADV 22, NUM 1)
The 10 most frequent ambiguous types: eka (NUM 12, PRON 5), daśa (NUM 9, VERB 1), ekam (PRON 4, NUM 3), trayastriṃśat (NOUN 3, NUM 3), ekayā (NUM 2, PRON 2), ekaḥ (PRON 12, NUM 2), saptadaśa (NUM 2, ADJ 1), daśame (ADJ 1, NUM 1), ekaviṃśatyā (NOUN 4, NUM 1), ekena (NUM 1, PRON 1)
- eka
- daśa
- ekam
- trayastriṃśat
- ekayā
- ekaḥ
- saptadaśa
- daśame
- ADJ 1: hiraṇyayī araṇī yam nirmanthataḥ aśvinā tam te garbham havāmahe daśame māsi sūtave
- NUM 1: ṛṇam asmin saṃnayati amṛta tvam ca gacchati pitā putrasya jātasya paśyet ced jīvataḥ mukham yāvantaḥ pṛthivyām bhogāḥ yāvantaḥ jātavedasi yāvantaḥ apsu prāṇinām bhūyān putre pituḥ tatas śaśvat putreṇa pitaraḥ atyāyan bahulam tamaḥ ātmā hi jajñe ātmanaḥ saḥ irāvatī atitāriṇī kim nu malam kim ajinam kim u śmaśrūṇi kim tapaḥ putram brahmāṇaḥ icchadhvam sa vai lokaḥ avadāvadaḥ annam ha prāṇaḥ śaraṇam ha vāsaḥ rūpam hiraṇyam paśavaḥ vivāhāḥ sakhā ha jāyā kṛpaṇam ha duhitā jyotiḥ ha putraḥ parame vyoman patiḥ jāyām praviśati garbhaḥ bhūtvā sa mātaram tasyām punar navaḥ bhūtvā daśame māsi jāyate tat jāyā jāyā bhavati yat asyām jāyate punar ābhūtiḥ eṣā ābhūtiḥ bījam etat nidhīyate devāḥ ca etām ṛṣayaḥ ca tejaḥ samabharat mahat devāḥ manuṣyān abruvan eṣā vaḥ jananī punar na aputrasya lokaḥ asti iti tat sarve paśavaḥ viduḥ tasmāt tu putraḥ mātaram svasāram ca adhirohati eṣa panthāḥ urugāyaḥ su śevaḥ yam putriṇaḥ ākramante viśokāḥ tam paśyanti paśavaḥ vayāṃsi ca tasmāt te mātrā api mithunībhavanti iti ha asmai ākhyāya
- ekaviṃśatyā
- ekena
Morphology
The form / lemma ratio of NUM
is 2.500000 (the average of all parts of speech is 2.167316).
The 1st highest number of forms (9) was observed with the lemma “tri”: tisraḥ, tisṛbhiḥ, trayaḥ, trayāṇām, tri, triṣu, trī, trīn, trīṇi.
The 2nd highest number of forms (7) was observed with the lemma “eka”: eka, ekam, ekayā, ekaḥ, ekena, ekā, ekām.
The 3rd highest number of forms (6) was observed with the lemma “śata”: śata, śatam, śatasya, śatā, śatāni, śatāḥ.
NUM
occurs with 3 features: Gender (238; 82% instances), Case (215; 74% instances), Number (215; 74% instances)
NUM
occurs with 14 feature-value pairs: Case=Abl
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
NUM
occurs with 37 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Neut|Number=Sing
(57 tokens).
Examples: śatam, dvādaśa, nava, sapta, ekādaśa, sahasram, daśa, ekam, pañca, saptadaśa
Relations
NUM
nodes are attached to their parents using 10 different relations: nummod (181; 62% instances), conj (27; 9% instances), root (20; 7% instances), nsubj (18; 6% instances), obj (17; 6% instances), obl (12; 4% instances), flat (7; 2% instances), xcomp (5; 2% instances), nmod (4; 1% instances), compound (1; 0% instances)
Parents of NUM
nodes belong to 7 different parts of speech: NOUN (160; 55% instances), VERB (64; 22% instances), NUM (28; 10% instances), (20; 7% instances), ADJ (17; 6% instances), PRON (2; 1% instances), ADV (1; 0% instances)
213 (73%) NUM
nodes are leaves.
45 (15%) NUM
nodes have one child.
19 (7%) NUM
nodes have two children.
15 (5%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 19 different relations: nsubj (32; 23% instances), conj (22; 16% instances), advmod (14; 10% instances), det (14; 10% instances), flat (11; 8% instances), cc (8; 6% instances), acl (5; 4% instances), amod (5; 4% instances), discourse (5; 4% instances), nmod (5; 4% instances), nummod (4; 3% instances), orphan (3; 2% instances), mark (2; 1% instances), obl (2; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), case (1; 1% instances), ccomp (1; 1% instances), vocative (1; 1% instances)
Children of NUM
nodes belong to 9 different parts of speech: NOUN (48; 35% instances), NUM (28; 20% instances), PRON (23; 17% instances), ADV (18; 13% instances), CCONJ (8; 6% instances), VERB (5; 4% instances), PART (4; 3% instances), ADJ (2; 1% instances), SCONJ (1; 1% instances)