This page pertains to UD version 2.

Treebank Statistics: UD_Akkadian-RIAO: POS Tags: NOUN

There are 698 NOUN lemmas (42%), 1233 NOUN types (42%) and 8922 NOUN tokens (38%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
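As a rough illustration of how the three counts above relate, the following sketch tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `pos_counts` and the toy rows are hypothetical; the sketch assumes case-sensitive forms and skips multiword-token ranges and empty nodes, which may differ from how the official statistics were produced.

```python
def pos_counts(conllu_text, upos="NOUN"):
    """Count distinct lemmas, distinct forms (types) and tokens for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # Keep only ordinary word lines: 10 columns and a plain integer ID
        # (this skips multiword-token ranges like "1-2" and empty nodes like "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


# Made-up toy rows for illustration only; not an actual sentence from the treebank.
rows = [
    ["1", "šar", "šarru", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["2", "māt", "mātu", "NOUN", "_", "_", "1", "nmod:poss", "_", "_"],
    ["3", "šarru", "šarru", "NOUN", "_", "_", "1", "appos", "_", "_"],
    ["4", "dannu", "dannu", "ADJ", "_", "_", "3", "amod", "_", "_"],
]
sample = "\n".join("\t".join(r) for r in rows)

print(pos_counts(sample))  # (2, 3, 3): two lemmas, three types, three tokens
```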

The 10 most frequent NOUN lemmas: šarru, mātu, _, ālu, māru, kiššatu, bēlu, šadû, ēkallu, ilu

The 10 most frequent NOUN types: šar, māt, mār, kiššati, šarru, ālāni, libbi, maddattu, ilāni, ummānāti

The 10 most frequent ambiguous lemmas: _ (PRON 2390, NOUN 411, VERB 333, PART 171, CCONJ 142, NUM 100, ADJ 64, PROPN 47, ADV 6, ADP 5, X 3, DET 2), pānu (NOUN 84, ADV 8), šaknu (NOUN 50, VERB 31), ašru (NOUN 42, SCONJ 4), dannu (ADJ 172, NOUN 18, VERB 5), manû (VERB 19, NOUN 15), danānu (NOUN 10, VERB 5), rabû (ADJ 175, NOUN 9, VERB 4), abnu (NOUN 8, VERB 1), edû (NOUN 8, VERB 1)

The 10 most frequent ambiguous types: pān (NOUN 59, ADV 8), dannūti (NOUN 32, ADJ 22), ašar (NOUN 16, SCONJ 4), x (NOUN 16, PROPN 16, VERB 10, ADP 3, X 3, NUM 1), rabûti (ADJ 55, NOUN 7), balṭūti (ADJ 14, NOUN 4), ekṣūte (NOUN 3, ADJ 2), šaknu (VERB 11, NOUN 3), arkât (NOUN 2, ADJ 1), battubatte (ADP 3, NOUN 2, ADV 1)

Morphology

The form / lemma ratio of NOUN is 1.766476 (the average of all parts of speech is 1.796005).
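The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page (1233 and 698). A minimal check, assuming that this is how the figure is derived (the helper name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(num_types, num_lemmas):
    """Average number of distinct word forms per lemma."""
    if num_lemmas <= 0:
        raise ValueError("num_lemmas must be positive")
    return num_types / num_lemmas


# Counts taken from this page: 1233 NOUN types, 698 NOUN lemmas.
noun_ratio = form_lemma_ratio(1233, 698)
print(f"{noun_ratio:.6f}")  # 1.766476
```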

The 1st highest number of forms (154) was observed with the lemma “_”: BAD.MEŠ, IM.MEŠ-ni, KA, NIG₂, abulli, ahu, amēlî, appī, asummēni, ašar, biriquš, bunnannî, bāb, bābāni, bēli, bēlē, bēlūt, danān, daprānī, diqārāt, diqārāte, dēkta, dēktu, dīkta, dūr, ebūrī, ekurrāt, emūqī, ereqqu, gabadibbī, gāgī, hurās, ihzi, išātāti, iṣṣūrē, kakkī, kasap, kinūn, kirâti, kissāte, kitekittê, kiššūt, kurummāti?, kussâ, labbāku, libba, libbi, limnīti, limētuš, liātu, lubulte, līt, līṭī, maddāte, makkūri, malkūt, ma’dūte, muddahhiṣī, muddahṣī, mudiš, muhhi, multa’’ît, multa’’īt, munēr, murte’ât, mušahmeṭi, mušerbû, mār, māt, māti, mūrānī, nablū, nakrī, namurrat, namzī’āte, narkabta, narkabāti, narû, nathi, nazzī’āte, nudunnî, nāhirī, nērbē, nērbī, nīqāte, nīše, pagūta, palê, pattûti, pilše, puhur, pî, pāni, pēt, qumāšātu, qāti, qā’iš, ritti, rā’im, rā’imat, sa-x-ti, sakkī, salmē, siqir, siqr, sā’te, sītāt, sītāte, tabbilī, tanatt, tidūki, tiklē, tuqmate, tuqmati, tuqumtu, tîrēte, ummānāti, urdūte, urdūti, urdūtu, uznī, x, x-ba-meš, x-e, x-i-te, x-x-x, x-x-x.MEŠ, x-šunu, x.MEŠ, {NA₄}x, āl, āli, ēkallāti, šaknūte, šaknūti, šamgāni, šangût, šanāti, šaressu, šarrūt, še’u, še’ī, še’ū, šum, šumi, šumēli, šurmēnī, šīrī, ūme, ṣalūl, ṣibtāti, ṣulūl, ṣābī, ṣēnī.

The 2nd highest number of forms (8) was observed with the lemma “mātu”: mās, māt, māta, māti, mātu, mātāt, mātāte, mātāti.

The 3rd highest number of forms (7) was observed with the lemma “nakru”: nakirē, nakirī, nakri, nakrī, nakrūt, nakrūti, nukrī.

NOUN occurs with 6 features: Number (8901; 100% instances), Gender (8899; 100% instances), NounBase (7320; 82% instances), Case (3242; 36% instances), Person (30; 0% instances), VerbForm (27; 0% instances)

NOUN occurs with 16 feature-value pairs: Case=Acc, Case=Gen, Case=Loc, Case=Nom, Gender=Com, Gender=Fem, Gender=Masc, NounBase=Bound, NounBase=Free, NounBase=Suffixal, NounBase=Terminal, Number=Plur, Number=Sing, Person=1, Person=3, VerbForm=Stat

NOUN occurs with 47 feature combinations. The most frequent feature combination is Gender=Masc|NounBase=Bound|Number=Sing (1567 tokens). Examples: šar, mār, bīt, iššak, pān, āl, šakin, rēš, pāṭ, bēl

Relations

NOUN nodes are attached to their parents using 13 different relations: nmod:poss (2043; 23% instances), appos (1856; 21% instances), obl (1735; 19% instances), obj (1421; 16% instances), conj (1222; 14% instances), nmod (253; 3% instances), nsubj (191; 2% instances), root (191; 2% instances), acl:relcl (6; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), vocative (1; 0% instances)
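The percentages in the list above can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of NOUN tokens, rounded to the nearest whole percent; under that assumption, relations with only a handful of instances show up as 0%.

```python
# Counts copied from the list of incoming relations above.
counts = {
    "nmod:poss": 2043, "appos": 1856, "obl": 1735, "obj": 1421,
    "conj": 1222, "nmod": 253, "nsubj": 191, "root": 191,
    "acl:relcl": 6, "acl": 1, "csubj": 1, "dep": 1, "vocative": 1,
}

total = sum(counts.values())  # equals the number of NOUN tokens, 8922

# Assumed rounding rule: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in counts.items()}

for rel, n in counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```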

Parents of NOUN nodes belong to 10 different parts of speech: NOUN (3511; 39% instances), VERB (3396; 38% instances), PROPN (1763; 20% instances), none, i.e. the artificial root (191; 2% instances), PRON (20; 0% instances), ADJ (19; 0% instances), DET (16; 0% instances), PART (4; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances)

1643 (18%) NOUN nodes are leaves.

4847 (54%) NOUN nodes have one child.

2033 (23%) NOUN nodes have two children.

399 (4%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 34.
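The child-degree figures above (leaves, one child, two children, three or more) can be derived by counting, for every node, how many other nodes name it as their head. The following is a minimal sketch on a made-up toy sentence; the function name `child_degree_histogram` and the `(id, upos, head)` tuple layout are assumptions for illustration, not part of any UD tool.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NOUN"):
    """Bucket nodes with the given UPOS tag by number of children: 0, 1, 2, or 3 (= three or more).

    `sentences` is a list of sentences; each sentence is a list of
    (id, upos, head) tuples, with head 0 marking the root.
    """
    hist = Counter()
    for sent in sentences:
        # Number of children per node = how often its ID appears as a head.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[min(n_children[tok_id], 3)] += 1
    return hist


# Made-up toy sentence for illustration only; not taken from the treebank.
toy = [[(1, "NOUN", 0), (2, "NOUN", 1), (3, "NOUN", 1), (4, "ADJ", 3)]]

hist = child_degree_histogram(toy)
print(dict(hist))
```

In the toy sentence, node 1 has two children, node 3 has one, and node 2 is a leaf. Summing the four buckets reported on this page (1643 + 4847 + 2033 + 399) gives 8922, the total number of NOUN tokens.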

Children of NOUN nodes are attached using 20 different relations: nmod:poss (2983; 28% instances), det:poss (1981; 19% instances), case (1863; 18% instances), conj (1228; 12% instances), amod (949; 9% instances), nmod (548; 5% instances), nummod (321; 3% instances), appos (221; 2% instances), cc (199; 2% instances), acl:relcl (197; 2% instances), dep (56; 1% instances), det (50; 0% instances), obl (18; 0% instances), nsubj (9; 0% instances), obj (8; 0% instances), advmod (3; 0% instances), cop (2; 0% instances), acl (1; 0% instances), advmod:emph (1; 0% instances), mark (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (3511; 33% instances), PRON (2049; 19% instances), ADP (1864; 18% instances), PROPN (1433; 13% instances), ADJ (838; 8% instances), NUM (324; 3% instances), VERB (212; 2% instances), CCONJ (197; 2% instances), DET (150; 1% instances), PART (56; 1% instances), ADV (2; 0% instances), X (2; 0% instances), SCONJ (1; 0% instances)