Treebank Statistics: UD_Akkadian-RIAO: POS Tags: NOUN
There are 697 NOUN
lemmas (42%), 1231 NOUN
types (42%) and 8729 NOUN
tokens (38%).
Out of 13 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: šarru, mātu, _, ālu, māru, kiššatu, bēlu, šadû, ēkallu, ummānu
The 10 most frequent NOUN
types: šar, māt, mār, kiššati, šarru, ālāni, libbi, maddattu, ummānāti, ilāni
The 10 most frequent ambiguous lemmas: _ (PRON 2345, NOUN 405, VERB 323, PART 169, CCONJ 133, NUM 100, ADJ 63, PROPN 47, ADV 6, ADP 5, X 3, DET 2), pānu (NOUN 83, ADV 7), šaknu (NOUN 50, VERB 30), ašru (NOUN 41, SCONJ 4), dannu (ADJ 166, NOUN 18, VERB 5), manû (VERB 19, NOUN 15), danānu (NOUN 10, VERB 5), rabû (ADJ 166, NOUN 9, VERB 4), abnu (NOUN 8, VERB 1), mahāṣu (NOUN 8, VERB 1)
The 10 most frequent ambiguous types: pān (NOUN 58, ADV 7), dannūti (NOUN 32, ADJ 22), ašar (NOUN 16, SCONJ 4), x (NOUN 16, PROPN 16, VERB 10, ADP 3, X 3, NUM 1), rabûti (ADJ 51, NOUN 7), balṭūti (ADJ 14, NOUN 4), ekṣūte (NOUN 3, ADJ 2), šaknu (VERB 11, NOUN 3), arkât (NOUN 2, ADJ 1), battubatte (ADP 3, NOUN 2, ADV 1)
- pān
- dannūti
- ašar
- x
- NOUN 16: Tukulti-Ninurta šarru rabû x
- PROPN 16: ēkal x mār Aššur-dan
- VERB 10: arki šu x
- ADP 3: x Til-ša-Zabdani u Til-ša-Abatani x
- X 3: maddattu ša {m}x hurāṣī annakī erî parzillī x taphi erî x.MEŠ erî x amhur
- NUM 1: 3 mana hurāṣi 7 mana ṣarpū kaspi x bilat annakī 40 diqārī siparri 1 biltu murru x mē immerī x mē 40 alpī 20 imērī 20 iṣṣūrāti akalī šikarī ê tibnī kissutu
- rabûti
- balṭūti
- ekṣūte
- NOUN 3: Aššur-naṣir-apli rubû naʾdu pālih ilāni rabûti ušumgallu ekdu kāšid ālāni u huršānī pāṭ gimri šunu šar bēlē mulaʾʾiṭ ekṣūte āpir šalummate lā ādiru tuqumti uršānu lā pādû murīb anunte šar tanadāte rēʾû ṣalūlu kibrāti šarru ša qibīt pî šu ušharmaṭu šadê u tâmāti
- ADJ 2: mātāti dannāte huršānī ekṣūte šarrāni ekdūte lā pādûte ultu ṣīt Šamši adi ereb Šamši ana šēpī ya ušekniša pâ ištēn ušaškin
- šaknu
- arkât
- battubatte
Morphology
The form / lemma ratio of NOUN
is 1.766141 (the average of all parts of speech is 1.795510).
The 1st highest number of forms (154) was observed with the lemma “_”: BAD.MEŠ, IM.MEŠ-ni, KA, NIG₂, abulli, ahu, amēlî, appī, asummēni, ašar, biriquš, bunnannî, bāb, bābāni, bēli, bēlē, bēlūt, danān, daprānī, diqārāt, diqārāte, dēkta, dēktu, dīkta, dūr, ebūrī, ekurrāt, emūqī, ereqqu, gabadibbī, gāgī, hurās, ihzi, išātāti, iṣṣūrē, kakkī, kasap, kinūn, kirâti, kissāte, kitekittê, kiššūt, kurummāti?, kussâ, labbāku, libba, libbi, limnīti, limētuš, liātu, lubulte, līt, līṭī, maddāte, makkūri, malkūt, ma’dūte, muddahhiṣī, muddahṣī, mudiš, muhhi, multa’’ît, multa’’īt, munēr, murte’ât, mušahmeṭi, mušerbû, mār, māt, māti, mūrānī, nablū, nakrī, namurrat, namzī’āte, narkabta, narkabāti, narû, nathi, nazzī’āte, nudunnî, nāhirī, nērbē, nērbī, nīqāte, nīše, pagūta, palê, pattûti, pilše, puhur, pî, pāni, pēt, qumāšātu, qāti, qā’iš, ritti, rā’im, rā’imat, sa-x-ti, sakkī, salmē, siqir, siqr, sā’te, sītāt, sītāte, tabbilī, tanatt, tidūki, tiklē, tuqmate, tuqmati, tuqumtu, tîrēte, ummānāti, urdūte, urdūti, urdūtu, uznī, x, x-ba-meš, x-e, x-i-te, x-x-x, x-x-x.MEŠ, x-šunu, x.MEŠ, {NA₄}x, āl, āli, ēkallāti, šaknūte, šaknūti, šamgāni, šangût, šanāti, šaressu, šarrūt, še’u, še’ī, še’ū, šum, šumi, šumēli, šurmēnī, šīrī, ūme, ṣalūl, ṣibtāti, ṣulūl, ṣābī, ṣēnī.
The 2nd highest number of forms (8) was observed with the lemma “mātu”: mās, māt, māta, māti, mātu, mātāt, mātāte, mātāti.
The 3rd highest number of forms (7) was observed with the lemma “nakru”: nakirē, nakirī, nakri, nakrī, nakrūt, nakrūti, nukrī.
NOUN
occurs with 6 features: Number (8708; 100% instances), Gender (8706; 100% instances), NounBase (7169; 82% instances), Case (3182; 36% instances), Person (30; 0% instances), VerbForm (27; 0% instances)
NOUN
occurs with 16 feature-value pairs: Case=Acc
, Case=Gen
, Case=Loc
, Case=Nom
, Gender=Com
, Gender=Fem
, Gender=Masc
, NounBase=Bound
, NounBase=Free
, NounBase=Suffixal
, NounBase=Terminal
, Number=Plur
, Number=Sing
, Person=1
, Person=3
, VerbForm=Stat
NOUN
occurs with 47 feature combinations.
The most frequent feature combination is Gender=Masc|NounBase=Bound|Number=Sing
(1525 tokens).
Examples: šar, mār, bīt, iššak, pān, āl, šakin, rēš, pāṭ, bēl
Relations
NOUN
nodes are attached to their parents using 13 different relations: nmod:poss (1983; 23% instances), appos (1804; 21% instances), obl (1719; 20% instances), obj (1393; 16% instances), conj (1202; 14% instances), nmod (250; 3% instances), nsubj (189; 2% instances), root (179; 2% instances), acl:relcl (6; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), vocative (1; 0% instances)
Parents of NOUN
nodes belong to 10 different parts of speech: NOUN (3424; 39% instances), VERB (3351; 38% instances), PROPN (1716; 20% instances), (179; 2% instances), PRON (21; 0% instances), ADJ (17; 0% instances), DET (16; 0% instances), PART (3; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances)
1603 (18%) NOUN
nodes are leaves.
4723 (54%) NOUN
nodes have one child.
2012 (23%) NOUN
nodes have two children.
391 (4%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 34.
Children of NOUN
nodes are attached using 20 different relations: nmod:poss (2915; 28% instances), det:poss (1947; 19% instances), case (1844; 18% instances), conj (1208; 12% instances), amod (924; 9% instances), nmod (539; 5% instances), nummod (316; 3% instances), appos (215; 2% instances), cc (197; 2% instances), acl:relcl (185; 2% instances), dep (55; 1% instances), det (49; 0% instances), obl (18; 0% instances), advmod (13; 0% instances), nsubj (9; 0% instances), obj (6; 0% instances), cop (2; 0% instances), acl (1; 0% instances), advmod:emph (1; 0% instances), mark (1; 0% instances)
Children of NOUN
nodes belong to 13 different parts of speech: NOUN (3424; 33% instances), PRON (2014; 19% instances), ADP (1845; 18% instances), PROPN (1416; 14% instances), ADJ (818; 8% instances), NUM (319; 3% instances), VERB (200; 2% instances), CCONJ (195; 2% instances), DET (145; 1% instances), PART (64; 1% instances), ADV (2; 0% instances), X (2; 0% instances), SCONJ (1; 0% instances)