Treebank Statistics: UD_Hittite-HitTB: POS Tags: NOUN
There are 155 NOUN lemmas (35%), 226 NOUN types (32%) and 387 NOUN tokens (30%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: URU, d, utnē-, šiun(i)-, GIŠ, LÚ, pišna/i-, UDU-u-, per, šiwatt-
The 10 most frequent NOUN types: URU, d, LÚ, GIŠ, KUR, KÙ.BABBAR, LÚ.MEŠ, m, DUMU, UDU
The ambiguous lemmas (2 observed): šuḫḫa- (NOUN 2, VERB 1), idālu- (ADJ 2, NOUN 1)
The ambiguous types (1 observed): GAL (ADJ 1, NOUN 1)
- GAL
- ADJ 1: nu-u š-ša-an 2 NINDA.GUR4.RA.ḪI.A GAL 10 NINDA.GUR4.RA TUR GIŠ BANŠUR-i da-a-i
- NOUN 1: MUNUS.LUGAL f Pu-du-ḫé-pa-aš kán ku-wa-pí m UR.MAḪ.LÚ-in GAL DUB.SAR.MEŠ URU Ḫa-at-tu-ši A-NA ṬUP-PA-ḪI.A URU Ki-iz-zu-wa-at-na ša-an-ḫu-u-wa-an-zi ú-e-ri-ya-at n a-aš-ta ke-e ṬUP-PA-ḪI.A ŠA EZEN4 ḫi-šu-wa-a-aš a-pí-ya UD-at ar-ḫa a-ni-ya-at
Morphology
The form / lemma ratio of NOUN is 1.458065 (the average of all parts of speech is 1.571106).
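The published ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas reported at the top of this page. A minimal check, assuming that "form" here means "type" (the variable names are illustrative only):

```python
# Counts copied from the summary line of this page.
noun_lemmas = 155
noun_types = 226

# Form/lemma ratio: distinct word forms (types) per distinct lemma.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # prints 1.458065
```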
The highest number of forms (7) was observed with the lemma “šiun(i)-”: DINGIR, DINGIR-LIM, DINGIR-LIM-ni, DINGIR-LUM, DINGIR.MEŠ-aš, DINGIR.MEŠ-eš, ši-ú.
The second highest number of forms (5) was observed with the lemma “per”: pár-na, É, É-er, É-na, É-ri.
The same number of forms (5) was observed with the lemma “šiwatt-”: UD, UD-UM, UD-at, UD-aš, UD-ti.
NOUN occurs with 5 features: Gender (254; 66% instances), Number (223; 58% instances), Case (198; 51% instances), Language (3; 1% instances), Definite (2; 1% instances)
NOUN occurs with 14 feature-value pairs: Case=Abl, Case=Abs, Case=Acc, Case=All, Case=Dat, Case=Gen, Case=Ins, Case=Nom, Definite=Cons, Gender=Com, Gender=Neut, Language=Sum, Number=Plur, Number=Sing
NOUN occurs with 49 feature combinations.
The most frequent feature combination is _ (119 tokens).
Examples: URU, d, GIŠ, LÚ, m, GÍN, UDU, DUG, KAM, MA.NA
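Feature counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the function name noun_feature_stats, the helper row, and the toy SAMPLE rows (forms w1 to w4) are hypothetical and are not taken from HitTB.

```python
from collections import Counter


def noun_feature_stats(conllu_text, upos="NOUN"):
    """Count feature names and full FEATS combinations for one UPOS tag."""
    feature_names = Counter()
    combinations = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] != upos:
            continue
        total += 1
        feats = cols[5]
        combinations[feats] += 1  # "_" is the empty combination
        if feats != "_":
            for pair in feats.split("|"):
                name, _sep, _value = pair.partition("=")
                feature_names[name] += 1
    return total, feature_names, combinations


def row(*cols):
    """Join the ten CoNLL-U columns with tabs (toy helper)."""
    return "\t".join(cols)


# Hypothetical toy sentence, NOT real treebank data.
SAMPLE = "\n".join([
    "# text = toy example",
    row("1", "w1", "l1", "NOUN", "_", "Case=Nom|Gender=Com|Number=Sing", "3", "nsubj", "_", "_"),
    row("2", "w2", "l2", "NOUN", "_", "_", "3", "obj", "_", "_"),
    row("3", "w3", "l3", "VERB", "_", "_", "0", "root", "_", "_"),
    row("4", "w4", "l4", "NOUN", "_", "Gender=Neut", "3", "obl", "_", "_"),
])

total, names, combos = noun_feature_stats(SAMPLE)
for name, count in names.most_common():
    print(f"{name} ({count}; {round(100 * count / total)}% instances)")
```

On the real treebank, the count for the "_" key of the combinations counter would correspond to the 119 tokens reported above as carrying no features.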
Relations
NOUN nodes are attached to their parents using 16 different relations: nmod:det (97; 25% instances), obj (69; 18% instances), obl (68; 18% instances), nmod (49; 13% instances), nsubj (45; 12% instances), conj (17; 4% instances), iobj (7; 2% instances), appos (6; 2% instances), root (6; 2% instances), xcomp (6; 2% instances), dislocated (5; 1% instances), compound (4; 1% instances), orphan (4; 1% instances), vocative (2; 1% instances), advcl (1; 0% instances), parataxis (1; 0% instances)
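The percentages in this list appear to be each relation's share of all 387 NOUN tokens, rounded to a whole percent. The sketch below re-derives them from the counts printed above; the dictionary name relation_counts is illustrative only.

```python
# Counts copied from the relation list on this page.
relation_counts = {
    "nmod:det": 97, "obj": 69, "obl": 68, "nmod": 49, "nsubj": 45,
    "conj": 17, "iobj": 7, "appos": 6, "root": 6, "xcomp": 6,
    "dislocated": 5, "compound": 4, "orphan": 4, "vocative": 2,
    "advcl": 1, "parataxis": 1,
}

# The counts should add up to the number of NOUN tokens (387).
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

Because each share is rounded independently, the printed percentages need not add up to exactly 100.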
Parents of NOUN nodes belong to 9 different parts of speech (counting the artificial root, which has no part of speech, as one category): VERB (199; 51% instances), NOUN (92; 24% instances), PROPN (81; 21% instances), artificial root with no part of speech (6; 2% instances), ADJ (4; 1% instances), PRON (2; 1% instances), DET (1; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)
210 (54%) NOUN nodes are leaves.
112 (29%) NOUN nodes have one child.
45 (12%) NOUN nodes have two children.
20 (5%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 6.
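The leaf and child-count figures above can be obtained by counting, for every NOUN node, how many tokens in the same sentence name it as their head. A minimal sketch under that assumption; the function child_degree_histogram and the toy sentence are hypothetical and do not come from HitTB.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NOUN"):
    """Bucket nodes of one UPOS by number of children: 0, 1, 2, or 3 (= three or more).

    `sentences` is a list of sentences, each a list of (id, upos, head) tuples.
    """
    histogram = Counter()
    for sent in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _tok_id, _tok_upos, head in sent)
        for tok_id, tok_upos, _head in sent:
            if tok_upos == upos:
                histogram[min(n_children[tok_id], 3)] += 1
    return histogram


# Hypothetical toy sentence: (id, upos, head); head 0 is the artificial root.
toy = [[
    (1, "NOUN", 3), (2, "NOUN", 1), (3, "VERB", 0), (4, "NOUN", 3),
    (5, "ADJ", 4), (6, "NUM", 4), (7, "DET", 4), (8, "NOUN", 4),
]]

hist = child_degree_histogram(toy)
print(dict(hist))
```

As a consistency check on the page itself, the four buckets reported above (210, 112, 45 and 20) add up to the 387 NOUN tokens.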
Children of NOUN nodes are attached using 25 different relations: nmod (88; 32% instances), case (34; 12% instances), nmod:det (26; 9% instances), nummod (26; 9% instances), conj (17; 6% instances), discourse (15; 5% instances), amod (12; 4% instances), det (12; 4% instances), orphan (8; 3% instances), cc (6; 2% instances), nsubj (5; 2% instances), compound (4; 1% instances), acl:relcl (3; 1% instances), advmod (3; 1% instances), advmod:emph (3; 1% instances), cop (3; 1% instances), discourse:conn (2; 1% instances), dislocated (2; 1% instances), expl:pass (2; 1% instances), obl (2; 1% instances), acl (1; 0% instances), appos (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances)
Children of NOUN nodes belong to 13 different parts of speech: NOUN (92; 33% instances), PRON (39; 14% instances), ADP (34; 12% instances), NUM (26; 9% instances), PART (25; 9% instances), PROPN (20; 7% instances), DET (12; 4% instances), ADJ (9; 3% instances), VERB (8; 3% instances), CCONJ (6; 2% instances), ADV (3; 1% instances), AUX (3; 1% instances), SCONJ (1; 0% instances)