home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hittite-HitTB: POS Tags: NOUN

There are 155 NOUN lemmas (35%), 226 NOUN types (32%) and 387 NOUN tokens (30%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: URU, d, utnē-, šiun(i)-, GIŠ, LÚ, pišna/i-, UDU-u-, per, šiwatt-

The 10 most frequent NOUN types: URU, d, LÚ, GIŠ, KUR, KÙ.BABBAR, LÚ.MEŠ, m, DUMU, UDU

The 10 most frequent ambiguous lemmas: šuḫḫa- (NOUN 2, VERB 1), idālu- (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: GAL (ADJ 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.458065 (the average of all parts of speech is 1.571106).

The 1st highest number of forms (7) was observed with the lemma “šiun(i)-”: DINGIR, DINGIR-LIM, DINGIR-LIM-ni, DINGIR-LUM, DINGIR.MEŠ-aš, DINGIR.MEŠ-eš, ši-ú.

The 2nd highest number of forms (5) was observed with the lemma “per”: pár-na, É, É-er, É-na, É-ri.

The 3rd highest number of forms (5) was observed with the lemma “šiwatt-”: UD, UD-UM, UD-at, UD-aš, UD-ti.

NOUN occurs with 5 features: Gender (254; 66% instances), Number (223; 58% instances), Case (198; 51% instances), Language (3; 1% instances), Definite (2; 1% instances)

NOUN occurs with 14 feature-value pairs: Case=Abl, Case=Abs, Case=Acc, Case=All, Case=Dat, Case=Gen, Case=Ins, Case=Nom, Definite=Cons, Gender=Com, Gender=Neut, Language=Sum, Number=Plur, Number=Sing

NOUN occurs with 49 feature combinations. The most frequent feature combination is _ (119 tokens). Examples: URU, d, GIŠ, LÚ, m, GÍN, UDU, DUG, KAM, MA.NA

Relations

NOUN nodes are attached to their parents using 16 different relations: nmod:det (97; 25% instances), obj (69; 18% instances), obl (68; 18% instances), nmod (49; 13% instances), nsubj (45; 12% instances), conj (17; 4% instances), iobj (7; 2% instances), appos (6; 2% instances), root (6; 2% instances), xcomp (6; 2% instances), dislocated (5; 1% instances), compound (4; 1% instances), orphan (4; 1% instances), vocative (2; 1% instances), advcl (1; 0% instances), parataxis (1; 0% instances)

Parents of NOUN nodes belong to 9 different parts of speech: VERB (199; 51% instances), NOUN (92; 24% instances), PROPN (81; 21% instances), (6; 2% instances), ADJ (4; 1% instances), PRON (2; 1% instances), DET (1; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)

210 (54%) NOUN nodes are leaves.

112 (29%) NOUN nodes have one child.

45 (12%) NOUN nodes have two children.

20 (5%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 6.

Children of NOUN nodes are attached using 25 different relations: nmod (88; 32% instances), case (34; 12% instances), nmod:det (26; 9% instances), nummod (26; 9% instances), conj (17; 6% instances), discourse (15; 5% instances), amod (12; 4% instances), det (12; 4% instances), orphan (8; 3% instances), cc (6; 2% instances), nsubj (5; 2% instances), compound (4; 1% instances), acl:relcl (3; 1% instances), advmod (3; 1% instances), advmod:emph (3; 1% instances), cop (3; 1% instances), discourse:conn (2; 1% instances), dislocated (2; 1% instances), expl:pass (2; 1% instances), obl (2; 1% instances), acl (1; 0% instances), appos (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (92; 33% instances), PRON (39; 14% instances), ADP (34; 12% instances), NUM (26; 9% instances), PART (25; 9% instances), PROPN (20; 7% instances), DET (12; 4% instances), ADJ (9; 3% instances), VERB (8; 3% instances), CCONJ (6; 2% instances), ADV (3; 1% instances), AUX (3; 1% instances), SCONJ (1; 0% instances)