home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Greek-Perseus: POS Tags: NOUN

There are 5753 NOUN lemmas (41%), 12716 NOUN types (30%) and 41252 NOUN tokens (20%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: ἀνήρ, θεός, ναῦς, ζεύς, χείρ, τρώς, πόλις, θυμός, παῖς, ἵππος

The 10 most frequent NOUN types: θεῶν, ἀνδρῶν, Ἕκτωρ, Διὸς, ἵππους, θυμὸν, πόλιν, Τρώων, Ζεὺς, νῆας

The 10 most frequent ambiguous lemmas: Ἀθήναιος (NOUN 157, ADJ 2), Λακεδαιμόνιος (NOUN 135, ADJ 8), Ἀθηναῖος (NOUN 87, ADJ 19), βοῦς (NOUN 82, ADJ 1), πλῆθος (NOUN 76, ADJ 1), νεκρός (NOUN 64, ADJ 14), Κορίνθιος (NOUN 63, ADJ 12), χάρις (NOUN 51, ADP 2), βάρβαρος (NOUN 59, ADJ 18), Ἴλιος (NOUN 56, ADJ 2)

The 10 most frequent ambiguous types: Ἀθηναῖοι (NOUN 80, ADJ 1), Ἀθηναίων (NOUN 76, ADJ 3), Λακεδαιμόνιοι (NOUN 55, ADJ 1), Ἴλιον (NOUN 41, ADJ 1), χάριν (NOUN 39, ADP 3), ἀρχὴν (NOUN 34, ADV 1), Λακεδαιμονίους (NOUN 29, ADJ 2), Ἀχαιῶν (ADJ 181, NOUN 27), βαρβάρων (NOUN 25, ADJ 7), μηδὲν (NOUN 22, PRON 18)

Morphology

The form / lemma ratio of NOUN is 2.210325 (the average of all parts of speech is 3.010372).

The 1st highest number of forms (33) was observed with the lemma “χείρ”: χέρα, χέρας, χέρεσσιν, χέρσ̓, χείρ, χείρεσι, χείρεσσ̓, χείρεσσι, χείρεσσιν, χειρί, χειροῖν, χειρός, χειρὶ, χειρὸς, χειρῶν, χερί, χεροῖν, χερσί, χερσίν, χερσὶ, χερσὶν, χερός, χερὶ, χερὸς, χερῶν, χεὶρ, χεῖράς, χεῖρές, χεῖρα, χεῖρας, χεῖρε, χεῖρες, χεῖῤ.

The 2nd highest number of forms (33) was observed with the lemma “ἀνήρ”: τἀνδρός, τἀνδρὶ, τἀνδρὸς, ἀνέρα, ἀνέρας, ἀνέρε, ἀνέρες, ἀνέρι, ἀνέρος, ἀνέρων, ἀνήρ, ἀνδράσι, ἀνδράσιν, ἀνδρί, ἀνδρός, ἀνδρὶ, ἀνδρὸς, ἀνδρῶν, ἀνὴρ, ἁνήρ, ἁνὴρ, ἄνδρά, ἄνδρα, ἄνδρας, ἄνδρε, ἄνδρες, ἄνδρεσσι, ἄνδρεσσιν, ἄνδῤ, ἄνερ, ἅνδρες, ἆνερ, ὡνὴρ.

The 3rd highest number of forms (32) was observed with the lemma “ναῦς”: νέας, νέες, νέεσσι, νέεσσιν, νήεσσι, νήεσσιν, ναυσί, ναυσὶ, ναυσὶν, ναὸς, ναῦν, ναῦς, ναῦφι, ναῦφιν, νεός, νεὸς, νεὼς, νεῶν, νηυσί, νηυσίν, νηυσὶ, νηυσὶν, νηός, νηὶ, νηὸς, νηῒ, νηῦς, νηῶν, νῆάς, νῆα, νῆας, νῆες.

NOUN occurs with 3 features: Case (41248; 100% instances), Number (41236; 100% instances), Gender (41147; 100% instances)

NOUN occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NOUN occurs with 54 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (4717 tokens). Examples: Ἕκτωρ, Ζεὺς, ἀνὴρ, Ἀπόλλων, Ἀχιλλεύς, Αἴας, πατὴρ, θυμὸς, Ἀγαμέμνων, βασιλεὺς

Relations

NOUN nodes are attached to their parents using 15 different relations: obj (11160; 27% instances), obl (9371; 23% instances), nsubj (8323; 20% instances), nmod (6917; 17% instances), conj (2765; 7% instances), vocative (771; 2% instances), xcomp (664; 2% instances), iobj (648; 2% instances), appos (237; 1% instances), root (215; 1% instances), advcl (86; 0% instances), acl (82; 0% instances), case (10; 0% instances), orphan (2; 0% instances), nsubj:outer (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (29551; 72% instances), NOUN (8879; 22% instances), ADJ (1722; 4% instances), PRON (508; 1% instances), DET (232; 1% instances), (215; 1% instances), ADV (115; 0% instances), CCONJ (9; 0% instances), ADP (8; 0% instances), X (7; 0% instances), NUM (3; 0% instances), PART (3; 0% instances)

12383 (30%) NOUN nodes are leaves.

16116 (39%) NOUN nodes have one child.

8222 (20%) NOUN nodes have two children.

4531 (11%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 98.

Children of NOUN nodes are attached using 23 different relations: nmod (18747; 38% instances), det (9095; 18% instances), case (7685; 15% instances), conj (2836; 6% instances), advmod (2579; 5% instances), cc (2267; 5% instances), punct (1967; 4% instances), acl (1105; 2% instances), amod (976; 2% instances), xcomp (707; 1% instances), cop (531; 1% instances), nsubj (306; 1% instances), appos (244; 0% instances), discourse (209; 0% instances), nummod (182; 0% instances), obl (156; 0% instances), advcl (88; 0% instances), mark (66; 0% instances), obj (39; 0% instances), csubj (26; 0% instances), vocative (18; 0% instances), iobj (3; 0% instances), ccomp (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: ADJ (12277; 25% instances), DET (9097; 18% instances), NOUN (8879; 18% instances), ADP (7657; 15% instances), VERB (2496; 5% instances), CCONJ (2085; 4% instances), PUNCT (1967; 4% instances), PRON (1552; 3% instances), ADV (1509; 3% instances), PART (1336; 3% instances), AUX (531; 1% instances), INTJ (209; 0% instances), NUM (184; 0% instances), SCONJ (49; 0% instances), X (5; 0% instances)