home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-ITTB: POS Tags: NOUN

There are 2394 NOUN lemmas (38%), 5881 NOUN types (28%) and 90317 NOUN tokens (20%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: forma, homo, corpus, res, intellectus, anima, materia, natura, substantia, actus

The 10 most frequent NOUN types: forma, esse, intellectus, formam, formae, materia, anima, potentia, homo, corpus

The 10 most frequent ambiguous lemmas: esse (NOUN 1419, VERB 1), bonum (NOUN 1087, ADJ 7), agens (NOUN 865, VERB 1), malum (NOUN 538, ADJ 1), amplus (NOUN 427, ADJ 9), dominus (NOUN 401, ADJ 4), philosophus (NOUN 247, ADJ 1), mundus (NOUN 203, ADJ 7), deus (PROPN 4277, NOUN 180), liber (NOUN 174, ADJ 98)

The 10 most frequent ambiguous types: esse (AUX 3031, NOUN 1418, VERB 1), intellectus (NOUN 1359, VERB 6), bonum (NOUN 699, ADJ 110), modo (NOUN 590, ADV 1), motus (NOUN 437, VERB 7), amplius (NOUN 427, ADV 17, ADJ 6), malum (NOUN 411, ADJ 33), intellectum (NOUN 407, VERB 60), agens (NOUN 373, VERB 136), actum (NOUN 296, VERB 3)

Morphology

The form / lemma ratio of NOUN is 2.456558 (the average of all parts of speech is 3.337297).

The 1st highest number of forms (11) was observed with the lemma “perfectus”: perfecta, perfectam, perfecti, perfectior, perfectiora, perfectiorem, perfectissime, perfectius, perfecto, perfectum, perfectus.

The 2nd highest number of forms (10) was observed with the lemma “malus”: mala, malas, male, mali, malis, malo, malorum, malos, malum, malus.

The 3rd highest number of forms (9) was observed with the lemma “agens”: agens, agente, agentem, agentes, agenti, agentia, agentibus, agentis, agentium.

NOUN occurs with 9 features: Case (89864; 99% instances), Number (89864; 99% instances), InflClass (88970; 99% instances), Gender (88932; 98% instances), Abbr (404; 0% instances), Proper (299; 0% instances), NameType (95; 0% instances), VerbForm (22; 0% instances), Foreign (2; 0% instances)

NOUN occurs with 26 feature-value pairs: Abbr=Yes, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Case=Voc, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=Ind, InflClass=IndEurA, InflClass=IndEurE, InflClass=IndEurI, InflClass=IndEurO, InflClass=IndEurU, InflClass=IndEurX, NameType=Let, NameType=Lit, NameType=Nat, Number=Plur, Number=Sing, Proper=Yes, VerbForm=Part

NOUN occurs with 190 feature combinations. The most frequent feature combination is Case=Nom|Gender=Fem|InflClass=IndEurA|Number=Sing (6465 tokens). Examples: forma, anima, causa, materia, substantia, natura, potentia, essentia, creatura, uita

Relations

NOUN nodes are attached to their parents using 26 different relations: obl (18132; 20% instances), nmod (16514; 18% instances), nsubj (16211; 18% instances), obj (7611; 8% instances), conj (7462; 8% instances), obl:arg (7090; 8% instances), nsubj:pass (5693; 6% instances), advcl (3752; 4% instances), root (2960; 3% instances), xcomp (1022; 1% instances), orphan (951; 1% instances), acl:relcl (863; 1% instances), csubj (460; 1% instances), conj:expl (337; 0% instances), ccomp (265; 0% instances), xcomp:pred (262; 0% instances), advcl:cmpr (252; 0% instances), acl (155; 0% instances), appos (129; 0% instances), csubj:pass (97; 0% instances), flat (41; 0% instances), vocative (29; 0% instances), parataxis (25; 0% instances), dislocated:nsubj (2; 0% instances), obl:agent (1; 0% instances), parataxis:rep (1; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (47410; 52% instances), NOUN (25072; 28% instances), AUX (6690; 7% instances), ADJ (4344; 5% instances), (2960; 3% instances), DET (1423; 2% instances), PRON (820; 1% instances), NUM (566; 1% instances), ADV (498; 1% instances), PROPN (345; 0% instances), PART (174; 0% instances), ADP (11; 0% instances), CCONJ (4; 0% instances)

20348 (23%) NOUN nodes are leaves.

35119 (39%) NOUN nodes have one child.

19514 (22%) NOUN nodes have two children.

15336 (17%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.

Children of NOUN nodes are attached using 34 different relations: case (24569; 18% instances), nmod (20653; 15% instances), amod (15417; 11% instances), det (13080; 10% instances), punct (12017; 9% instances), cc (7522; 6% instances), conj (6781; 5% instances), acl (5824; 4% instances), cop (5223; 4% instances), mark (4585; 3% instances), acl:relcl (4433; 3% instances), nsubj (3715; 3% instances), advmod (2179; 2% instances), advmod:emph (2028; 1% instances), advcl (1766; 1% instances), nummod (1301; 1% instances), advmod:neg (1274; 1% instances), orphan (861; 1% instances), obl (679; 1% instances), csubj (327; 0% instances), conj:expl (269; 0% instances), advcl:cmpr (243; 0% instances), advcl:pred (143; 0% instances), appos (114; 0% instances), obl:arg (74; 0% instances), flat (47; 0% instances), discourse (41; 0% instances), parataxis (32; 0% instances), ccomp (12; 0% instances), aux (8; 0% instances), xcomp (4; 0% instances), vocative (3; 0% instances), nsubj:pass (2; 0% instances), aux:pass (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (25072; 19% instances), ADP (24642; 18% instances), ADJ (17204; 13% instances), DET (15070; 11% instances), PUNCT (12017; 9% instances), VERB (10338; 8% instances), CCONJ (7905; 6% instances), AUX (6124; 5% instances), ADV (4663; 3% instances), SCONJ (3981; 3% instances), PRON (3591; 3% instances), PROPN (1831; 1% instances), NUM (1416; 1% instances), PART (1365; 1% instances), X (8; 0% instances)