Treebank Statistics: UD_Latin-PROIEL: POS Tags: NOUN
There are 3238 NOUN
lemmas (37%), 7672 NOUN
types (25%) and 40437 NOUN
tokens (20%).
Out of 14 observed tags, the rank of NOUN
is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent NOUN
lemmas: deus, homo, res, dominus, dies, filius, locus, pater, frater, spiritus
The 10 most frequent NOUN
types: dei, deus, die, deo, domini, filius, fratres, re, rebus, deum
The 10 most frequent ambiguous lemmas: princeps (NOUN 149, ADJ 12), mundus (NOUN 143, ADJ 28), epistula (NOUN 103, ADJ 1), amicus (NOUN 86, ADJ 9), liber (NOUN 80, ADJ 38), sol (NOUN 54, PROPN 1), labor (NOUN 49, VERB 6), inimicus (NOUN 45, ADJ 23), Romanus (ADJ 91, NOUN 41), publicanus (NOUN 36, ADJ 1)
The 10 most frequent ambiguous types: re (NOUN 155, ADV 33), rem (NOUN 104, ADV 7), diem (NOUN 87, ADV 15), bellum (NOUN 67, ADJ 2), principes (NOUN 58, ADJ 6), vocem (NOUN 57, VERB 1), iudaei (ADJ 1, NOUN 1), signa (NOUN 54, VERB 1), exercitum (NOUN 51, VERB 1), manu (NOUN 50, VERB 3)
- re
- rem
- diem
- bellum
- principes
- vocem
- iudaei
- signa
- exercitum
- manu
Morphology
The form / lemma ratio of NOUN
is 2.369364 (the average of all parts of speech is 3.418760).
The 1st highest number of forms (11) was observed with the lemma “ventus”: uenti, uentis, uento, uentos, venti, ventis, vento, ventorum, ventos, ventum, ventus.
The 2nd highest number of forms (11) was observed with the lemma “voluptas”: uoluptate, uoluptatem, uoluptati, uoluptatis, voluptate, voluptatem, voluptates, voluptati, voluptatibus, voluptatis, voluptatum.
The 3rd highest number of forms (10) was observed with the lemma “vinea”: uinea, uineae, uinearum, uineas, uineis, vinea, vineae, vineam, vineas, vineis.
NOUN
occurs with 3 features: Case (40361; 100% instances), Number (40361; 100% instances), Gender (40357; 100% instances)
NOUN
occurs with 13 feature-value pairs: Case=Abl
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Fem,Masc
, Gender=Masc
, Gender=Masc,Neut
, Gender=Neut
, Number=Plur
, Number=Sing
NOUN
occurs with 52 feature combinations.
The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing
(4144 tokens).
Examples: domum, terram, rem, partem, fidem, potestatem, civitatem, gloriam, legem, vocem
Relations
NOUN
nodes are attached to their parents using 21 different relations: obl (10121; 25% instances), obj (6513; 16% instances), nmod (5916; 15% instances), nsubj (5881; 15% instances), conj (3936; 10% instances), nsubj:pass (2214; 5% instances), iobj (1927; 5% instances), appos (1036; 3% instances), root (1023; 3% instances), xcomp (468; 1% instances), vocative (454; 1% instances), advcl (228; 1% instances), obl:agent (216; 1% instances), ccomp (204; 1% instances), orphan (186; 0% instances), dislocated (61; 0% instances), parataxis (21; 0% instances), dep (18; 0% instances), csubj:pass (9; 0% instances), fixed (3; 0% instances), acl (2; 0% instances)
Parents of NOUN
nodes belong to 15 different parts of speech: VERB (25030; 62% instances), NOUN (9527; 24% instances), ADJ (1925; 5% instances), (1023; 3% instances), PROPN (783; 2% instances), ADV (779; 2% instances), AUX (503; 1% instances), PRON (445; 1% instances), NUM (210; 1% instances), CCONJ (63; 0% instances), DET (49; 0% instances), INTJ (35; 0% instances), X (26; 0% instances), SCONJ (22; 0% instances), ADP (17; 0% instances)
13635 (34%) NOUN
nodes are leaves.
15274 (38%) NOUN
nodes have one child.
6995 (17%) NOUN
nodes have two children.
4533 (11%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 29.
Children of NOUN
nodes are attached using 28 different relations: case (9465; 20% instances), det (8285; 18% instances), nmod (6666; 14% instances), amod (4683; 10% instances), conj (3788; 8% instances), cc (3483; 7% instances), acl (2609; 6% instances), cop (1472; 3% instances), nsubj (1258; 3% instances), advmod (1225; 3% instances), nummod (869; 2% instances), appos (794; 2% instances), discourse (648; 1% instances), mark (307; 1% instances), orphan (282; 1% instances), obl (256; 1% instances), advcl (186; 0% instances), ccomp (171; 0% instances), iobj (96; 0% instances), vocative (50; 0% instances), dislocated (46; 0% instances), obj (25; 0% instances), parataxis (14; 0% instances), aux (6; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), obl:agent (2; 0% instances), xcomp (2; 0% instances)
Children of NOUN
nodes belong to 14 different parts of speech: NOUN (9527; 20% instances), ADP (9471; 20% instances), DET (6622; 14% instances), ADJ (5820; 12% instances), CCONJ (3497; 7% instances), VERB (3175; 7% instances), PRON (2555; 5% instances), ADV (1794; 4% instances), AUX (1524; 3% instances), PROPN (1388; 3% instances), NUM (936; 2% instances), SCONJ (315; 1% instances), INTJ (46; 0% instances), X (22; 0% instances)