home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Gothic-PROIEL: POS Tags: NUM

There are 28 NUM lemmas (1%), 64 NUM types (1%) and 400 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 7 in number of lemmas, 8 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: ains, twai, twalif, þreis, fimf, sibun, taihun, tigjus, fidwor, þūsundi

The 10 most frequent NUM types: ains, ain, ainamma, ainana, fimf, twans, aina, sibun, twalif, þrins

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: ·r· (NUM 2, ADJ 1), hundam (NOUN 2, NUM 1), ·b· (ADJ 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 2.285714 (the average of all parts of speech is 2.624779).

The 1st highest number of forms (12) was observed with the lemma “ains”: ain, aina, ainai, ainaim, ainaizos, ainamma, ainana, ainans, ainata, ainis, ains, ainz.

The 2nd highest number of forms (7) was observed with the lemma “twai”: twa, twaddje, twai, twaim, twans, twos, ·b·.

The 3rd highest number of forms (5) was observed with the lemma “twalif”: twalib, twalibe, twalibim, twalif, ·ib·.

NUM occurs with 3 features: Case (303; 76% instances), Number (303; 76% instances), Gender (292; 73% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (97 tokens). Examples: fimf, sibun, twalif, taihun, fidwor, ·ib·, saihs, ·l·, ahtau, ahtautehund

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (195; 49% instances), nsubj (40; 10% instances), obj (32; 8% instances), appos (23; 6% instances), xcomp (20; 5% instances), obl (18; 5% instances), orphan (18; 5% instances), conj (14; 4% instances), iobj (12; 3% instances), advmod (8; 2% instances), nmod (7; 2% instances), root (7; 2% instances), nsubj:pass (3; 1% instances), ccomp (1; 0% instances), flat (1; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (177; 44% instances), VERB (131; 33% instances), NUM (50; 13% instances), ADJ (15; 4% instances), PRON (7; 2% instances), (7; 2% instances), ADV (6; 2% instances), PROPN (4; 1% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), SCONJ (1; 0% instances)

263 (66%) NUM nodes are leaves.

78 (20%) NUM nodes have one child.

37 (9%) NUM nodes have two children.

22 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 16 different relations: nmod (56; 24% instances), orphan (32; 14% instances), det (31; 13% instances), advmod (20; 9% instances), nummod (20; 9% instances), cc (19; 8% instances), case (17; 7% instances), conj (15; 7% instances), appos (6; 3% instances), acl (5; 2% instances), amod (3; 1% instances), nsubj (2; 1% instances), discourse (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), obl (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (50; 22% instances), NOUN (44; 19% instances), DET (31; 13% instances), CCONJ (25; 11% instances), ADV (20; 9% instances), ADJ (18; 8% instances), ADP (17; 7% instances), PRON (9; 4% instances), PROPN (8; 3% instances), VERB (7; 3% instances), SCONJ (1; 0% instances)