home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Gothic-PROIEL: POS Tags: NUM

There are 28 NUM lemmas (1%), 64 NUM types (1%) and 400 NUM tokens (1%). Out of 12 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: ains, twai, twalif, þreis, fimf, sibun, taihun, tigjus, fidwor, þūsundi

The 10 most frequent NUM types: ains, ain, ainamma, ainana, fimf, twans, aina, sibun, twalif, þrins

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: ·r· (NUM 2, ADJ 1), hundam (NOUN 2, NUM 1), ·b· (ADJ 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 2.285714 (the average of all parts of speech is 2.636364).

The 1st highest number of forms (12) was observed with the lemma “ains”: ain, aina, ainai, ainaim, ainaizos, ainamma, ainana, ainans, ainata, ainis, ains, ainz.

The 2nd highest number of forms (7) was observed with the lemma “twai”: twa, twaddje, twai, twaim, twans, twos, ·b·.

The 3rd highest number of forms (5) was observed with the lemma “twalif”: twalib, twalibe, twalibim, twalif, ·ib·.

NUM occurs with 3 features: Case (303; 76% instances), Number (303; 76% instances), Gender (292; 73% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (97 tokens). Examples: fimf, sibun, twalif, taihun, fidwor, ·ib·, saihs, ·l·, ahtau, ahtautehund

Relations

NUM nodes are attached to their parents using 17 different relations: nummod (195; 49% instances), nsubj (39; 10% instances), obj (30; 8% instances), xcomp (28; 7% instances), conj (22; 6% instances), obl (20; 5% instances), appos (13; 3% instances), orphan (10; 3% instances), root (10; 3% instances), obl:arg (9; 2% instances), nmod (7; 2% instances), dislocated (5; 1% instances), parataxis (5; 1% instances), nsubj:pass (3; 1% instances), advcl:cmp (2; 1% instances), fixed (1; 0% instances), obl:agent (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (172; 43% instances), VERB (131; 33% instances), NUM (50; 13% instances), PRON (17; 4% instances), (10; 3% instances), ADJ (9; 2% instances), ADV (5; 1% instances), PROPN (4; 1% instances), SCONJ (2; 1% instances)

254 (64%) NUM nodes are leaves.

81 (20%) NUM nodes have one child.

49 (12%) NUM nodes have two children.

16 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 16 different relations: nmod (56; 24% instances), det (31; 13% instances), conj (26; 11% instances), cc (24; 10% instances), advmod (20; 9% instances), nummod (20; 9% instances), case (17; 7% instances), orphan (17; 7% instances), appos (6; 3% instances), acl (5; 2% instances), amod (3; 1% instances), discourse (2; 1% instances), mark (2; 1% instances), nsubj (2; 1% instances), fixed (1; 0% instances), obl (1; 0% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (50; 21% instances), NOUN (44; 19% instances), DET (31; 13% instances), ADV (25; 11% instances), CCONJ (24; 10% instances), ADP (17; 7% instances), PRON (13; 6% instances), ADJ (12; 5% instances), VERB (9; 4% instances), PROPN (8; 3% instances)