home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cappadocian-AMGiC: POS Tags: NOUN

There are 98 NOUN lemmas (28%), 104 NOUN types (24%) and 134 NOUN tokens (16%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: peδí, kóri, pará, psomí, staχtiǰís, vavás, (e)néka, Dunyá, Güzelí, dergízi

The 10 most frequent NOUN types: peδí, kóri, pará, psomí, Dunyá, dergizmú, enéka, kenér, mána, neró

The 10 most frequent ambiguous lemmas: _ (X 2, NOUN 1, PRON 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.061224 (the average of all parts of speech is 1.244253).

The 1st highest number of forms (2) was observed with the lemma “(e)néka”: enéka, néka.

The 2nd highest number of forms (2) was observed with the lemma “Güzelí”: Güzelidyú, Güzelí.

The 3rd highest number of forms (2) was observed with the lemma “enéka”: enéka, enékan.

NOUN occurs with 3 features: Case (129; 96% instances), Number (129; 96% instances), Gender (128; 96% instances)

NOUN occurs with 9 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 18 feature combinations. The most frequent feature combination is Case=Acc|Gender=Neut|Number=Sing (36 tokens). Examples: peδí, psomí, kenér, spíči, cüréi, fés, geleǰí, imurǰáχ, irésja, kalaǰí

Relations

NOUN nodes are attached to their parents using 12 different relations: obj (41; 31% instances), nsubj (34; 25% instances), obl (25; 19% instances), nmod (14; 10% instances), vocative (6; 4% instances), conj (5; 4% instances), ccomp (2; 1% instances), iobj (2; 1% instances), root (2; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), xcomp (1; 1% instances)

Parents of NOUN nodes belong to 6 different parts of speech: VERB (100; 75% instances), NOUN (21; 16% instances), ADV (8; 6% instances), ADJ (2; 1% instances), (2; 1% instances), PROPN (1; 1% instances)

30 (22%) NOUN nodes are leaves.

51 (38%) NOUN nodes have one child.

32 (24%) NOUN nodes have two children.

21 (16%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 18 different relations: det (70; 38% instances), nmod (39; 21% instances), amod (12; 6% instances), case (12; 6% instances), punct (12; 6% instances), cop (6; 3% instances), nummod (6; 3% instances), advmod (5; 3% instances), conj (5; 3% instances), cc (4; 2% instances), mark (4; 2% instances), acl:relcl (3; 2% instances), acl (2; 1% instances), aux:q (2; 1% instances), advmod:emph (1; 1% instances), appos (1; 1% instances), det:poss (1; 1% instances), nsubj (1; 1% instances)

Children of NOUN nodes belong to 12 different parts of speech: DET (67; 36% instances), PRON (29; 16% instances), NOUN (21; 11% instances), ADJ (12; 6% instances), ADP (12; 6% instances), PUNCT (12; 6% instances), AUX (8; 4% instances), ADV (7; 4% instances), NUM (6; 3% instances), CCONJ (4; 2% instances), SCONJ (4; 2% instances), VERB (4; 2% instances)