home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Coptic: POS Tags: NOUN

There are 538 NOUN lemmas (44%), 553 NOUN types (39%) and 1539 NOUN tokens (14%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: ⲛⲟⲩⲧⲉ, ϩⲉ, ϩⲏⲧ, ⲙⲁ, ⲣⲱⲙⲉ, ⲏⲓ, ϣⲁϫⲉ, ⲩⲛⲟⲩ, ⲗⲁⲁⲩ, ⲙⲏⲏϣⲉ

The 10 most frequent NOUN types: ⲛⲟⲩⲧⲉ, ϩⲉ, ϩⲏⲧ, ⲙⲁ, ⲣⲱⲙⲉ, ϣⲁϫⲉ, ⲏⲓ, ⲩⲛⲟⲩ, ⲗⲁⲁⲩ, ⲙⲏⲏϣⲉ

The 10 most frequent ambiguous lemmas: ϩⲉ (NOUN 49, VERB 8), ϩⲏⲧ (NOUN 40, ADP 1), ⲙⲁ (NOUN 37, AUX 1), ϣⲁϫⲉ (NOUN 26, VERB 11), ⲩⲛⲟⲩ (NOUN 24, ADV 1), ϩⲟⲟⲩ (NOUN 15, VERB 1), ⲕⲉ (DET 23, NOUN 10), ⲡⲉ (PRON 49, PART 21, NOUN 7), ⲣⲟ (NOUN 7, VERB 1), ϩⲓⲥⲉ (NOUN 6, VERB 1)

The 10 most frequent ambiguous types: ϩⲉ (NOUN 49, VERB 8), ⲙⲁ (NOUN 37, AUX 1), ϣⲁϫⲉ (NOUN 26, VERB 11), ⲩⲛⲟⲩ (NOUN 24, ADV 1), ϩⲟⲟⲩ (NOUN 15, VERB 1), ϩⲓⲥⲉ (NOUN 6, VERB 1), ⲭⲣⲓⲥⲧⲟⲥ (NOUN 6, PROPN 2), ⲉⲛⲉϩ (NOUN 5, ADV 2), ⲟⲩⲱϣ (VERB 7, NOUN 5), ⲃⲟⲗ (NOUN 4, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.027881 (the average of all parts of speech is 1.154412).

The 1st highest number of forms (2) was observed with the lemma “ϩⲏ”: ϩⲏ, ϩⲏⲧ.

The 2nd highest number of forms (2) was observed with the lemma “ϩⲏⲧ”: ϩⲏⲧ, ϩⲧⲏ.

The 3rd highest number of forms (2) was observed with the lemma “ϩⲗⲗⲟ”: ϩⲗⲗⲟ, ϩⲗⲗⲟⲓ.

NOUN occurs with 1 features: PronType (4; 0% instances)

NOUN occurs with 1 feature-value pairs: PronType=Rcp

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (1535 tokens). Examples: ⲛⲟⲩⲧⲉ, ϩⲉ, ϩⲏⲧ, ⲙⲁ, ⲣⲱⲙⲉ, ϣⲁϫⲉ, ⲏⲓ, ⲩⲛⲟⲩ, ⲗⲁⲁⲩ, ⲙⲏⲏϣⲉ

Relations

NOUN nodes are attached to their parents using 22 different relations: obl (485; 32% instances), nmod (306; 20% instances), obj (209; 14% instances), nsubj (169; 11% instances), conj (128; 8% instances), dislocated (79; 5% instances), root (46; 3% instances), appos (33; 2% instances), acl (20; 1% instances), ccomp (13; 1% instances), vocative (12; 1% instances), advmod (10; 1% instances), advcl (8; 1% instances), amod (6; 0% instances), parataxis (5; 0% instances), case (2; 0% instances), fixed (2; 0% instances), xcomp (2; 0% instances), aux (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), orphan (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (970; 63% instances), NOUN (423; 27% instances), (46; 3% instances), PRON (37; 2% instances), DET (27; 2% instances), PROPN (22; 1% instances), NUM (8; 1% instances), ADV (2; 0% instances), PART (2; 0% instances), ADP (1; 0% instances), SCONJ (1; 0% instances)

64 (4%) NOUN nodes are leaves.

399 (26%) NOUN nodes have one child.

605 (39%) NOUN nodes have two children.

471 (31%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 10.

Children of NOUN nodes are attached using 28 different relations: det (1205; 37% instances), case (981; 30% instances), nmod (339; 10% instances), acl (140; 4% instances), cc (133; 4% instances), conj (129; 4% instances), advmod (76; 2% instances), punct (60; 2% instances), mark (54; 2% instances), advcl (26; 1% instances), nsubj (25; 1% instances), cop (22; 1% instances), appos (21; 1% instances), xcomp (11; 0% instances), aux (8; 0% instances), amod (6; 0% instances), ccomp (6; 0% instances), flat (6; 0% instances), parataxis (6; 0% instances), nummod (5; 0% instances), csubj (4; 0% instances), dislocated (4; 0% instances), obj (3; 0% instances), dep (2; 0% instances), orphan (2; 0% instances), discourse (1; 0% instances), obl (1; 0% instances), vocative (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: DET (1177; 36% instances), ADP (1009; 31% instances), NOUN (423; 13% instances), VERB (168; 5% instances), PRON (129; 4% instances), ADV (87; 3% instances), CCONJ (76; 2% instances), PUNCT (60; 2% instances), PART (55; 2% instances), PROPN (37; 1% instances), SCONJ (27; 1% instances), AUX (18; 1% instances), NUM (8; 0% instances), X (3; 0% instances)