home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Western_Armenian-ArmTDP: POS Tags: X

There are 290 X lemmas (2%), 291 X types (1%) and 456 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: նկատի, մտիկ, հակա, Ապու, la, ERVAB, de, of, et, օղլու

The 10 most frequent X types: նկատի, մտիկ, հակա, Ապու, la, ERVAB, de, of, et, օղլու

The 10 most frequent ambiguous lemmas: թոյլ (X 4, ADJ 2), Նշան (PROPN 2, X 1), հուն (NOUN 4, X 1), մի (DET 52, PART 28, NUM 2, X 1), տէ (PART 1, X 1)

The 10 most frequent ambiguous types: թոյլ (X 4, ADJ 2), ուիքի (PROPN 2, X 1), մի (DET 44, PART 24, NUM 2, X 1), տէ (PART 1, X 1)

Morphology

The form / lemma ratio of X is 1.003448 (the average of all parts of speech is 2.043120).

The 1st highest number of forms (2) was observed with the lemma “փոնկ”: փոնկ, փոնկի.

The 2nd highest number of forms (1) was observed with the lemma “AR”: AR.

The 3rd highest number of forms (1) was observed with the lemma “ARDNA”: ARDNA.

X occurs with 4 features: Foreign (363; 80% instances), Hyph (11; 2% instances), Abbr (10; 2% instances), LangId (1; 0% instances)

X occurs with 4 feature-value pairs: Abbr=Yes, Foreign=Yes, Hyph=Yes, LangId=Hy

X occurs with 5 feature combinations. The most frequent feature combination is Foreign=Yes (353 tokens). Examples: Ապու, la, de, of, et, օղլու, Disaster, Writers, le, the

Relations

X nodes are attached to their parents using 26 different relations: flat (143; 31% instances), compound:lvc (51; 11% instances), nmod (39; 9% instances), nmod:poss (34; 7% instances), parataxis (28; 6% instances), appos (22; 5% instances), nsubj (22; 5% instances), conj (17; 4% instances), flat:name (17; 4% instances), obl (17; 4% instances), obj (11; 2% instances), root (10; 2% instances), xcomp (8; 2% instances), amod (7; 2% instances), compound (6; 1% instances), compound:redup (5; 1% instances), dep (5; 1% instances), fixed (3; 1% instances), vocative (3; 1% instances), orphan (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), advmod:emph (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), obl:agent (1; 0% instances)

Parents of X nodes belong to 8 different parts of speech: X (173; 38% instances), NOUN (134; 29% instances), VERB (111; 24% instances), PROPN (19; 4% instances), (10; 2% instances), ADJ (7; 2% instances), NUM (1; 0% instances), PRON (1; 0% instances)

210 (46%) X nodes are leaves.

106 (23%) X nodes have one child.

59 (13%) X nodes have two children.

81 (18%) X nodes have three or more children.

The highest child degree of a X node is 14.

Children of X nodes are attached using 25 different relations: punct (220; 37% instances), flat (157; 27% instances), dep (47; 8% instances), flat:name (34; 6% instances), conj (21; 4% instances), compound (17; 3% instances), parataxis (11; 2% instances), fixed (10; 2% instances), amod (9; 2% instances), appos (9; 2% instances), case (9; 2% instances), cc (6; 1% instances), det (6; 1% instances), compound:redup (5; 1% instances), acl (4; 1% instances), acl:relcl (4; 1% instances), cop (4; 1% instances), nmod:poss (4; 1% instances), nmod (3; 1% instances), nsubj (3; 1% instances), discourse (2; 0% instances), obl (2; 0% instances), advmod:emph (1; 0% instances), nmod:npmod (1; 0% instances), vocative (1; 0% instances)

Children of X nodes belong to 12 different parts of speech: PUNCT (220; 37% instances), X (173; 29% instances), NOUN (89; 15% instances), PROPN (43; 7% instances), ADJ (18; 3% instances), ADP (12; 2% instances), VERB (10; 2% instances), ADV (7; 1% instances), CCONJ (6; 1% instances), DET (6; 1% instances), AUX (4; 1% instances), INTJ (2; 0% instances)