This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: X

There are 158 X lemmas (1%), 162 X types (1%) and 278 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: _, թույլ, նկատի, Բի, տանուլ, Յելլ, իսլամի, շուռ, Լոս, Նյու

The 10 most frequent X types: թույլ, ո, նկատի, ր, մ, շ, Բի, ու, տանուլ, Յելլ

The 10 most frequent ambiguous lemmas: թույլ (X 20, ADJ 3), դե (INTJ 20, X 2), Բ (ADJ 5, X 1), անց (ADP 12, X 1), գ (ADJ 1, NUM 1, X 1), դա (PRON 225, X 1), կուլ (NOUN 2, X 1), մ (NOUN 6, X 1), տարեց (ADJ 4, X 1), փուլ (NOUN 17, X 1)

The 10 most frequent ambiguous types: թույլ (X 19, ADJ 3), ո (X 11, VERB 7, NOUN 1), մ (X 8, NOUN 6), ու (CCONJ 1008, X 5), Ե (PROPN 5, X 2), Տ (NOUN 9, PROPN 6, X 2), դե (INTJ 8, X 2), Ա (PROPN 56, ADJ 8, X 1), Այ (INTJ 4, X 1), Բ (PROPN 5, ADJ 3, X 1)

Morphology

The form / lemma ratio of X is 1.025316 (the average of all parts of speech is 1.883575).
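The reported ratio coincides with the number of X types divided by the number of X lemmas given in the summary above (162 and 158). Whether the statistics generator computes it exactly this way is an assumption; the short check below only shows that the published figures are consistent with that reading.

```python
# Counts taken from the summary above.
x_types = 162   # distinct X word forms
x_lemmas = 158  # distinct X lemmas

# Assumed definition: forms per lemma = types / lemmas.
form_lemma_ratio = x_types / x_lemmas
print(f"{form_lemma_ratio:.6f}")
```

A value this close to 1 means that almost every X lemma occurs in a single form, which fits a tag used mostly for foreign words and unanalyzable fragments.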

The 1st highest number of forms (11) was observed with the lemma “_”: Ա, Ե, Զ, Ց, ը, մ, շ, ո, ու, ր, ւ.

The 2nd highest number of forms (1) was observed with the lemma “AGBU”: AGBU.

The 3rd highest number of forms (1) was observed with the lemma “Allianplace”: Allianplace.

X occurs with 7 features: Foreign (147; 53% instances), ExtPos (29; 10% instances), Style (7; 3% instances), Abbr (5; 2% instances), Echo (3; 1% instances), Hyph (1; 0% instances), Typo (1; 0% instances)

X occurs with 10 feature-value pairs: Abbr=Yes, Echo=Ech, ExtPos=PROPN, Foreign=Yes, Hyph=Yes, Style=Arch, Style=Coll, Style=Rare, Style=Vrnc, Typo=Yes

X occurs with 12 feature combinations. The most frequent feature combination is Foreign=Yes (119 tokens). Examples: Բի, իսլամի, Սի, ֆարգո, Community, Daily, In, Metal, Nas, ZipLine
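The three feature counts above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` items joined by `|` and `_` marks an empty set. The following is a minimal sketch, not the generator's actual code: `parse_feats` is a hypothetical helper, the sample values are invented for illustration, and counting the empty feature set as a combination of its own is an assumption.

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a sorted list of (name, value) pairs."""
    if not feats or feats == "_":
        return []
    return sorted(tuple(item.split("=", 1)) for item in feats.split("|"))

# Hypothetical FEATS values of five X tokens (not taken from the treebank).
feats_column = [
    "Foreign=Yes",
    "Foreign=Yes",
    "ExtPos=PROPN|Foreign=Yes",
    "_",
    "Abbr=Yes",
]

features = Counter()      # e.g. Foreign
pairs = Counter()         # e.g. Foreign=Yes
combinations = Counter()  # e.g. ExtPos=PROPN|Foreign=Yes

for feats in feats_column:
    parsed = parse_feats(feats)
    features.update(name for name, _ in parsed)
    pairs.update(f"{name}={value}" for name, value in parsed)
    # Assumption: a token without features counts as the combination "_".
    combinations["|".join(f"{name}={value}" for name, value in parsed) or "_"] += 1

print(features.most_common())
print(combinations.most_common(1))
```

Sorting the pairs before joining them keeps two tokens with the same features in a different order from being counted as different combinations.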

Relations

X nodes are attached to their parents using 20 different relations: compound:lvc (56; 20% instances), goeswith (49; 18% instances), flat:name (48; 17% instances), flat (31; 11% instances), nmod (28; 10% instances), appos (13; 5% instances), nsubj (11; 4% instances), conj (10; 4% instances), nmod:poss (8; 3% instances), compound (6; 2% instances), obl (3; 1% instances), parataxis (3; 1% instances), advcl (2; 1% instances), compound:redup (2; 1% instances), nmod:npmod (2; 1% instances), root (2; 1% instances), ccomp (1; 0% instances), fixed (1; 0% instances), list (1; 0% instances), obj (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: VERB (118; 42% instances), X (73; 26% instances), NOUN (61; 22% instances), PROPN (16; 6% instances), ADJ (4; 1% instances), no part of speech, i.e. the artificial root (2; 1% instances), ADP (1; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances)

165 (59%) X nodes are leaves.

49 (18%) X nodes have one child.

23 (8%) X nodes have two children.

41 (15%) X nodes have three or more children.

The highest child degree of an X node is 8.
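As a quick consistency check, the four child-degree counts above should add up to the 278 X tokens, and the percentages should follow from rounding each share. The snippet below only re-derives the published numbers; the dictionary keys are labels chosen for illustration.

```python
# Child-degree counts of X nodes as reported above.
degree_counts = {"0": 165, "1": 49, "2": 23, "3+": 41}

total = sum(degree_counts.values())
shares = {degree: round(100 * count / total) for degree, count in degree_counts.items()}

print(total)
print(shares)
```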

Children of X nodes are attached using 16 different relations: punct (121; 46% instances), flat:name (58; 22% instances), flat (26; 10% instances), dep (22; 8% instances), conj (12; 5% instances), appos (3; 1% instances), compound (3; 1% instances), nmod (3; 1% instances), parataxis (3; 1% instances), acl (2; 1% instances), case (2; 1% instances), cop (2; 1% instances), det (2; 1% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), cc (1; 0% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (121; 46% instances), X (73; 28% instances), NOUN (42; 16% instances), PROPN (16; 6% instances), DET (3; 1% instances), AUX (2; 1% instances), ADJ (1; 0% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), NUM (1; 0% instances), VERB (1; 0% instances)
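The relation statistics in this section can be reproduced from a CoNLL-U file by looking at the UPOS, HEAD and DEPREL columns of each token. The sketch below is an illustration under stated assumptions, not the code that generated this page: `read_sentences` and `pos_stats` are hypothetical helpers, the sample sentence uses placeholder tokens rather than treebank data, every basic token is assumed to have a numeric HEAD, and the parent of a root node is recorded as an empty part of speech.

```python
from collections import Counter

# Toy CoNLL-U fragment with placeholder tokens (hypothetical, not from the treebank).
SAMPLE = (
    "# text = w1 w2 w3 .\n"
    "1\tw1\tw1\tX\t_\tForeign=Yes\t3\tnsubj\t_\t_\n"
    "2\tw2\tw2\tX\t_\tForeign=Yes\t1\tflat\t_\t_\n"
    "3\tw3\tw3\tVERB\t_\t_\t0\troot\t_\t_\n"
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
    "\n"
)

def read_sentences(text):
    """Yield sentences as lists of token dicts, skipping comments,
    multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1)."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append({
            "id": int(cols[0]),
            "upos": cols[3],
            "head": int(cols[6]),  # assumes a numeric HEAD on every basic token
            "deprel": cols[7],
        })
    if sentence:
        yield sentence

def pos_stats(sentences, tag="X"):
    """Count incoming relations, parent POS, child degree, and the relations
    and POS of children for all nodes with the given UPOS tag."""
    names = ("deprel", "parent_pos", "degree", "child_deprel", "child_pos")
    stats = {name: Counter() for name in names}
    for sent in sentences:
        upos = {tok["id"]: tok["upos"] for tok in sent}
        n_children = Counter(tok["head"] for tok in sent)
        for tok in sent:
            if tok["upos"] == tag:
                stats["deprel"][tok["deprel"]] += 1
                # HEAD 0 is the artificial root, which has no POS.
                stats["parent_pos"][upos.get(tok["head"], "")] += 1
                stats["degree"][n_children[tok["id"]]] += 1
            if upos.get(tok["head"]) == tag:
                stats["child_deprel"][tok["deprel"]] += 1
                stats["child_pos"][tok["upos"]] += 1
    return stats

stats = pos_stats(read_sentences(SAMPLE))
for name, counter in stats.items():
    print(name, dict(counter))
```

In the toy sentence, w1 is an X node with one X child attached as flat, so it contributes to both the parent-side and the child-side counters, while w2 is a leaf.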