
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: ADJ

There are 106 ADJ lemmas (2%), 1 ADJ type (6%) and 67604 ADJ tokens (9%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 1 in number of types and 4 in number of tokens.
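As a rough illustration of how lemma, type and token counts of this kind could be derived, here is a minimal Python sketch. It assumes the treebank has already been reduced to (form, lemma, UPOS) triples; the rows and the function name `pos_counts` are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
from collections import defaultdict

# Hypothetical toy rows (form, lemma, upos); not taken from the treebank.
ROWS = [
    ("big", "big", "ADJ"),
    ("bigger", "big", "ADJ"),
    ("big", "big", "ADJ"),
    ("house", "house", "NOUN"),
    ("houses", "house", "NOUN"),
]

def pos_counts(rows):
    """Return {upos: (distinct lemmas, distinct word forms, tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)   # distinct lemmas seen with this tag
        types[upos].add(form)     # distinct word forms (types) seen with this tag
        tokens[upos] += 1         # every occurrence counts as one token
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

counts = pos_counts(ROWS)
print(counts["ADJ"])  # (1, 2, 3)
```

In the toy data the ADJ tag has one lemma, two types and three tokens; the same counting applied to the whole treebank would yield figures like those reported above.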

The 10 most frequent ADJ lemmas: _، TBupdate، None، .، w، ,، l، hA، b، 17

The 10 most frequent ADJ types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 216429, PUNCT 72574, ADJ 66760, ADP 62646, VERB 54473, PROPN 48965, ADV 26129, SCONJ 23987, NUM 15122, AUX 6581, DET 6330, PART 5856, CCONJ 5168, PRON 2460, INTJ 54, X 32), TBupdate (NOUN 401, ADJ 280, VERB 263, X 174, ADV 74, PROPN 69, ADP 4, SCONJ 2, CCONJ 1, DET 1, PART 1, PRON 1), None (NOUN 457, X 344, VERB 264, ADJ 125, PROPN 124, ADV 34, CCONJ 20, PRON 16, SCONJ 16, PART 14, ADP 8, DET 6, AUX 2), . (NOUN 107, ADJ 95, PROPN 67, PRON 20, VERB 12, PART 6, ADP 5, X 5, CCONJ 3, ADV 2, AUX 2, DET 2, SCONJ 1), w (CCONJ 43321, NOUN 190, PUNCT 136, ADP 120, ADV 117, PROPN 78, VERB 71, SCONJ 69, ADJ 55, PRON 33, PART 10, DET 9, NUM 8, AUX 5, X 3), , (NOUN 100, CCONJ 96, VERB 34, PROPN 33, ADJ 30, ADP 30, PRON 11, SCONJ 11, PART 10, AUX 5, DET 5, ADV 4), l (ADP 15449, PART 123, NOUN 98, AUX 67, CCONJ 33, ADJ 30, PUNCT 19, VERB 9, SCONJ 8, PROPN 7, ADV 6, PRON 5, DET 2, INTJ 2, NUM 1, X 1), hA (PRON 10321, SCONJ 313, AUX 69, NOUN 56, ADP 25, CCONJ 19, ADJ 18, PUNCT 17, PROPN 9, VERB 9, ADV 4, NUM 3, PART 3, DET 1), b (ADP 12204, NOUN 65, VERB 17, ADJ 16, PUNCT 15, PRON 12, CCONJ 10, SCONJ 7, PROPN 6, ADV 5, AUX 2, PART 2, X 2, DET 1, NUM 1), 17 (ADJ 14, DET 1, NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: _ (NOUN 218254, ADP 91694, PUNCT 75148, ADJ 67604, PROPN 58325, VERB 55215, CCONJ 50032, PRON 31239, ADV 26527, SCONJ 26034, NUM 15147, PART 8612, AUX 7723, DET 6362, X 917, INTJ 56)

Morphology

The form / lemma ratio of ADJ is 0.009434 (the average of all parts of speech is 0.002933).
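The reported ratio is consistent with dividing the number of ADJ types (1) by the number of ADJ lemmas (106). The short Python check below assumes that this is how the figure is defined; the variable names are illustrative only.

```python
# Figures reported on this page for ADJ.
adj_types = 1     # distinct word forms
adj_lemmas = 106  # distinct lemmas

# Assumed definition: form / lemma ratio = types / lemmas.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")  # 0.009434
```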

The 1st highest number of forms (1) was observed with the lemma “!”: _.

The 2nd highest number of forms (1) was observed with the lemma “””: _.

The 3rd highest number of forms (1) was observed with the lemma “(”: _.

ADJ occurs with 8 features: Gender (67102; 99% instances), Number (67102; 99% instances), Definite (67059; 99% instances), Case (63518; 94% instances), Person (82; 0% instances), Voice (44; 0% instances), Mood (43; 0% instances), Polarity (1; 0% instances)

ADJ occurs with 19 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Voice=Act

ADJ occurs with 85 feature combinations. The most frequent feature combination is Case=Gen|Definite=Def|Gender=Fem|Number=Sing (16978 tokens). Examples: _

Relations

ADJ nodes are attached to their parents using 13 different relations: amod (61251; 91% instances), conj (2312; 3% instances), parataxis (1865; 3% instances), nmod:poss (1049; 2% instances), root (356; 1% instances), obj (325; 0% instances), flat (173; 0% instances), nsubj (156; 0% instances), iobj (49; 0% instances), nsubj:pass (30; 0% instances), acl (17; 0% instances), xcomp (13; 0% instances), aux (8; 0% instances)
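The percentages above can be reproduced from the raw counts. The following Python sketch assumes that each share is the relation count divided by the total number of ADJ tokens, rounded to the nearest integer; the dictionary name `PARENT_RELATIONS` is hypothetical.

```python
# Relation counts for ADJ nodes as reported on this page.
PARENT_RELATIONS = {
    "amod": 61251, "conj": 2312, "parataxis": 1865, "nmod:poss": 1049,
    "root": 356, "obj": 325, "flat": 173, "nsubj": 156, "iobj": 49,
    "nsubj:pass": 30, "acl": 17, "xcomp": 13, "aux": 8,
}

total = sum(PARENT_RELATIONS.values())  # should equal the ADJ token count

# Assumed rounding: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in PARENT_RELATIONS.items()}

print(total)           # 67604
print(shares["amod"])  # 91
```

The counts sum to 67604, the number of ADJ tokens, so every ADJ node is covered by exactly one incoming relation.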

Parents of ADJ nodes belong to 15 different parts of speech: NOUN (56585; 84% instances), VERB (3730; 6% instances), ADJ (3182; 5% instances), PROPN (1964; 3% instances), ADV (954; 1% instances), [no parent, i.e. root] (356; 1% instances), PUNCT (229; 0% instances), PRON (151; 0% instances), NUM (126; 0% instances), CCONJ (100; 0% instances), SCONJ (87; 0% instances), DET (54; 0% instances), PART (36; 0% instances), X (29; 0% instances), AUX (21; 0% instances)

55445 (82%) ADJ nodes are leaves.

6660 (10%) ADJ nodes have one child.

3404 (5%) ADJ nodes have two children.

2095 (3%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 20.
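The child-degree figures above (leaves, one child, two children, and so on) amount to a histogram of how many dependents each ADJ node has. A minimal Python sketch of that count follows, assuming each sentence is available as (id, head, UPOS) triples with head 0 marking the root; the toy sentence and the function name `child_degree_histogram` are hypothetical.

```python
from collections import Counter

# Hypothetical toy sentence as (id, head, upos) triples; head 0 marks the root.
SENTENCE = [
    (1, 0, "NOUN"),
    (2, 1, "ADJ"),
    (3, 4, "CCONJ"),
    (4, 2, "ADJ"),
    (5, 1, "ADJ"),
]

def child_degree_histogram(sentences, upos="ADJ"):
    """Map number of children -> number of nodes with the given tag."""
    hist = Counter()
    for sent in sentences:
        # How many tokens point to each head id within this sentence.
        children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag == upos:
                hist[children[tok_id]] += 1  # 0 for leaves
    return hist

hist = child_degree_histogram([SENTENCE])
print(sorted(hist.items()))  # [(0, 1), (1, 2)]
print(max(hist))             # 1
```

In the toy sentence one ADJ node is a leaf and two have a single child, so the highest child degree is 1.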

Children of ADJ nodes are attached using 20 different relations: nmod (6863; 31% instances), cc (3248; 14% instances), conj (2365; 11% instances), punct (2175; 10% instances), amod (1265; 6% instances), nsubj (1020; 5% instances), case (988; 4% instances), advmod (866; 4% instances), mark (632; 3% instances), nummod (582; 3% instances), cop (571; 3% instances), nmod:poss (402; 2% instances), parataxis (360; 2% instances), dep (313; 1% instances), ccomp (223; 1% instances), obj (194; 1% instances), xcomp (155; 1% instances), det (134; 1% instances), csubj (38; 0% instances), flat (29; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: NOUN (7188; 32% instances), CCONJ (3260; 15% instances), ADJ (3182; 14% instances), PUNCT (2175; 10% instances), ADP (1009; 4% instances), ADV (946; 4% instances), PROPN (869; 4% instances), VERB (761; 3% instances), PRON (730; 3% instances), AUX (683; 3% instances), SCONJ (644; 3% instances), NUM (600; 3% instances), PART (172; 1% instances), DET (166; 1% instances), X (36; 0% instances), INTJ (2; 0% instances)