Treebank Statistics: UD_Arabic-NYUAD: POS Tags: ADV
There are 8 ADV
lemmas (0%), 1 ADV
types (6%) and 24067 ADV
tokens (3%).
Out of 16 observed tags, the rank of ADV
is: 13 in number of lemmas, 3 in number of types and 9 in number of tokens.
The 10 most frequent ADV
lemmas: _، lA، mA، ,، l، “، 6، f
The 10 most frequent ADV
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), lA (ADV 100, PROPN 6, ADP 1, PART 1), mA (SCONJ 667, PRON 320, ADV 5, VERB 1), , (PUNCT 254, CCONJ 68, NOUN 9, ADJ 8, ADP 7, NUM 5, SCONJ 4, VERB 4, PRON 3, PROPN 3, ADV 2, DET 1, PART 1), l (ADP 15628, PART 165, NOUN 29, SCONJ 28, ADV 2, VERB 2, ADJ 1, DET 1, NUM 1, PROPN 1, PUNCT 1, X 1), “ (PUNCT 207, ADP 6, PROPN 6, CCONJ 5, NOUN 5, ADJ 2, PART 2, ADV 1, VERB 1), 6 (NUM 8, ADV 1), f (CCONJ 1360, PART 815, SCONJ 18, PRON 4, VERB 3, NOUN 2, ADP 1, ADV 1, PUNCT 1)
The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91743, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49161, PRON 43495, ADV 24067, SCONJ 16614, NUM 15377, AUX 9155, DET 6363, PART 2521, X 927, INTJ 56)
- _
- NOUN 221899: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 91743: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 75266: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 69355: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 57421: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 55469: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 49161: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 43495: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 24067: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 16614: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 15377: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 9155: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 6363: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 2521: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- X 927: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 56: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of ADV
is 0.125000 (the average of all parts of speech is 0.003044).
The 1st highest number of forms (1) was observed with the lemma “””: _.
The 2nd highest number of forms (1) was observed with the lemma “,”: _.
The 3rd highest number of forms (1) was observed with the lemma “6”: _.
ADV
occurs with 10 features: Gender (19509; 81% instances), Number (19509; 81% instances), Definite (19507; 81% instances), Case (16050; 67% instances), Polarity (4526; 19% instances), AdpType (26; 0% instances), Mood (2; 0% instances), Person (2; 0% instances), Voice (2; 0% instances), PronType (1; 0% instances)
ADV
occurs with 16 feature-value pairs: AdpType=Prep
, Case=Acc
, Case=Gen
, Case=Nom
, Definite=Com
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=3
, Polarity=Neg
, PronType=Rel
, Voice=Act
ADV
occurs with 22 feature combinations.
The most frequent feature combination is Case=Acc|Definite=Com|Gender=Masc|Number=Sing
(12933 tokens).
Examples: _
Relations
ADV
nodes are attached to their parents using 8 different relations: advmod (18736; 78% instances), obj (2738; 11% instances), iobj (1131; 5% instances), nmod (1095; 5% instances), root (348; 1% instances), nsubj (14; 0% instances), nmod:poss (3; 0% instances), mark (2; 0% instances)
Parents of ADV
nodes belong to 13 different parts of speech: VERB (12713; 53% instances), NOUN (8161; 34% instances), ADJ (976; 4% instances), ADV (826; 3% instances), PRON (487; 2% instances), (348; 1% instances), PROPN (342; 1% instances), NUM (90; 0% instances), CCONJ (56; 0% instances), DET (23; 0% instances), X (22; 0% instances), PART (14; 0% instances), AUX (9; 0% instances)
5790 (24%) ADV
nodes are leaves.
13213 (55%) ADV
nodes have one child.
2917 (12%) ADV
nodes have two children.
2147 (9%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 16.
Children of ADV
nodes are attached using 21 different relations: nmod:poss (14233; 51% instances), case (2471; 9% instances), nmod (1752; 6% instances), punct (1698; 6% instances), cc (1343; 5% instances), ccomp (1269; 5% instances), obj (1144; 4% instances), nummod (926; 3% instances), amod (674; 2% instances), xcomp (490; 2% instances), mark (457; 2% instances), advmod (420; 2% instances), nsubj (347; 1% instances), cop (264; 1% instances), appos (107; 0% instances), iobj (50; 0% instances), aux (34; 0% instances), acl (13; 0% instances), dep (2; 0% instances), compound (1; 0% instances), discourse (1; 0% instances)
Children of ADV
nodes belong to 16 different parts of speech: NOUN (12835; 46% instances), ADP (2471; 9% instances), PROPN (2025; 7% instances), VERB (1753; 6% instances), PUNCT (1698; 6% instances), PRON (1556; 6% instances), CCONJ (1354; 5% instances), ADJ (1160; 4% instances), NUM (962; 3% instances), ADV (826; 3% instances), SCONJ (315; 1% instances), AUX (302; 1% instances), DET (240; 1% instances), PART (148; 1% instances), X (50; 0% instances), INTJ (1; 0% instances)