home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: PART

There are 20 PART lemmas (0%), 1 PART types (6%) and 8612 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 12 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: _، s، f، l، lA، >، None، ,، w، “

The 10 most frequent PART types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 216429, PUNCT 72574, ADJ 66760, ADP 62646, VERB 54473, PROPN 48965, ADV 26129, SCONJ 23987, NUM 15122, AUX 6581, DET 6330, PART 5856, CCONJ 5168, PRON 2460, INTJ 54, X 32), s (PART 2011, AUX 251, NOUN 7, VERB 7, PUNCT 3, ADP 1, CCONJ 1, PRON 1, X 1), f (CCONJ 1247, AUX 459, PART 441, ADV 18, NOUN 12, SCONJ 8, PUNCT 7, VERB 4, ADP 3, ADJ 2, NUM 2, PRON 2), l (ADP 15449, PART 123, NOUN 98, AUX 67, CCONJ 33, ADJ 30, PUNCT 19, VERB 9, SCONJ 8, PROPN 7, ADV 6, PRON 5, DET 2, INTJ 2, NUM 1, X 1), lA (PART 99, PROPN 6, ADP 1, PRON 1, VERB 1), > (PART 18, AUX 10), None (NOUN 457, X 344, VERB 264, ADJ 125, PROPN 124, ADV 34, CCONJ 20, PRON 16, SCONJ 16, PART 14, ADP 8, DET 6, AUX 2), , (NOUN 100, CCONJ 96, VERB 34, PROPN 33, ADJ 30, ADP 30, PRON 11, SCONJ 11, PART 10, AUX 5, DET 5, ADV 4), w (CCONJ 43321, NOUN 190, PUNCT 136, ADP 120, ADV 117, PROPN 78, VERB 71, SCONJ 69, ADJ 55, PRON 33, PART 10, DET 9, NUM 8, AUX 5, X 3), “ (NOUN 112, ADP 34, CCONJ 20, PROPN 20, ADJ 12, VERB 8, PART 6, PRON 6, SCONJ 6, ADV 5, AUX 2, DET 2, X 2)

The 10 most frequent ambiguous types: _ (NOUN 218254, ADP 91694, PUNCT 75148, ADJ 67604, PROPN 58325, VERB 55215, CCONJ 50032, PRON 31239, ADV 26527, SCONJ 26034, NUM 15147, PART 8612, AUX 7723, DET 6362, X 917, INTJ 56)

Morphology

The form / lemma ratio of PART is 0.050000 (the average of all parts of speech is 0.002933).

The 1st highest number of forms (1) was observed with the lemma “””: _.

The 2nd highest number of forms (1) was observed with the lemma “(”: _.

The 3rd highest number of forms (1) was observed with the lemma “,”: _.

PART occurs with 8 features: Polarity (4462; 52% instances), Gender (75; 1% instances), Number (75; 1% instances), Definite (57; 1% instances), Case (44; 1% instances), Person (26; 0% instances), Mood (18; 0% instances), Voice (18; 0% instances)

PART occurs with 18 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=3, Polarity=Neg, Voice=Act, Voice=Pass

PART occurs with 34 feature combinations. The most frequent feature combination is Polarity=Neg (4462 tokens). Examples: _

Relations

PART nodes are attached to their parents using 10 different relations: aux (4915; 57% instances), dep (3118; 36% instances), parataxis (281; 3% instances), nmod (174; 2% instances), conj (45; 1% instances), mark (37; 0% instances), root (31; 0% instances), nsubj (5; 0% instances), nmod:poss (3; 0% instances), obj (3; 0% instances)

Parents of PART nodes belong to 15 different parts of speech: VERB (6779; 79% instances), NOUN (1098; 13% instances), ADV (277; 3% instances), ADJ (172; 2% instances), PROPN (75; 1% instances), PRON (72; 1% instances), CCONJ (32; 0% instances), (31; 0% instances), DET (18; 0% instances), NUM (15; 0% instances), PUNCT (14; 0% instances), PART (13; 0% instances), SCONJ (9; 0% instances), AUX (5; 0% instances), X (2; 0% instances)

7915 (92%) PART nodes are leaves.

568 (7%) PART nodes have one child.

85 (1%) PART nodes have two children.

44 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 15.

Children of PART nodes are attached using 19 different relations: nmod (244; 26% instances), xcomp (157; 17% instances), dep (135; 15% instances), det (56; 6% instances), parataxis (53; 6% instances), case (50; 5% instances), obj (46; 5% instances), punct (46; 5% instances), amod (30; 3% instances), mark (26; 3% instances), cc (21; 2% instances), advmod (19; 2% instances), ccomp (17; 2% instances), nsubj (10; 1% instances), conj (7; 1% instances), cop (6; 1% instances), csubj (2; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances)

Children of PART nodes belong to 15 different parts of speech: NOUN (224; 24% instances), VERB (195; 21% instances), ADV (132; 14% instances), DET (62; 7% instances), ADP (50; 5% instances), PUNCT (46; 5% instances), PRON (44; 5% instances), PROPN (39; 4% instances), ADJ (36; 4% instances), SCONJ (28; 3% instances), CCONJ (26; 3% instances), AUX (19; 2% instances), PART (13; 1% instances), X (12; 1% instances), NUM (1; 0% instances)