home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Persian-Seraji: POS Tags: ADP

There are 86 ADP lemmas (1%), 108 ADP types (1%) and 17415 ADP tokens (11%). Out of 15 observed tags, the rank of ADP is: 6 in number of lemmas, 8 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: در، به، از، با، برای، بر، تا، درباره، مورد، رو

The 10 most frequent ADP types: در، به، از، با، برای، بر، تا، مورد، روی، درباره

The 10 most frequent ambiguous lemmas: در (ADP 4876, NOUN 20), به (ADP 4280, ADJ 66, ADV 5), از (ADP 3426, NOUN 1), بر (ADP 604, NOUN 8), تا (ADP 296, SCONJ 120, NOUN 32, ADV 1, CCONJ 1), مورد (NOUN 197, ADP 128), رو (ADP 105, NOUN 55, PART 2), بین (ADP 84, NOUN 2, VERB 2), نسبت (ADP 80, NOUN 16), کنار (ADP 54, NOUN 3)

The 10 most frequent ambiguous types: در (ADP 4876, NOUN 20), به (ADP 4255, ADJ 1), بر (ADP 604, NOUN 8), تا (ADP 296, SCONJ 120, NOUN 32, ADV 1, CCONJ 1), مورد (NOUN 150, ADP 128), روی (ADP 105, NOUN 35), بین (ADP 84, NOUN 2, VERB 2), نسبت (ADP 80, NOUN 15), کنار (ADP 54, NOUN 1), برابر (ADP 52, NOUN 25)

Morphology

The form / lemma ratio of ADP is 1.255814 (the average of all parts of speech is 1.920334).

The 1st highest number of forms (22) was observed with the lemma “_”: باضافهٔ, بجز, بدانها, برا, برایت, برطبق, برعلیه, برغم, بنظر, بهش, به‌, به‌زعم, بی‌آنکه, دربارهٔ, در‌, روبه‌روی, رودرروی, رویاروی, ظرف, فرا, فرو, نبش.

The 2nd highest number of forms (4) was observed with the lemma “درباره”: درباره, دربارهٔ, دربارهٔٔ, درباره‌.

The 3rd highest number of forms (2) was observed with the lemma “از”: از, ز.

ADP does not occur with any features.

Relations

ADP nodes are attached to their parents using 9 different relations: case (16463; 95% instances), fixed (683; 4% instances), mark (154; 1% instances), compound:prt (100; 1% instances), advmod (7; 0% instances), conj (3; 0% instances), compound:lvc (2; 0% instances), obj (2; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 12 different parts of speech: NOUN (15001; 86% instances), PRON (1027; 6% instances), ADP (560; 3% instances), VERB (282; 2% instances), ADJ (262; 2% instances), ADV (164; 1% instances), NUM (90; 1% instances), CCONJ (19; 0% instances), X (7; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

16445 (94%) ADP nodes are leaves.

771 (4%) ADP nodes have one child.

175 (1%) ADP nodes have two children.

24 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 4.

Children of ADP nodes are attached using 19 different relations: fixed (889; 74% instances), conj (165; 14% instances), advmod (43; 4% instances), nummod (18; 2% instances), nmod (15; 1% instances), cc:preconj (14; 1% instances), nmod:poss (11; 1% instances), punct (10; 1% instances), cc (6; 1% instances), dep (6; 1% instances), obj (4; 0% instances), acl:relcl (2; 0% instances), advcl (2; 0% instances), case (2; 0% instances), compound:lvc (2; 0% instances), mark (2; 0% instances), compound (1; 0% instances), compound:prt (1; 0% instances), flat (1; 0% instances)

Children of ADP nodes belong to 13 different parts of speech: ADP (560; 47% instances), NOUN (296; 25% instances), CCONJ (154; 13% instances), PRON (56; 5% instances), ADV (43; 4% instances), SCONJ (27; 2% instances), NUM (23; 2% instances), ADJ (12; 1% instances), PUNCT (10; 1% instances), VERB (7; 1% instances), PART (4; 0% instances), DET (1; 0% instances), X (1; 0% instances)