home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PUD: POS Tags: ADP

There are 73 ADP lemmas (1%), 78 ADP types (1%) and 3576 ADP tokens (17%). Out of 16 observed tags, the rank of ADP is: 7 in number of lemmas, 7 in number of types and 2 in number of tokens.

The 10 most frequent ADP lemmas: فِي، مِن، بِ، لِ، عَلَى، إِلَى، أَن، مَعَ، عَن، خِلَالَ

The 10 most frequent ADP types: في، ب، من، ل، على، إلى، أن، مع، عن، خلال

The 10 most frequent ambiguous lemmas: فِي (ADP 651, PROPN 1), مِن (ADP 545, PRON 7), بِ (ADP 535, NOUN 1), أَن (ADP 93, PART 1, PROPN 1), لكِنَّ (<tt><a href="ar_pud-pos-ADP.html">ADP</a></tt> 33, <tt><a href="ar_pud-pos-PART.html">PART</a></tt> 20, <tt><a href="ar_pud-pos-CCONJ.html">CCONJ</a></tt> 1), عِند (<tt><a href="ar_pud-pos-ADP.html">ADP</a></tt> 28, <tt><a href="ar_pud-pos-PART.html">PART</a></tt> 1), حَيثُ (<tt><a href="ar_pud-pos-ADP.html">ADP</a></tt> 26, <tt><a href="ar_pud-pos-ADV.html">ADV</a></tt> 1), حَتَّى (<tt><a href="ar_pud-pos-ADP.html">ADP</a></tt> 25, <tt><a href="ar_pud-pos-CCONJ.html">CCONJ</a></tt> 1, <tt><a href="ar_pud-pos-PART.html">PART</a></tt> 1), ما (<tt><a href="ar_pud-pos-PRON.html">PRON</a></tt> 48, <tt><a href="ar_pud-pos-ADP.html">ADP</a></tt> 9, <tt><a href="ar_pud-pos-PART.html">PART</a></tt> 1, <tt><a href="ar_pud-pos-PROPN.html">PROPN</a></tt> 1), لكِن (ADP 7, PART 4)

The 10 most frequent ambiguous types: في (ADP 651, PROPN 1), ب (ADP 535, NOUN 1), من (ADP 533, PRON 11), ل (ADP 438, PART 4), أن (SCONJ 181, ADP 95, PART 17), بعد (ADP 61, ADV 1), قبل (ADP 51, ADV 1), ك (ADP 40, PRON 13), لكن (ADP 40, PART 24, CCONJ 1), عندما (ADP 28, PART 1)

Morphology

The form / lemma ratio of ADP is 1.068493 (the average of all parts of speech is 1.380334).

The 1st highest number of forms (3) was observed with the lemma “إِلَى”: إل, إلى, إلي.

The 2nd highest number of forms (3) was observed with the lemma “ما”: بما, كما, ما.

The 3rd highest number of forms (3) was observed with the lemma “مِن”: م, مما, من.

ADP occurs with 1 features: ExtPos (142; 4% instances)

ADP occurs with 3 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ

ADP occurs with 4 feature combinations. The most frequent feature combination is _ (3434 tokens). Examples: في، ب، من، ل، على، إلى، أن، مع، عن، خلال

Relations

ADP nodes are attached to their parents using 13 different relations: case (3046; 85% instances), mark (296; 8% instances), fixed (132; 4% instances), root (38; 1% instances), advcl (16; 0% instances), ccomp (14; 0% instances), acl:relcl (13; 0% instances), advmod (11; 0% instances), dep (4; 0% instances), csubj (2; 0% instances), orphan (2; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Parents of ADP nodes belong to 10 different parts of speech: NOUN (2335; 65% instances), PROPN (396; 11% instances), VERB (337; 9% instances), PRON (219; 6% instances), ADP (125; 3% instances), ADJ (74; 2% instances), (38; 1% instances), NUM (36; 1% instances), PART (11; 0% instances), ADV (5; 0% instances)

3339 (93%) ADP nodes are leaves.

128 (4%) ADP nodes have one child.

46 (1%) ADP nodes have two children.

63 (2%) ADP nodes have three or more children.

The highest child degree of a ADP node is 7.

Children of ADP nodes are attached using 17 different relations: fixed (173; 39% instances), obj (86; 19% instances), punct (71; 16% instances), nsubj (49; 11% instances), csubj (18; 4% instances), mark (13; 3% instances), advcl (8; 2% instances), nmod (8; 2% instances), advmod (7; 2% instances), conj (7; 2% instances), cc (2; 0% instances), ccomp (2; 0% instances), compound:prt (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), obl (1; 0% instances), parataxis (1; 0% instances)

Children of ADP nodes belong to 11 different parts of speech: NOUN (135; 30% instances), ADP (125; 28% instances), PUNCT (71; 16% instances), PRON (41; 9% instances), VERB (27; 6% instances), SCONJ (18; 4% instances), ADJ (10; 2% instances), ADV (9; 2% instances), PROPN (9; 2% instances), CCONJ (2; 0% instances), PART (2; 0% instances)