
This page pertains to UD version 2.

Treebank Statistics: UD_Occitan-TTB: POS Tags: ADP

There are 78 ADP lemmas (2%), 86 ADP types (1%) and 3176 ADP tokens (12%). Out of 16 observed tags, the rank of ADP is: 8 in number of lemmas, 10 in number of types and 5 in number of tokens.
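As a minimal sketch of how counts of this kind can be derived, the following Python snippet tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `count_upos` and the sample rows are illustrative assumptions, not data from UD_Occitan-TTB, and comparing forms case-sensitively is a simplification; the script that produced this page may normalize differently.

```python
def count_upos(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:      # column 4 is UPOS
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical sample rows (not taken from the treebank).
ROWS = [
    ("1", "ostal", "ostal", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("2", "de", "de", "ADP", "_", "_", "3", "case", "_", "_"),
    ("3", "Pèire", "Pèire", "PROPN", "_", "_", "1", "nmod", "_", "_"),
    ("4", "d’", "de", "ADP", "_", "_", "5", "case", "_", "_"),
    ("5", "Anna", "Anna", "PROPN", "_", "_", "1", "nmod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(count_upos(SAMPLE, "ADP"))  # one lemma (de), two types (de, d’), two tokens
```

In the sample, the forms de and d’ share the lemma de, which is why the type count exceeds the lemma count.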

The 10 most frequent ADP lemmas: de, a, per, en, dins, sus, amb, coma, dens, sens

The 10 most frequent ADP types: de, a, d’, per, en, dins, sus, amb, coma, dens

The 10 most frequent ambiguous lemmas: de (ADP 1531, DET 133, PART 3), a (ADP 524, INTJ 19, VERB 4, SCONJ 1), en (ADP 236, PRON 21), coma (SCONJ 79, ADP 23, ADV 5), sens (ADP 21, NOUN 3), dempuèi (ADP 17, SCONJ 1), entre (ADP 15, ADV 1), darrièr (ADP 14, ADJ 4, ADV 4), entà (ADP 13, SCONJ 2), davant (ADP 10, ADV 2, NOUN 1)

The 10 most frequent ambiguous types: de (ADP 1231, DET 98, PART 2), a (ADP 498, VERB 46, AUX 26, INTJ 4, SCONJ 1), d’ (ADP 276, DET 31, PART 1), en (ADP 207, PRON 7), coma (SCONJ 72, ADP 23, ADV 3), sens (ADP 18, NOUN 4), dempuèi (ADP 15, SCONJ 1), entre (ADP 14, ADV 1), entà (ADP 13, SCONJ 1), darrièr (ADP 12, ADV 3, ADJ 2)

Morphology

The form / lemma ratio of ADP is 1.102564 (the average of all parts of speech is 1.368971).
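The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas. The short check below assumes that definition, which the page itself does not state explicitly; the two counts are the ones given in the summary above.

```python
adp_types = 86   # ADP types reported in the summary
adp_lemmas = 78  # ADP lemmas reported in the summary

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")
```

A value close to 1 means that most ADP lemmas appear in a single form, as expected for a largely uninflected class.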

The 1st highest number of forms (6) was observed with the lemma “a”: -a, a, a-, ad, an, à.

The 2nd highest number of forms (2) was observed with the lemma “a_lo”: al, au.

The 3rd highest number of forms (2) was observed with the lemma “amb”: amb, ambe.

ADP occurs with 1 feature: ExtPos (94; 3% instances)

ADP occurs with 6 feature-value pairs: ExtPos=ADP, ExtPos=ADV, ExtPos=INTJ, ExtPos=PRON, ExtPos=PROPN, ExtPos=SCONJ

ADP occurs with 7 feature combinations. The most frequent feature combination is _ (3082 tokens). Examples: de, a, d’, per, en, dins, sus, amb, dens, coma

Relations

ADP nodes are attached to their parents using 10 different relations: case (2656; 84% instances), mark (396; 12% instances), fixed (78; 2% instances), advmod (25; 1% instances), flat (11; 0% instances), obl (3; 0% instances), conj (2; 0% instances), discourse (2; 0% instances), obj (2; 0% instances), orphan (1; 0% instances)
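The percentages in the relation list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of ADP tokens, rounded to the nearest whole percent; the variable names are illustrative.

```python
# Counts of incoming relations for ADP nodes, copied from the list above.
relation_counts = {
    "case": 2656, "mark": 396, "fixed": 78, "advmod": 25, "flat": 11,
    "obl": 3, "conj": 2, "discourse": 2, "obj": 2, "orphan": 1,
}

total = sum(relation_counts.values())  # should equal the ADP token count

# Assumed rounding: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to 3176, the ADP token total, so every ADP node is covered by exactly one incoming relation.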

Parents of ADP nodes belong to 11 different parts of speech: NOUN (2148; 68% instances), VERB (400; 13% instances), PROPN (320; 10% instances), PRON (97; 3% instances), ADV (71; 2% instances), NUM (69; 2% instances), ADP (44; 1% instances), ADJ (20; 1% instances), DET (5; 0% instances), AUX (1; 0% instances), X (1; 0% instances)

3070 (97%) ADP nodes are leaves.

62 (2%) ADP nodes have one child.

40 (1%) ADP nodes have two children.

4 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 4.
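Child-degree figures like these follow from the HEAD column of each sentence. The sketch below is one way to compute them, assuming a sentence is given as `(id, upos, head)` tuples; the function name `child_degree_histogram` and the sample sentence are hypothetical and not taken from the treebank.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Map number of children -> number of nodes carrying the given UPOS tag."""
    # Count how many tokens point at each node id as their head.
    children = Counter(head for _, _, head in sentence)
    # Counter returns 0 for ids that never occur as a head, i.e. leaves.
    return Counter(children[node_id] for node_id, tag, _ in sentence if tag == upos)


# Hypothetical sentence: token 1 is an ADP with one child (token 2),
# tokens 2 and 4 are ADP leaves.
sentence = [
    (1, "ADP", 3),
    (2, "ADP", 1),
    (3, "NOUN", 0),
    (4, "ADP", 5),
    (5, "NOUN", 3),
]

hist = child_degree_histogram(sentence, "ADP")
print(dict(hist))
print(max(hist))  # highest child degree among ADP nodes in the sample
```

Summing such histograms over all sentences would give the leaf, one-child, two-child and three-or-more counts listed above.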

Children of ADP nodes are attached using 9 different relations: fixed (133; 86% instances), punct (14; 9% instances), cc (2; 1% instances), acl (1; 1% instances), case (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), flat (1; 1% instances), mark (1; 1% instances)

Children of ADP nodes belong to 12 different parts of speech: ADP (44; 28% instances), NOUN (21; 14% instances), SCONJ (21; 14% instances), ADV (20; 13% instances), PUNCT (14; 9% instances), PRON (13; 8% instances), ADJ (7; 5% instances), VERB (6; 4% instances), DET (4; 3% instances), CCONJ (2; 1% instances), X (2; 1% instances), PART (1; 1% instances)