## Treebank Statistics: UD_Arabic-NYUAD: Features: `Definite`

This feature is universal.
It occurs with 3 different values: `Com`

, `Def`

, `Ind`

.

417286 tokens (56%) have a non-empty value of `Definite`

.
1 types (0) occur at least once with a non-empty value of `Definite`

.
4544 lemmas (90%) occur at least once with a non-empty value of `Definite`

.
The feature is used with 16 part-of-speech tags: `NOUN` (216797; 29% instances), `ADJ` (67059; 9% instances), `PROPN` (54088; 7% instances), `PRON` (31010; 4% instances), `ADV` (24343; 3% instances), `SCONJ` (11386; 2% instances), `DET` (6031; 1% instances), `NUM` (3442; 0% instances), `ADP` (843; 0% instances), `PUNCT` (671; 0% instances), `CCONJ` (502; 0% instances), `AUX` (434; 0% instances), `VERB` (341; 0% instances), `X` (279; 0% instances), `PART` (57; 0% instances), `INTJ` (3; 0% instances).

`NOUN`

216797 `NOUN` tokens (99% of all `NOUN`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `NOUN`

and `Definite`

co-occurred: `Number``=Sing` (192645; 89%), `Gender``=Masc` (150658; 69%), `Case``=Gen` (142071; 66%).

`NOUN`

tokens may have the following values of `Definite`

:

`Com`

(82924; 38% of non-empty`Definite`

): _`Def`

(95738; 44% of non-empty`Definite`

): _`Ind`

(38135; 18% of non-empty`Definite`

): _`EMPTY`

(1457): _

`Definite`

seems to be **lexical feature** of `NOUN`

. 92% lemmas (35) occur only with one value of `Definite`

.

`ADJ`

67059 `ADJ` tokens (99% of all `ADJ`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `ADJ`

and `Definite`

co-occurred: `Number``=Sing` (64133; 96%), `Case``=Gen` (40502; 60%), `Gender``=Masc` (35315; 53%).

`ADJ`

tokens may have the following values of `Definite`

:

`Com`

(2404; 4% of non-empty`Definite`

): _`Def`

(45522; 68% of non-empty`Definite`

): _`Ind`

(19133; 29% of non-empty`Definite`

): _`EMPTY`

(545): _

`PROPN`

54088 `PROPN` tokens (93% of all `PROPN`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `PROPN`

and `Definite`

co-occurred: `Number``=Sing` (53621; 99%), `Gender``=Masc` (51067; 94%), `Case``=EMPTY` (42593; 79%).

`PROPN`

tokens may have the following values of `Definite`

:

`Com`

(3393; 6% of non-empty`Definite`

): _`Def`

(9981; 18% of non-empty`Definite`

): _`Ind`

(40714; 75% of non-empty`Definite`

): _`EMPTY`

(4237): _

`Definite`

seems to be **lexical feature** of `PROPN`

. 99% lemmas (4450) occur only with one value of `Definite`

.

`PRON`

31010 `PRON` tokens (99% of all `PRON`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `PRON`

and `Definite`

co-occurred: `PronType``=Prs` (30458; 98%), `Person``=3` (27571; 89%), `Number``=Sing` (25396; 82%), `Gender``=Masc` (20041; 65%), `Case``=Gen` (16343; 53%).

`PRON`

tokens may have the following values of `Definite`

:

`Com`

(127; 0% of non-empty`Definite`

): _`Def`

(28709; 93% of non-empty`Definite`

): _`Ind`

(2174; 7% of non-empty`Definite`

): _`EMPTY`

(229): _

`ADV`

24343 `ADV` tokens (92% of all `ADV`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `ADV`

and `Definite`

co-occurred: `Number``=Sing` (24181; 99%), `Gender``=Masc` (23459; 96%), `Case``=Acc` (18316; 75%).

`ADV`

tokens may have the following values of `Definite`

:

`Com`

(15629; 64% of non-empty`Definite`

): _`Def`

(31; 0% of non-empty`Definite`

): _`Ind`

(8683; 36% of non-empty`Definite`

): _`EMPTY`

(2184): _

`SCONJ`

11386 `SCONJ` tokens (44% of all `SCONJ`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `SCONJ`

and `Definite`

co-occurred: `Number``=Sing` (10477; 92%), `Gender``=Masc` (6753; 59%).

`SCONJ`

tokens may have the following values of `Definite`

:

`Com`

(35; 0% of non-empty`Definite`

): _`Def`

(964; 8% of non-empty`Definite`

): _`Ind`

(10387; 91% of non-empty`Definite`

): _`EMPTY`

(14648): _

`Definite`

seems to be **lexical feature** of `SCONJ`

. 92% lemmas (12) occur only with one value of `Definite`

.

`DET`

6031 `DET` tokens (95% of all `DET`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `DET`

and `Definite`

co-occurred: `Number``=Sing` (5845; 97%), `Gender``=Masc` (3801; 63%).

`DET`

tokens may have the following values of `Definite`

:

`Com`

(10; 0% of non-empty`Definite`

): _`Def`

(16; 0% of non-empty`Definite`

): _`Ind`

(6005; 100% of non-empty`Definite`

): _`EMPTY`

(331): _

`NUM`

3442 `NUM` tokens (23% of all `NUM`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `NUM`

and `Definite`

co-occurred: `NumForm``=Word` (3330; 97%), `Number``=Sing` (3046; 88%), `Gender``=Masc` (2094; 61%), `Case``=Gen` (2039; 59%).

`NUM`

tokens may have the following values of `Definite`

:

`Com`

(2317; 67% of non-empty`Definite`

): _`Def`

(359; 10% of non-empty`Definite`

): _`Ind`

(766; 22% of non-empty`Definite`

): _`EMPTY`

(11705): _

`ADP`

843 `ADP` tokens (1% of all `ADP`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `ADP`

and `Definite`

co-occurred: `AdpType``=Prep` (843; 100%).

`ADP`

tokens may have the following values of `Definite`

:

`Com`

(187; 22% of non-empty`Definite`

): _`Def`

(402; 48% of non-empty`Definite`

): _`Ind`

(254; 30% of non-empty`Definite`

): _`EMPTY`

(90851): _

`Definite`

seems to be **lexical feature** of `ADP`

. 93% lemmas (27) occur only with one value of `Definite`

.

`PUNCT`

671 `PUNCT` tokens (1% of all `PUNCT`

tokens) have a non-empty value of `Definite`

.

`PUNCT`

tokens may have the following values of `Definite`

:

`Com`

(137; 20% of non-empty`Definite`

): _`Def`

(295; 44% of non-empty`Definite`

): _`Ind`

(239; 36% of non-empty`Definite`

): _`EMPTY`

(74477): _

`CCONJ`

502 `CCONJ` tokens (1% of all `CCONJ`

tokens) have a non-empty value of `Definite`

.

`CCONJ`

tokens may have the following values of `Definite`

:

`Com`

(92; 18% of non-empty`Definite`

): _`Def`

(257; 51% of non-empty`Definite`

): _`Ind`

(153; 30% of non-empty`Definite`

): _`EMPTY`

(49530): _

`Definite`

seems to be **lexical feature** of `CCONJ`

. 93% lemmas (25) occur only with one value of `Definite`

.

`AUX`

434 `AUX` tokens (6% of all `AUX`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `AUX`

and `Definite`

co-occurred: `Mood``=EMPTY` (434; 100%), `Voice``=EMPTY` (434; 100%), `Gender``=Masc` (349; 80%), `Number``=Sing` (306; 71%).

`AUX`

tokens may have the following values of `Definite`

:

`Com`

(62; 14% of non-empty`Definite`

): _`Def`

(356; 82% of non-empty`Definite`

): _`Ind`

(16; 4% of non-empty`Definite`

): _`EMPTY`

(7289): _

`Definite`

seems to be **lexical feature** of `AUX`

. 91% lemmas (10) occur only with one value of `Definite`

.

`VERB`

341 `VERB` tokens (1% of all `VERB`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `VERB`

and `Definite`

co-occurred: `Aspect``=EMPTY` (341; 100%), `Mood``=EMPTY` (341; 100%), `Voice``=EMPTY` (341; 100%), `Number``=Sing` (312; 91%), `Person``=EMPTY` (312; 91%), `Gender``=Masc` (258; 76%).

`VERB`

tokens may have the following values of `Definite`

:

`Com`

(82; 24% of non-empty`Definite`

): _`Def`

(110; 32% of non-empty`Definite`

): _`Ind`

(149; 44% of non-empty`Definite`

): _`EMPTY`

(54874): _

`X`

279 `X` tokens (30% of all `X`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `X`

and `Definite`

co-occurred: `Mood``=EMPTY` (279; 100%), `Voice``=EMPTY` (279; 100%), `Person``=EMPTY` (278; 100%), `Number``=Sing` (247; 89%), `Gender``=Masc` (214; 77%).

`X`

tokens may have the following values of `Definite`

:

`Com`

(19; 7% of non-empty`Definite`

): _`Def`

(64; 23% of non-empty`Definite`

): _`Ind`

(196; 70% of non-empty`Definite`

): _`EMPTY`

(638): _

Paradigm None | Ind | Def | Com |
---|---|---|---|

Case=Acc|Gender=Masc|Number=Sing | _ | _ | _ |

Case=Acc|Gender=Masc|Number=Dual | _ | _ | _ |

Case=Acc|Gender=Masc|Number=Plur | _ | ||

Case=Acc|Gender=Fem|Number=Dual | _ | ||

Case=Gen|Gender=Masc|Number=Sing | _ | ||

Case=Nom|Gender=Masc|Number=Dual | _ | ||

Case=Nom|Gender=Masc|Number=Plur | _ | ||

Gender=Masc|Number=Sing | _ | _ | _ |

Gender=Fem|Number=Sing | _ | _ |

`PART`

57 `PART` tokens (1% of all `PART`

tokens) have a non-empty value of `Definite`

.

The most frequent other feature values with which `PART`

and `Definite`

co-occurred: `Polarity``=EMPTY` (57; 100%).

`PART`

tokens may have the following values of `Definite`

:

`Com`

(9; 16% of non-empty`Definite`

): _`Def`

(23; 40% of non-empty`Definite`

): _`Ind`

(25; 44% of non-empty`Definite`

): _`EMPTY`

(8555): _

`INTJ`

3 `INTJ` tokens (5% of all `INTJ`

tokens) have a non-empty value of `Definite`

.

`INTJ`

tokens may have the following values of `Definite`

:

`Ind`

(3; 100% of non-empty`Definite`

): _`EMPTY`

(53): _

## Relations with Agreement in `Definite`

The 10 most frequent relations where parent and child node agree in `Definite`

:
`NOUN –[ amod]–> ADJ` (46371; 84%),

`PROPN –[`(10684; 76%),

`flat:name`]–> PROPN`NOUN –[`(10465; 73%),

`conj`]–> NOUN`PROPN –[`(1799; 71%),

`conj`]–> PROPN`ADJ –[`(1771; 98%),

`conj`]–> ADJ`ADJ –[`(1041; 83%),

`amod`]–> ADJ`PROPN –[`(314; 100%),

`appos`]–> PROPN`ADJ –[`(233; 70%),

`nmod`]–> PRON`ADJ –[`(193; 56%),

`nsubj`]–> PRON`ADJ –[`(190; 71%).

`conj`]–> NOUN