This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Hebrew)

There is 1 DET lemma (6%), and there are 26 DET types (0%) and 17424 DET tokens (11%). Out of 16 observed tags, DET ranks 6th in number of lemmas, 12th in number of types and 4th in number of tokens.
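The counts and ranks above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the names `SAMPLE`, `read_words`, `pos_summary` and `rank` are hypothetical, the sample sentence is invented for illustration, and the tie-breaking rule for ranks is an assumption.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment (invented for illustration, not from UD_Hebrew).
SAMPLE = (
    "# text = (invented example)\n"
    "1\tה\t_\tDET\t_\tPronType=Art\t2\tdet:def\t_\t_\n"
    "2\tילד\t_\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "3\tה\t_\tDET\t_\tPronType=Art\t4\tdet:def\t_\t_\n"
    "4\tגדול\t_\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "5\t.\t_\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def read_words(conllu_text):
    """Yield the 10 columns of each syntactic word.

    Comment lines, blank lines, multiword-token ranges (ID like 1-2)
    and empty nodes (ID like 1.1) are skipped.
    """
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or "-" in cols[0] or "." in cols[0]:
            continue
        yield cols


def pos_summary(conllu_text):
    """Map each UPOS tag to (distinct lemmas, distinct forms, token count)."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for cols in read_words(conllu_text):
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(summary, upos, index):
    """1-based rank of `upos` in column `index` (0 lemmas, 1 types, 2 tokens).

    Assumption: tied tags share the best rank.
    """
    value = summary[upos][index]
    return 1 + sum(1 for v in summary.values() if v[index] > value)


summary = pos_summary(SAMPLE)
det_lemmas, det_types, det_tokens = summary["DET"]
```

On the invented sample, `summary["DET"]` holds one lemma, one type and two tokens, and `rank(summary, "DET", 2)` places DET first by token count.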

The only DET lemma: _ (the underscore placeholder that marks an unspecified lemma)

The 10 most frequent DET types: ה, ה_, כל, כמה, הרבה, רוב, שום, מספר, אף, שאר

The only ambiguous lemma: _ (NOUN 38249, ADP 19884, PUNCT 18302, DET 17424, VERB 15920, ADJ 8032, PROPN 7971, PRON 7381, ADV 6108, CONJ 5656, SCONJ 5168, PART 4440, NUM 3309, AUX 843, X 165, INTJ 3)

The 10 most frequent ambiguous types: ה (DET 13596, SCONJ 745, X 21), ה_ (DET 2935, X 8), הרבה (DET 35, ADV 22, VERB 3, X 1), רוב (DET 34, NOUN 10), שום (DET 33, NOUN 4, PROPN 1), מספר (DET 31, NOUN 30, VERB 6), אף (CONJ 99, DET 20, ADV 14, NOUN 12), שאר (NOUN 19, DET 16), מרבית (DET 14, NOUN 1), מחצית (NOUN 22, DET 11, X 1)
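A type counts as ambiguous here when the same word form is observed with more than one part-of-speech tag. The sketch below shows one way such a list could be built from (form, tag) pairs. The function name `ambiguous_types`, the variable `SAMPLE_PAIRS` and its counts are hypothetical, and the ordering (by frequency under the tag of interest) is an assumption inferred from the list above.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) observations; the counts are invented.
SAMPLE_PAIRS = (
    [("רוב", "DET")] * 3
    + [("רוב", "NOUN")]
    + [("כל", "DET")] * 2
    + [("שאר", "NOUN")] * 2
    + [("שאר", "DET")]
)


def ambiguous_types(pairs, upos="DET"):
    """List forms seen with `upos` and at least one other tag.

    Forms are ordered by their frequency as `upos` (descending);
    the tags of each form are ordered by frequency (descending).
    """
    by_form = defaultdict(Counter)
    for form, tag in pairs:
        by_form[form][tag] += 1
    hits = [
        (form, tags)
        for form, tags in by_form.items()
        if len(tags) > 1 and upos in tags
    ]
    hits.sort(key=lambda item: item[1][upos], reverse=True)
    return [(form, tags.most_common()) for form, tags in hits]


result = ambiguous_types(SAMPLE_PAIRS)
```

In the invented data, כל is left out because it only ever occurs as DET, while רוב and שאר are reported with their per-tag counts.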

Morphology

The form / lemma ratio of DET is 26.000000 (the average of all parts of speech is 1226.125000).
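The ratio is consistent with dividing the number of distinct DET forms by the number of distinct DET lemmas (26 / 1 = 26). A minimal sketch under that assumption follows; `form_lemma_ratio` and `DET_PAIRS` are hypothetical names, and the pairs are a small hand-picked subset of the forms listed on this page, each paired with the placeholder lemma `_`.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Illustrative subset: three distinct forms, one repeated, all with lemma "_".
DET_PAIRS = [("ה", "_"), ("כל", "_"), ("כמה", "_"), ("ה", "_")]

subset_ratio = form_lemma_ratio(DET_PAIRS)

# With the page's own totals: 26 distinct forms over 1 distinct lemma.
page_ratio = 26 / 1
```

Because every tag in this treebank shares the single lemma `_`, the ratio of each tag equals its number of distinct forms.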

The highest number of forms (26) was observed with the lemma “_”: איזה, איזו, אילו, אף, די, ה, ה_, הכל, המון, הרבה, יתר, כל, כלל, כמה, מחצית, מירב, מספיק, מספר, מעט, מרב, מרבה, מרבית, קצת, רוב, שאר, שום.

DET occurs with 4 features: he-feat/PronType (16531; 95% instances), he-feat/Definite (893; 5% instances), he-feat/Gender (18; 0% instances), he-feat/HebSource (9; 0% instances)

DET occurs with 5 feature-value pairs: Definite=Red, Gender=Masc, HebSource=ConvUncertainHead, HebSource=ConvUncertainLabel, PronType=Art

DET occurs with 5 feature combinations. The most frequent feature combination is PronType=Art (16528 tokens). Examples: ה, ה_

Relations

DET nodes are attached to their parents using 21 different relations: he-dep/det:def (16346; 94% instances), he-dep/det (766; 4% instances), he-dep/dep (144; 1% instances), he-dep/advmod (39; 0% instances), he-dep/mwe (36; 0% instances), he-dep/mark (31; 0% instances), he-dep/advcl (9; 0% instances), he-dep/nmod (9; 0% instances), he-dep/nmod:smixut (8; 0% instances), he-dep/nsubj (7; 0% instances), he-dep/dobj (6; 0% instances), he-dep/root (4; 0% instances), he-dep/amod (3; 0% instances), he-dep/aux:q (3; 0% instances), he-dep/nmod:poss (3; 0% instances), he-dep/advmod:phrase (2; 0% instances), he-dep/appos (2; 0% instances), he-dep/conj (2; 0% instances), he-dep/nsubj:cop (2; 0% instances), he-dep/iobj (1; 0% instances), he-dep/parataxis (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (13173; 76% instances), ADJ (3077; 18% instances), NUM (357; 2% instances), VERB (295; 2% instances), PRON (236; 1% instances), PROPN (180; 1% instances), ADV (58; 0% instances), ADP (19; 0% instances), AUX (17; 0% instances), DET (5; 0% instances), ROOT (4; 0% instances), PUNCT (3; 0% instances)

17345 (100%) DET nodes are leaves.

31 (0%) DET nodes have one child.

30 (0%) DET nodes have two children.

18 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 20.
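The four child-count figures above partition the DET tokens (17345 + 31 + 30 + 18 = 17424). The sketch below shows how such a breakdown could be computed from the ID, UPOS and HEAD columns of a treebank. The function name `degree_buckets`, the bucket labels and the sample sentence are hypothetical and only illustrate the counting.

```python
from collections import Counter


def degree_buckets(sentences, upos="DET"):
    """Bucket nodes tagged `upos` by their number of children.

    `sentences` is a list of sentences, each a list of
    (node_id, upos, head_id) tuples. Returns (buckets, max_degree).
    """
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node = how often its ID appears as a head.
        n_children = Counter(head for _, _, head in sent)
        for node_id, tag, _ in sent:
            if tag != upos:
                continue
            degree = n_children[node_id]
            max_degree = max(max_degree, degree)
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets, max_degree


# Invented sentence: node 1 is a DET leaf, node 3 is a DET with one child.
SENTENCES = [[(1, "DET", 2), (2, "NOUN", 0), (3, "DET", 2), (4, "ADV", 3)]]

buckets, max_degree = degree_buckets(SENTENCES)
```

Summing the bucket values always gives the number of nodes with the chosen tag, which is the same check as the sum of the four figures above.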

Children of DET nodes are attached using 17 different relations: he-dep/dep (94; 48% instances), he-dep/mwe (49; 25% instances), he-dep/punct (11; 6% instances), he-dep/case (8; 4% instances), he-dep/name (8; 4% instances), he-dep/nmod (5; 3% instances), he-dep/advmod (4; 2% instances), he-dep/case:gen (3; 2% instances), he-dep/det:def (3; 2% instances), he-dep/acl:relcl (2; 1% instances), he-dep/case:acc (2; 1% instances), he-dep/amod (1; 1% instances), he-dep/appos (1; 1% instances), he-dep/cc (1; 1% instances), he-dep/conj (1; 1% instances), he-dep/nmod:smixut (1; 1% instances), he-dep/nsubj (1; 1% instances)

Children of DET nodes belong to 14 different parts of speech: ADV (51; 26% instances), PUNCT (45; 23% instances), PROPN (21; 11% instances), NOUN (14; 7% instances), NUM (14; 7% instances), VERB (14; 7% instances), ADP (12; 6% instances), ADJ (5; 3% instances), DET (5; 3% instances), PART (5; 3% instances), PRON (3; 2% instances), SCONJ (3; 2% instances), AUX (2; 1% instances), CONJ (1; 1% instances)

