Treebank Statistics: UD_Hebrew-HTB: POS Tags: DET
There are 20 DET
lemmas (0%), 26 DET
types (0%) and 17424 DET
tokens (11%).
Out of 15 observed tags, the rank of DET
is: 12 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent DET
lemmas: ה, כול, כמה, הרבה, רוב, _, שום, מספר, אף, שאר
The 10 most frequent DET
types: ה, ה_, כל, כמה, הרבה, רוב, שום, מספר, אף, שאר
The 10 most frequent ambiguous lemmas: ה (DET 16515, SCONJ 745, X 28), כול (DET 524, NOUN 31, ADV 1), הרבה (DET 35, ADV 22, VERB 13), רוב (DET 34, NOUN 15), _ (NOUN 366, AUX 268, VERB 251, ADJ 231, ADV 177, CCONJ 110, X 86, PRON 57, SCONJ 47, DET 33), שום (DET 33, NOUN 4, PROPN 1), מספר (NOUN 38, DET 31), אף (CCONJ 53, ADV 46, DET 20, NOUN 13), שאר (NOUN 20, DET 16), מרבית (DET 14, NOUN 2)
The 10 most frequent ambiguous types: ה (DET 13596, SCONJ 745, X 21), ה_ (DET 2935, X 8), הרבה (DET 35, ADV 22, VERB 3, X 1), רוב (DET 34, NOUN 10), שום (DET 33, NOUN 4, PROPN 1), מספר (DET 31, NOUN 30, VERB 6), אף (ADV 60, CCONJ 53, DET 20, NOUN 12), שאר (NOUN 19, DET 16), מרבית (DET 14, NOUN 1), מחצית (NOUN 22, DET 11, X 1)
- ה
- ה_
- הרבה
- רוב
- שום
- מספר
- אף
- שאר
- מרבית
- מחצית
Morphology
The form / lemma ratio of DET
is 1.300000 (the average of all parts of speech is 1.701287).
The 1st highest number of forms (5) was observed with the lemma “”: אילו, ה, מחצית, מירב, מרבה.
The 2nd highest number of forms (2) was observed with the lemma “איזה”: איזה, איזו.
The 3rd highest number of forms (2) was observed with the lemma “ה”: ה, ה_.
DET
occurs with 3 features: Definite (17424; 100% instances), PronType (16531; 95% instances), Gender (18; 0% instances)
DET
occurs with 4 feature-value pairs: Definite=Cons
, Definite=Def
, Gender=Masc
, PronType=Art
DET
occurs with 3 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art
(16531 tokens).
Examples: ה, ה_
Relations
DET
nodes are attached to their parents using 18 different relations: det (17112; 98% instances), dep (142; 1% instances), fixed (40; 0% instances), advmod (39; 0% instances), mark (31; 0% instances), obl (10; 0% instances), advcl (9; 0% instances), compound:smixut (8; 0% instances), nsubj (7; 0% instances), obj (6; 0% instances), root (4; 0% instances), amod (3; 0% instances), mark:q (3; 0% instances), nmod:poss (3; 0% instances), appos (2; 0% instances), conj (2; 0% instances), nsubj:cop (2; 0% instances), parataxis (1; 0% instances)
Parents of DET
nodes belong to 11 different parts of speech: NOUN (13173; 76% instances), ADJ (3077; 18% instances), NUM (357; 2% instances), VERB (294; 2% instances), PRON (236; 1% instances), PROPN (183; 1% instances), ADV (58; 0% instances), ADP (19; 0% instances), AUX (18; 0% instances), DET (5; 0% instances), (4; 0% instances)
17294 (99%) DET
nodes are leaves.
92 (1%) DET
nodes have one child.
14 (0%) DET
nodes have two children.
24 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 14.
Children of DET
nodes are attached using 17 different relations: punct (68; 31% instances), dep (59; 27% instances), fixed (40; 18% instances), advmod (15; 7% instances), case (8; 4% instances), flat:name (7; 3% instances), obl (5; 2% instances), case:gen (3; 1% instances), det (3; 1% instances), acl:relcl (2; 1% instances), case:acc (2; 1% instances), cc (2; 1% instances), amod (1; 0% instances), appos (1; 0% instances), compound:smixut (1; 0% instances), conj (1; 0% instances), nsubj (1; 0% instances)
Children of DET
nodes belong to 13 different parts of speech: PUNCT (68; 31% instances), ADV (51; 23% instances), PROPN (21; 10% instances), ADP (17; 8% instances), NOUN (14; 6% instances), NUM (14; 6% instances), VERB (14; 6% instances), ADJ (5; 2% instances), DET (5; 2% instances), PRON (3; 1% instances), SCONJ (3; 1% instances), AUX (2; 1% instances), CCONJ (2; 1% instances)