Treebank Statistics: UD_Pomak-Philotis: POS Tags: ADV
There are 249 ADV
lemmas (6%), 378 ADV
types (3%) and 4336 ADV
tokens (5%).
Out of 16 observed tags, the rank of ADV
is: 4 in number of lemmas, 5 in number of types and 8 in number of tokens.
The 10 most frequent ADV
lemmas: pak, po, itám, naj, azám, le, kadé, sétne, kak, játse
The 10 most frequent ADV
types: pák, pó, naj, azám, le, kadé, kak, sétne, itám, játse
The 10 most frequent ambiguous lemmas: pak (SCONJ 227, ADV 221), po (ADV 186, ADP 5, INTJ 2), málko (ADV 59, ADJ 19), napréš (ADV 54, ADP 4), mlógo (ADJ 104, ADV 38), mífko (ADV 30, ADJ 17), sæ (PART 38, ADV 25), has (ADV 21, ADJ 6), véčer (NOUN 45, ADV 15), báre (ADV 10, SCONJ 1)
The 10 most frequent ambiguous types: málko (ADV 53, ADJ 10), napréš (ADV 54, ADP 3), mlógo (ADJ 82, ADV 31), mífko (ADV 17, ADJ 10), has (ADV 18, ADJ 6), véčer (ADV 13, NOUN 1), báre (ADV 10, SCONJ 1), dalí (ADV 8, PART 5, CCONJ 2), gajét (ADV 6, ADJ 1), ajnéj (ADV 5, PRON 1)
- málko
- napréš
- mlógo
- mífko
- has
- véčer
- báre
- dalí
- gajét
- ajnéj
Morphology
The form / lemma ratio of ADV
is 1.518072 (the average of all parts of speech is 2.731846).
The 1st highest number of forms (11) was observed with the lemma “itagáne”: itagáne, itagýne, tagáni, tagás, tagís, tagýne, togás, togáva, tugána, tugás, tugáva.
The 2nd highest number of forms (10) was observed with the lemma “inýj”: Ajní, Enékana, Ináj, ajnáj, ajnéj, ajníjkana, ajníkana, enýj, inéj, inýj.
The 3rd highest number of forms (10) was observed with the lemma “itúzi”: Ajtú, ajtóvana, ajtúj, ajtús, ajtúva, etús, itúj, itúzi, tus, túzi.
ADV
occurs with 4 features: PronType (1410; 33% instances), Deixis (1096; 25% instances), DeixisRef (430; 10% instances), Degree (348; 8% instances)
ADV
occurs with 12 feature-value pairs: Degree=Cmp
, Degree=Dim
, Degree=Sup
, Deixis=Prox
, Deixis=Remt
, DeixisRef=1
, DeixisRef=2
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Rel
, PronType=Tot
ADV
occurs with 16 feature combinations.
The most frequent feature combination is _
(2529 tokens).
Examples: pák, azám, le, sétne, játse, jéšte, jálnys, húbbe, málko, napréš
Relations
ADV
nodes are attached to their parents using 2 different relations: advmod (4264; 98% instances), root (72; 2% instances)
Parents of ADV
nodes belong to 14 different parts of speech: VERB (3032; 70% instances), NOUN (485; 11% instances), ADV (373; 9% instances), ADJ (234; 5% instances), (72; 2% instances), PRON (34; 1% instances), DET (32; 1% instances), PART (28; 1% instances), PROPN (25; 1% instances), NUM (13; 0% instances), INTJ (5; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances), X (1; 0% instances)
3575 (82%) ADV
nodes are leaves.
509 (12%) ADV
nodes have one child.
119 (3%) ADV
nodes have two children.
133 (3%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 8.
Children of ADV
nodes are attached using 23 different relations: advmod (440; 35% instances), punct (259; 20% instances), case (176; 14% instances), aux (110; 9% instances), nsubj (62; 5% instances), obl (39; 3% instances), cc (32; 3% instances), conj (30; 2% instances), dep (27; 2% instances), det (24; 2% instances), discourse (14; 1% instances), mark (12; 1% instances), advcl (10; 1% instances), csubj (5; 0% instances), nmod (5; 0% instances), obj (5; 0% instances), vocative (5; 0% instances), iobj (4; 0% instances), ccomp (3; 0% instances), expl (3; 0% instances), nummod (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances)
Children of ADV
nodes belong to 15 different parts of speech: ADV (373; 29% instances), PUNCT (259; 20% instances), ADP (176; 14% instances), AUX (110; 9% instances), NOUN (110; 9% instances), PART (80; 6% instances), VERB (43; 3% instances), PRON (42; 3% instances), CCONJ (32; 3% instances), DET (19; 1% instances), SCONJ (12; 1% instances), PROPN (6; 0% instances), ADJ (4; 0% instances), NUM (2; 0% instances), INTJ (1; 0% instances)