Treebank Statistics: UD_Pomak-Philotis: POS Tags: VERB
There are 1156 VERB lemmas (34%), 2708 VERB types (41%) and 5860 VERB tokens (17%).
Out of 16 observed tags, the rank of VERB is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
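As a rough illustration of how lemma, type and token counts per tag (and the resulting rank) could be derived from CoNLL-U data, here is a minimal sketch. It assumes that "types" means distinct case-sensitive word forms and that multiword-token ranges and empty nodes are skipped; the helper names `pos_counts` and `rank` are hypothetical, and the sample rows are toy data, not taken from the treebank files.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)   # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by column index (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

# Toy rows (FORM, LEMMA, UPOS); HEAD and DEPREL below are placeholders.
ROWS = [
    ("íde", "ídom", "VERB"),
    ("ídeš", "ídom", "VERB"),
    ("íde", "ídom", "VERB"),
    ("zø", "zǿmom", "VERB"),
    ("noun_form", "noun_lemma", "NOUN"),
]
SAMPLE = "\n".join(
    "\t".join([str(i), form, lemma, upos, "_", "_", "0", "dep", "_", "_"])
    for i, (form, lemma, upos) in enumerate(ROWS, start=1)
)

counts = pos_counts(SAMPLE)
print(counts["VERB"])            # (lemmas, types, tokens) for VERB in the toy data
print(rank(counts, "VERB", 2))   # rank of VERB by token count
```

Dividing each count by the corresponding total over all tags would give percentages of the kind reported above.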
The 10 most frequent VERB lemmas: réčem, ídom, vídem, zǿmom, ímom, víkom, dam, íštom, móžom, stánom
The 10 most frequent VERB types: víka, reklól, trǽbava, móža, íma, hódi, zøl, právi, reklála, dam
The 10 most frequent ambiguous lemmas: glǿdom (VERB 32, NOUN 1), pójem (VERB 15, ADJ 1, NOUN 1), bǽgom (VERB 6, NOUN 1), móknom (VERB 4, NOUN 1), dø (VERB 2, INTJ 1), marí (VERB 2, PART 1), ja (PRON 1966, PART 7, CCONJ 4, ADV 2, ADJ 1, ADP 1, PUNCT 1, VERB 1), na (ADP 513, PART 181, INTJ 1, PROPN 1, VERB 1), ódbranem (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: dadé (VERB 17, SCONJ 1), néma (PART 31, VERB 14, ADV 1), dalí (VERB 5, CCONJ 3, PART 1), umé (VERB 3, SCONJ 1), mókne (VERB 3, ADJ 1, NOUN 1), velí (VERB 2, DET 1), bého (AUX 2, VERB 2), kopál (NOUN 1, VERB 1), marí (VERB 2, PART 1), ná (INTJ 1, PROPN 1, VERB 1)
- dadé
- néma
  - PART 31: « Namój da ta je strah , néma da tí gi bárot » , reklála je zmijéna .
  - VERB 14: Lǽka za itézi balíky néma faf tógavokne čarlýka .
  - ADV 1: Za drúgyse godíny , “ voítia sto spíti “ še ímot vrit belidjése i jálnys faf Sélero néma da ímot óti so hem kinígyne ne stórili kákna trǽbavašo , hem némot nagadény póteve za žǽhne insána na móžot da varvǿt i abihódet sas árabo hem so ne stórili níta annó énstasi atkák mí kázaho ta tíje néma da ímot inakvóne prógrama .
- dalí
- umé
  - VERB 3: Sǽ kotrí še sí íma nahtáre za tógavokne sí dulápe i ní kotrí drug néma da umé da mu vídi mehtǘpevene .
  - SCONJ 1: pó mlógo íštot da só zberé bop , merǧumék , lahút , makaróņe , arpaǧík , jetá za játse míčky déti , prǽsno ad žóno umé da sedí i da só na adbáve za mlógo déne , lǽkove , sambuán , sapúne , kinígy za da só súčet i kanána drúgo trǽbava za da so císti i ne gládni inézi insán .
- mókne
- velí
- bého
- kopál
- marí
- ná
Morphology
The form / lemma ratio of VERB is 2.342561 (the average of all parts of speech is 1.931467).
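The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given earlier (2708 and 1156). A small check, where the function name `form_lemma_ratio` is only illustrative:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# Counts reported for VERB in this treebank.
ratio = form_lemma_ratio(2708, 1156)
print(f"{ratio:.6f}")  # 2.342561
```

A ratio above the all-tag average of 1.931467 indicates that VERB lemmas appear in more distinct forms than lemmas of a typical part of speech.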
The highest number of forms (35) was observed with the lemma “ídom”: Atišló, Atídah, Otíde, Utišlá, Utišlála, Utišlí, atišlá, atišlála, atišlí, atišlíli, atišlól, atišlólo, atišlýly, atišól, attída, attídah, atídaho, atíde, atídoho, idǽšo, otišlíli, otišlólo, otišól, utišól, Ídijte, ídat, íde, ídeme, ídete, ídeš, ídešo, ídij, ídiš, ídom, ídot.
The second-highest number of forms (26) was observed with the lemma “zǿmom”: Zǿho, zjo, zjóme, zjómi, zjómo, zélo, zémeme, zémeš, zémi, zémo, zø, zøl, zǿhme, zǿla, zǿli, zǿlo, zǿly, zǿme, zǿmeme, zǿmeš, zǿmij, zǿmijte, zǿmom, zǿmot, zǿte, zǿtokne.
The third-highest number of forms (24) was observed with the lemma “nájdom”: našlá, našlála, našlí, našlíli, našló, našlól, našlólo, našlý, našlýly, našól, ná, nájda, nájdah, nájdahme, nájdaho, nájdat, nájde, nájdeme, nájdete, nájdeš, nájdiš, nájdom, nájdot, náje.
VERB occurs with 14 features: Aspect (5858; 100% instances), VerbForm (5855; 100% instances), Voice (5855; 100% instances), Number (5841; 100% instances), Tense (5423; 93% instances), Person (3495; 60% instances), Mood (3492; 60% instances), Gender (2346; 40% instances), Animacy (366; 6% instances), Case (192; 3% instances), Definite (192; 3% instances), Polarity (35; 1% instances), Deixis (28; 0% instances), DeixisRef (5; 0% instances)
VERB occurs with 33 feature-value pairs: Animacy=Hum, Animacy=Nhum, Aspect=Imp, Aspect=Iter, Aspect=Perf, Aspect=Prog, Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Deixis=Prox, Deixis=Remt, DeixisRef=2, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass
VERB occurs with 104 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (981 tokens).
Examples: víka, trǽbava, móža, íma, hódi, právi, stánava, íšte, jedé, varví
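A feature combination is written in the CoNLL-U FEATS notation, with `Name=Value` pairs separated by `|`. The following sketch splits such a string into a dictionary; `parse_feats` is a hypothetical helper name, and `_` is treated as "no features", following the CoNLL-U convention.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if feats == "_":
        return {}  # "_" marks an empty feature set in CoNLL-U
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# The most frequent VERB feature combination reported above.
combo = "Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
feats = parse_feats(combo)
print(feats["Tense"], feats["Person"])  # Pres 3
```

Counting how often each full string occurs gives the number of feature combinations, while counting the individual pairs gives the feature-value statistics.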
Relations
VERB nodes are attached to their parents using 21 different relations: root (2008; 34% instances), conj (1684; 29% instances), advcl (701; 12% instances), ccomp (654; 11% instances), xcomp (407; 7% instances), csubj (108; 2% instances), amod (65; 1% instances), acl:relcl (59; 1% instances), acl (53; 1% instances), parataxis (44; 1% instances), discourse (15; 0% instances), obj (12; 0% instances), obl (12; 0% instances), nsubj (10; 0% instances), dep (7; 0% instances), iobj (7; 0% instances), appos (6; 0% instances), csubj:pass (4; 0% instances), fixed (2; 0% instances), nmod (1; 0% instances), vocative (1; 0% instances)
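The percentages in this list appear to be each relation's count divided by the 5860 VERB tokens, rounded to a whole percent. The sketch below reproduces the five largest shares under that assumption; the exact rounding rule used by the statistics generator is not stated, and `share` is an illustrative name.

```python
VERB_TOKENS = 5860  # total VERB tokens reported for this treebank

# The five most frequent incoming relations, copied from the list above.
incoming = {"root": 2008, "conj": 1684, "advcl": 701, "ccomp": 654, "xcomp": 407}

def share(count, total):
    """Percentage of `count` in `total`, rounded to the nearest integer."""
    return round(100 * count / total)

shares = {rel: share(n, VERB_TOKENS) for rel, n in incoming.items()}
print(shares)  # {'root': 34, 'conj': 29, 'advcl': 12, 'ccomp': 11, 'xcomp': 7}
```

This is also why very rare relations are shown as 0%: a count of 15 out of 5860 rounds down to zero.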
Parents of VERB nodes belong to 13 different parts of speech: VERB (3557; 61% instances), no parent, i.e. the root (2008; 34% instances), NOUN (173; 3% instances), PART (36; 1% instances), ADJ (31; 1% instances), ADV (18; 0% instances), AUX (9; 0% instances), PRON (7; 0% instances), PROPN (7; 0% instances), DET (6; 0% instances), NUM (5; 0% instances), INTJ (2; 0% instances), X (1; 0% instances)
111 (2%) VERB nodes are leaves.
247 (4%) VERB nodes have one child.
850 (15%) VERB nodes have two children.
4652 (79%) VERB nodes have three or more children.
The highest child degree of a VERB node is 16.
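The child-degree figures can be thought of as a histogram over the number of dependents per node. The sketch below shows one way to compute it from the HEAD column of a single sentence; the function names are hypothetical, the example sentence is invented, and restricting the count to VERB nodes would additionally require the UPOS column.

```python
from collections import Counter

def child_counts(heads):
    """heads[i] is the 1-based HEAD of token i+1 (0 = root).
    Return the number of children of each token, in token order."""
    counts = [0] * len(heads)
    for head in heads:
        if head > 0:
            counts[head - 1] += 1
    return counts

def degree_buckets(counts):
    """Bucket nodes by 0, 1, 2, or 3 (meaning three or more) children."""
    return Counter(min(n, 3) for n in counts)

# Invented four-token sentence: token 2 is the root.
heads = [2, 0, 2, 3]
counts = child_counts(heads)
buckets = degree_buckets(counts)
print(counts, max(counts))

# Buckets reported above for VERB: leaves, one, two, three or more children.
reported = {0: 111, 1: 247, 2: 850, 3: 4652}
total = sum(reported.values())
print(total)  # 5860, matching the reported number of VERB tokens
```

The maximum of the per-node counts corresponds to the highest child degree.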
Children of VERB nodes are attached using 40 different relations: punct (4266; 18% instances), aux (3598; 15% instances), obj (2488; 11% instances), nsubj (1738; 7% instances), conj (1646; 7% instances), advmod (1483; 6% instances), cc (1345; 6% instances), obl (1055; 5% instances), expl (919; 4% instances), iobj (775; 3% instances), advcl (709; 3% instances), ccomp (693; 3% instances), mark (679; 3% instances), xcomp (465; 2% instances), obl:lmod (313; 1% instances), dep (160; 1% instances), obl:tmod (124; 1% instances), discourse (106; 0% instances), det (103; 0% instances), vocative (98; 0% instances), csubj (90; 0% instances), parataxis (77; 0% instances), obl:arg (63; 0% instances), acl:relcl (62; 0% instances), nmod:tmod (51; 0% instances), case (41; 0% instances), advmod:emph (40; 0% instances), nummod (35; 0% instances), aux:q (30; 0% instances), orphan (25; 0% instances), nsubj:pass (20; 0% instances), aux:pass (12; 0% instances), amod (11; 0% instances), dislocated (10; 0% instances), csubj:pass (6; 0% instances), nmod (6; 0% instances), obl:agent (5; 0% instances), acl (3; 0% instances), expl:impers (3; 0% instances), expl:pv (2; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: PUNCT (4266; 18% instances), NOUN (4171; 18% instances), AUX (3657; 16% instances), VERB (3557; 15% instances), PRON (2971; 13% instances), CCONJ (1289; 6% instances), ADV (1134; 5% instances), SCONJ (575; 2% instances), PART (559; 2% instances), DET (375; 2% instances), PROPN (312; 1% instances), ADJ (248; 1% instances), ADP (123; 1% instances), NUM (72; 0% instances), INTJ (39; 0% instances), X (7; 0% instances)