This page pertains to UD version 2.

UD_Tagalog-TRG

UD_Tagalog-Ugnayan

Tokenization and Word Segmentation

  • UD_Tagalog-TRG contains 55 sentences and 292 tokens.
  • UD_Tagalog-Ugnayan contains 94 sentences, 1011 tokens and 1097 syntactic words.
  • UD_Tagalog-TRG contains 59 tokens (20%) that are not followed by a space.
  • UD_Tagalog-Ugnayan contains 154 tokens (15%) that are not followed by a space.
  • Neither corpus contains words with spaces.
  • UD_Tagalog-TRG does not contain words that contain both letters and punctuation.
  • UD_Tagalog-Ugnayan contains 9 types of words that contain both letters and punctuation. Examples: -ng, 't, 'y, Isa-isa, Kapaki-pakinabang, Paulit-ulit, nag-aalaga, nag-isip, nag-party
  • UD_Tagalog-Ugnayan contains 86 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
  • UD_Tagalog-Ugnayan has 49 types of multi-word tokens. Examples: maraming, isang, niyang, ilang, ibang, kakaibang, koryenteng, akong, aleng, dalang, kaniyang, lawang, pang, pinakamahabang, siyang, taong, Anong, Hilagang, Mukhang, Nagmukhang, Puting, Sandaling, bansang, bolang, daang, ding, estudyanteng, inyong, itong, kabutihang, kaming, kasamang, kilalang, limang, mahabang, malaking, malalaking, mapigilang, mong, natitirang, ngayong, ninyong, nitong, pangunahing, payasong, pinakamalaking, probinsyang, sinabing, turistang.
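
Counts like the ones above can in principle be recomputed from a treebank's CoNLL-U file. The following is a minimal sketch under stated assumptions, not the script that generated this page: `tokenization_stats`, `row` and `SAMPLE` are hypothetical names, and the sample is a hand-made toy (all annotation columns are left as `_`, and the split of Maraming into two syntactic words is assumed for illustration only).

```python
def tokenization_stats(conllu_text):
    """Count sentences, surface tokens, syntactic words and multi-word tokens."""
    stats = {"sentences": 0, "tokens": 0, "words": 0, "mwt": 0, "no_space_after": 0}
    in_sentence = False
    covered_until = 0  # highest word ID covered by the current multi-word token
    for line in conllu_text.splitlines():
        if not line.strip():          # a blank line closes the sentence
            in_sentence = False
            continue
        if line.startswith("#"):      # sentence-level comment
            continue
        cols = line.split("\t")
        if not in_sentence:
            in_sentence = True
            covered_until = 0
            stats["sentences"] += 1
        word_id, misc = cols[0], cols[9]
        no_space = "SpaceAfter=No" in misc.split("|")
        if "." in word_id:            # empty node: neither a token nor a word
            continue
        if "-" in word_id:            # multi-word token range such as 1-2
            covered_until = int(word_id.split("-")[1])
            stats["mwt"] += 1
            stats["tokens"] += 1
            stats["no_space_after"] += no_space
            continue
        stats["words"] += 1
        if int(word_id) > covered_until:  # the word is its own surface token
            stats["tokens"] += 1
            stats["no_space_after"] += no_space
    return stats


def row(word_id, form, misc="_"):
    # Hypothetical helper: fills the seven unused CoNLL-U columns with "_".
    return "\t".join([word_id, form] + ["_"] * 7 + [misc])


# Toy sentence written for this sketch, not taken from either treebank.
SAMPLE = "\n".join([
    "# text = Maraming bata.",
    row("1-2", "Maraming"),
    row("1", "Marami"),
    row("2", "-ng"),
    row("3", "bata", "SpaceAfter=No"),
    row("4", "."),
    "",
])

print(tokenization_stats(SAMPLE))
```

The same relationship gives a quick consistency check on the reported figures: 1011 tokens plus 86 multi-word tokens, each contributing on average one extra syntactic word, yields 1097 syntactic words.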

Morphology

Tags

  • UD_Tagalog-TRG contains 2 word types tagged as particles (PART): Hindi, ng
  • UD_Tagalog-Ugnayan contains 10 word types tagged as particles (PART): 'y, -ng, ay, ba, e, hindi, na, nang, ng, po
  • UD_Tagalog-TRG contains 3 lemmas tagged as pronouns (PRON): ang, ko, ng
  • UD_Tagalog-Ugnayan contains 13 lemmas tagged as pronouns (PRON): akin, ako, ano, ikaw, inyo, ito, kami, kayo, ko, lahat, sila, siya, tayo
  • UD_Tagalog-TRG contains 1 lemma tagged as a determiner (DET): ang
  • UD_Tagalog-Ugnayan contains 4 lemmas tagged as determiners (DET): bawat, ilan, marami, mga
  • In UD_Tagalog-TRG, 1 lemma occurred sometimes as PRON and sometimes as DET: ang
  • UD_Tagalog-TRG contains 1 lemma tagged as an auxiliary (AUX): may
  • UD_Tagalog-Ugnayan contains no lemmas tagged as auxiliaries (AUX).
  • UD_Tagalog-Ugnayan does not use the VerbForm feature.
  • UD_Tagalog-TRG uses the VerbForm value Inf
    • VERB: Kailangan
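
The lemma lists per tag, and the check for lemmas that occur as both PRON and DET, can be sketched as a set computation over the LEMMA and UPOS columns of a CoNLL-U file. This is only an illustration: `lemmas_by_upos`, `row` and `SAMPLE` are hypothetical names, and the rows are a toy fragment written for this sketch rather than real treebank sentences.

```python
from collections import defaultdict


def lemmas_by_upos(conllu_text):
    """Map each UPOS tag to the set of lemmas that carry it."""
    lemmas = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():     # skip multi-word token ranges and empty nodes
            continue
        lemmas[cols[3]].add(cols[2])  # cols[3] is UPOS, cols[2] is LEMMA
    return lemmas


def row(word_id, form, lemma, upos):
    # Hypothetical helper: fills the six unused CoNLL-U columns with "_".
    return "\t".join([word_id, form, lemma, upos] + ["_"] * 6)


# Toy rows, one per "sentence"; the tags are chosen to show the overlap case.
SAMPLE = "\n".join([
    row("1", "Ang", "ang", "DET"),
    "",
    row("1", "ang", "ang", "PRON"),
    "",
    row("1", "ko", "ko", "PRON"),
    "",
])

tags = lemmas_by_upos(SAMPLE)
both = tags["PRON"] & tags["DET"]  # lemmas seen with both tags
print(sorted(both))
```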

Nominal Features

  • Fem
    • ADJ: Komika
    • NOUN: Biyuda, maestra
    • PROPN: Linda
  • Masc
    • NOUN: Biyudo, maestro
  • Coll
    • NOUN: pagkain, alon, gulayan, ingay
  • Dual
    • PRON: kita
  • Plur
    • NOUN: araw
  • Sing
    • DET: si
    • NOUN: nanay, titser, babae, bahay, bangka, bata, Artista, Isda, istudyante, diyaryo
    • PRON: ko, ako, Iyan, Siya, ito
    • PROPN: Juan, Linda
  • Abs
    • ADP: ang
    • DET: ang, si
    • PRON: ako, Siya, kita
  • Dat
    • ADP: sa
  • Erg
    • ADP: ng
    • PART: ng
    • PRON: niya, ko
  • Loc
    • ADP: Nasa
  • Def
    • DET: ang
  • Ind
    • DET: ang

Degree and Polarity

  • Pos
    • ADJ: Bago, Mabuti, Maganda, Matamis, Pagod
  • Sup
    • ADJ: bago
  • Neg
    • PART: Hindi

Verbal Features

  • Hab
    • VERB-Inf: Kailangan
  • Imp
    • VERB: nagluluto, Binabasa, Bumabasa, Dumarating, Humihinga, Naghihilik, Nauuhaw, Tumatanda
  • Perf
    • VERB: nagluto, dumating, yumaman, Binalikan, Binigyan, Binili, Bumili, Ginising, Gumising, Ibinigay
  • Prosp
    • VERB: Magluluto
  • Ind
    • VERB: nagluluto, nagluto, dumating, yumaman, Binabasa, Binalikan, Binigyan, Binili, Bumabasa, Bumili
    • VERB-Inf: Kailangan

Pronouns, Determiners, Quantifiers

  • Dem
    • PRON: Iyan, ito
  • Prs
    • PRON: Siya
  • Yes
    • ADP: Sa
    • PRON: niya
  • 1
    • PRON: ko, ako
  • 3
    • PRON: Siya

Other Features

Syntax

Auxiliary Verbs and Copula

  • UD_Tagalog-TRG uses 1 lemma as a copula (cop): may.
  • UD_Tagalog-Ugnayan does not contain copulas.
  • Neither corpus contains auxiliaries.

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
  • nsubj (UD_Tagalog-TRG)
    • VERB--NOUN (27)
    • VERB--NOUN-ADP(ang) (1)
    • VERB--PRON-Abs (3)
    • VERB-Inf--PRON-Erg (1)
  • nsubj (UD_Tagalog-Ugnayan)
    • VERB--NOUN (1)
    • VERB--NOUN-ADP(ang) (24)
    • VERB--PRON (25)
  • obj (UD_Tagalog-TRG)
    • VERB--NOUN-ADP(ng) (16)
    • VERB--NOUN-ADP(ng)-ADP(ng) (1)
    • VERB--NOUN-ADP(sa) (1)
    • VERB--PRON-Erg (4)
    • VERB-Inf--NOUN (1)
  • obj (UD_Tagalog-Ugnayan)
    • VERB--NOUN (13)
    • VERB--NOUN-ADP(ng) (17)
    • VERB--PRON (19)
  • iobj
    • VERB--NOUN (1)
    • VERB--NOUN-ADP(sa) (1)
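
Pattern labels such as VERB--NOUN-ADP(ng) or VERB--PRON-Erg look like they combine the parent's tag (plus VerbForm), the child's tag (plus Case), and the lemmas of any case-marking ADP attached to the child. The sketch below reconstructs patterns under that reading; it is a guess at the labelling scheme, not the generator behind this page. All function names are hypothetical, and the sample sentence with its annotation was written by hand for illustration.

```python
from collections import Counter

CORE = {"nsubj", "obj", "iobj"}


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Erg|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def sentences(conllu_text):
    """Yield each sentence as a list of column lists (plain words only)."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multi-word token ranges and empty nodes
            sent.append(cols)
    if sent:
        yield sent


def argument_patterns(conllu_text):
    """Count (relation, pattern) pairs for verb parents and NOUN/PRON children."""
    counts = Counter()
    for sent in sentences(conllu_text):
        by_id = {c[0]: c for c in sent}
        for c in sent:
            deprel = c[7].split(":")[0]  # ignore relation subtypes
            head = by_id.get(c[6])       # None for the root (HEAD = 0)
            if deprel not in CORE or head is None:
                continue
            if head[3] != "VERB" or c[3] not in {"NOUN", "PRON"}:
                continue
            parent = "VERB"
            verb_form = parse_feats(head[5]).get("VerbForm")
            if verb_form:
                parent += "-" + verb_form
            child = c[3]
            case = parse_feats(c[5]).get("Case")
            if case:
                child += "-" + case
            for d in sent:  # case-marking adpositions attached to the child
                if d[6] == c[0] and d[7] == "case" and d[3] == "ADP":
                    child += "-ADP(" + d[2] + ")"
            counts[(deprel, parent + "--" + child)] += 1
    return counts


def row(word_id, form, lemma, upos, head, deprel, feats="_"):
    # Hypothetical helper: XPOS, DEPS and MISC are left as "_".
    return "\t".join([word_id, form, lemma, upos, "_", feats, head, deprel, "_", "_"])


# Toy sentence with hand-written annotation; lemmas other than ADP are omitted.
SAMPLE = "\n".join([
    row("1", "Bumili", "_", "VERB", "0", "root"),
    row("2", "ng", "ng", "ADP", "3", "case"),
    row("3", "isda", "_", "NOUN", "1", "obj"),
    row("4", "ang", "ang", "ADP", "5", "case"),
    row("5", "bata", "_", "NOUN", "1", "nsubj"),
    row("6", ".", "_", "PUNCT", "1", "punct"),
    "",
])

for (relation, pattern), n in sorted(argument_patterns(SAMPLE).items()):
    print(relation, pattern, n)
```

Whether a case marker such as ang is tagged ADP or DET changes the resulting label, which would explain why the same relation shows different dominant patterns in the lists above.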

Relations Overview
