Skip to content

eroux/tibetan-verbs-database

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tibetan Verbs Database (TVD)

This repository aims at being a database of Tibetan verbs in their four forms (past, present, future, imperative).

To access the database directly, click on db.csv.

News

The verbiator by Nathan Hill is now included! Nathan Hill allowed the publication of his source file under CC0 license. We are currently working on its integration

Sources

Its sources (tagged in each verb) are TDC,GT, PH and KN, see bibliography for references.

Other sources would be welcome:

  • དག་ཡིག་གསར་བསྒྲིག dictionary

Sources purposedly not included:

  • བོད་སྐད་ཀྱི་བརྡ་སྤྲོད་གསར་བསྒྲིགས་སྨྲ་སྒོའི་ལྡེ་མིག by གླང་རི་བ་ཐུབ་བསྟན་སྦྱིན་པ།, 2010 (mosty covered by this list, differences look like mistakes)

Notes

The auxiliary ཤོག present in PH has not been added in this database.

Some verbs appear in the main section of the TDC but not in the verb list at the end (འཁུས for instance), they are noted as included in the TDC. For some of these, the different tense forms do not appear in the dictionnary, so only the present form is indicated.

Method

The initial idea was to make a list with the correct forms of as many verbs as possible, with the second suffix ད. This is impossible though, as each Tibetan grammarian can have a different point of view on verb forms (which is surprising for a Westerner). So this database contains the raw data from the different sources, untouched: no choice has been made about the correctness of the different forms.

What can I do with it?

There are already many things possible with this database script:

  • get all syllables that may contain a second suffix ད
  • get all syllables that may be verbs
  • research one syllable in the database

If you want to constitute a simple verb list with translation or so, you'll have to merge the lists by hand and choose between the different forms. The script may help you with that in the future.

Format

The database is in CSV, easily readable by any text editor, LibreOffice Calc or Microsoft Excel (requiring a few steps).

Each source is indicated in the 5, 6 or 7th column, by TDC, PH or GT.

The order is present, past, future, imperative.

The database is separated in multiple files, one by letter. They are in alphabetical order.

Second suffix ད is indicated between tibetan parenthesis: ༼ད༽. Alternate ortographies are also indicated between tibetan parenthesis.

Contributing

There are dozens of Tibetan books and dictionaries about verbs, contributions are very welcome, especially on providing sources, scanned or in Unicode.

What about copyright?

Nathan Hill explicitely allowed distribution of his data under the CC0 license.

For other files, there is (to the extend of the author's knowledge) no copyright on languages, so no copyright on grammar. According to their authors, these lists are part of the Tibetan grammar (each grammarian thinks his verb list is the real Tibetan verb list), so there is no copyright on the verb list.

There is a copyright on the source books edition and also on the definition of the verbs given in these sources, but these do no appear in this database.

So this database has no copyright and can be considered public domain. If you really need a copyright, you can consider it is under the CC0 license.

About

simple CSV database if Tibetan verbs

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published