Skip to content

lggruspe/ipa-tokenizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ipa-tokenizer

IPA transcription tokenizer

Usage example

from ipa_tokenizer.tokenizer import tokenize

tokens = tokenize("ˈtoʊ.kən.aɪz", language="en")
print(tokens)
# ['t', 'oʊ', 'k', 'ə', 'n', 'aɪ', 'z']

License

Copyright 2023 Levi Gruspe

GPLv3 or later.

Attributions

This repository contains some data files that are derived from works that are licensed under CC BY-SA 3.0 licenses. The copyright of the original works belong to their authors. PHOIBLE 2.0 is by Steven Moran and Daniel McCloy. Wiktionary is by its editors and contributors.

Derivative works:

These derivative works are made available under a CC BY-SA 3.0 license.