Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing Pinyin in several characters. e.g.: 成绩 #7

Open
jeffreyxuan opened this issue May 25, 2021 · 5 comments
Open

Missing Pinyin in several characters. e.g.: 成绩 #7

jeffreyxuan opened this issue May 25, 2021 · 5 comments

Comments

@jeffreyxuan
Copy link

jeffreyxuan commented May 25, 2021

Thank you for making these wonderful fonts!
素晴らしいフォントを作くりましたて,
どうもありがとうございます, ごくろうさまでした.
私は日本語があまり上手ではありませんですから,
英語を使うのを許してお願いします

It would be nice if we can have more pinyin added to several characters.

- e.g.: Currently there is only "jī" for 成"绩", however it should be "jì" in 普通話

- e.g.: Currently there is no tone 5 "yi" "bu" for 對"不"對, 去"不"去, 跑"一"跑...
  ref: http://www.hakuteisha.co.jp/books/8-033-4naiyou.pdf

- e.g.: Currently these is only "qǐ" for "企"鵝, but we need "qì" in 國語

I did a compare between the mengshen pinyin in "marged-mapping-table.txt", 國語字典"簡編本" and 普通話字典"CEDICT". Please check the results in attached files:

overwrite_20210525.txt :
all missing pinyin, mengshen overwrite.txt format

mengshen_missing_yin_20210525.xlsx:
A table to show detail missing parts for each line in the overwrite_20210525.txt . It shows the pinyin is missing in 普通話 or 國語 or both.

mengshen_missing_yin_20210525.xlsx

overwrite_20210525.txt

Please consider to merge overwrite_20210525.txt into your next release.

And, as you know the project "BPMF IVS" has an IVS tool to help the poyin多音字 correction auto/manually for Zhuyin. I have modified a Pinyin version that compatible with 萌神 fonts. With this tool, users can do auto or manually choose the poyin多音字 with 萌神明朝体, and copy the results to any other software like Word/LibreOffice...

20210525-1

萌神明朝体 汉语拼音IVS字型自动选多音字输入工具
https://toneoz.com/ime

How to use:
日本語:
https://toneoz.com/blog/2021/05/27/ivsjp/
简体中文
https://toneoz.com/blog/2021/05/25/ivs/
繁體中文
https://toneoz.com/blog/2021/05/21/ivsime/

If you are interested, you can try to type in the following examples to https://toneoz.com/ime , and click "普通話校正" to see the reults, and then copy to your Office Word:

成績 成绩
爸爸 媽媽
行程 銀行 樹行子 道行
占卜 了得 切削
正好喜好的生長長度
正好喜好的生长长度

All the missing pinyin will also be detected in https://toneoz.com/ime , and highlighted with grid lines background.

Thanks you again for your consideration :)

@NightFurySL2001
Copy link
Contributor

@jeffreyxuan Thank you a lot for your interest and willingness to expand this project here and with LXGW Wenkai. I just have a question: is there any reliable source that we can obtain the pinyin from (or the override files you provided)?

@jeffreyxuan
Copy link
Author

@NightFurySL2001 您好:
I just shared the pinyin data that I used in toneoz-font-pinyin-wenkai Github:
https://github.com/jeffreyxuan/toneoz-font-pinyin-wenkai/blob/master/sources/outputs/marged-mapping-table.txt

This pinyin data is based on my personal research. You can check the detail from here:
https://toneoz.com/blog/usagewordbook/#pinyindata

For a "reliable source": The "UniHan" project, which referenced by Mengshen, probably is a choice. They have updated a lot after the last version of Mengshen pinyin
https://unicode.org/charts/unihan.html
http://www.unicode.org/Public/UCD/latest/ucd/Unihan.zip

@jeffreyxuan
Copy link
Author

@NightFurySL2001 and here, "reliable source":
https://github.com/mozillazg/pinyin-data

@MaruTama
Copy link
Owner

MaruTama commented Nov 2, 2021

Hi @jeffreyxuan -san!

Sorry I'm late...
Thank you for creating overwrite_20210525.txt!
I will confirm and merge it.

素晴らしいフォントを作くりましたて,
どうもありがとうございます, ごくろうさまでした.
私は日本語があまり上手ではありませんですから,
英語を使うのを許してお願いします

丁寧に日本語でコメントしていただけて嬉しいです😊
こちらこそ、改善点をご指摘してくださりありがとうございます。

@jeffreyxuan
Copy link
Author

jeffreyxuan commented Nov 4, 2021

Hi @MaruTama さん、

Welcome back :) For the "overwrite_20210525.txt" : This file could be out of date. For the latest version please check here:
https://github.com/jeffreyxuan/toneoz-font-pinyin-wenkai/blob/master/sources/outputs/marged-mapping-table.txt

Sorry for the inconvenience, now I only have a program to generate to the final marged-mapping-table.txt , I had another program to generate the "overwrite_*.txt" but I can't find where it is.

The latest version is the one I used to generate the current version 拼音文楷 font, it has fixed may bugs, for detail please check the file history:
jeffreyxuan/toneoz-font-pinyin-wenkai@c133754

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants