follow opencc conversion chain #688

eagleoflqj · 2023-08-10T04:51:00Z

Pull request

Issue tracker

Fixes will automatically close the related issue

Fixes #652

Feature

Describe feature of pull request

Unit test

Done

Manual test

Done

EDIT: the screenshot actually shows a clear bug 🤦‍♂️. See below for a corrected one.

Code Review

Unit and manual test pass
GitHub Action CI pass
At least one contributor reviews and votes
Can be merged clean without conflicts
PR will be merged by rebase upstream base

Additional Info

src/rime/gear/simplifier.cc

eagleoflqj · 2023-08-12T00:42:23Z

src/rime/gear/simplifier.cc

groverlynn · 2023-09-04T18:52:53Z

This is still wrong.
E.g. 才能 (zh-Hans) should be converted to 才能 and 纔能 (zh-Hant), but the two should agains be merged to 才能 (zh-TW / zh-HK). However, this algorithm can filter out 纔 but not 纔能 in zh-TW / zh-HK.

Basically, it only works with mono-character and fails when there are terms and phrases in the text

eagleoflqj · 2023-09-04T19:36:03Z

The restriction from OpenCC determines it’s not easy to be perfect if possible. Current status is more acceptable than previous as at least correct result is available to users. Treat extraneous words just as irrelevant ones.

groverlynn · 2023-09-05T08:44:25Z

it has nothing to do with OpenCC. On the contrary, OpenCC gives correct results in such cases. The problem is the implementation of conversion chain in rime

* follow opencc conversion chain * when a dict doesn't contain a word, pass as-is * de-duplication

follow opencc conversion chain

2836363

eagleoflqj marked this pull request as ready for review August 11, 2023 01:38

amorphobia reviewed Aug 11, 2023

View reviewed changes

src/rime/gear/simplifier.cc Outdated Show resolved Hide resolved

when a dict doesn't contain a word, pass as-is

bbf99f3

eagleoflqj requested review from amorphobia and lotem and removed request for amorphobia August 12, 2023 00:56

amorphobia reviewed Aug 12, 2023

View reviewed changes

src/rime/gear/simplifier.cc Outdated Show resolved Hide resolved

amorphobia approved these changes Aug 12, 2023

View reviewed changes

de-duplication

58f6c0c

lotem merged commit 75e6b1a into rime:master Aug 12, 2023
5 checks passed

eagleoflqj deleted the conversion-chain branch August 12, 2023 15:51

groverlynn mentioned this pull request Sep 18, 2023

chain conversion #715

Merged

2 tasks

groverlynn pushed a commit to groverlynn/librime that referenced this pull request Sep 27, 2023

follow opencc conversion chain (rime#688)

852023c

* follow opencc conversion chain * when a dict doesn't contain a word, pass as-is * de-duplication

graphemecluster pushed a commit to TypeDuck-HK/librime that referenced this pull request Nov 2, 2023

follow opencc conversion chain (rime#688)

68dd642

* follow opencc conversion chain * when a dict doesn't contain a word, pass as-is * de-duplication

graphemecluster pushed a commit to TypeDuck-HK/librime that referenced this pull request Nov 8, 2023

follow opencc conversion chain (rime#688)

5b4a758

* follow opencc conversion chain * when a dict doesn't contain a word, pass as-is * de-duplication

graphemecluster pushed a commit to TypeDuck-HK/librime that referenced this pull request Mar 18, 2024

follow opencc conversion chain (rime#688)

557a588

* follow opencc conversion chain * when a dict doesn't contain a word, pass as-is * de-duplication

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

follow opencc conversion chain #688

follow opencc conversion chain #688

eagleoflqj commented Aug 10, 2023 •

edited

Loading

eagleoflqj commented Aug 12, 2023

groverlynn commented Sep 4, 2023 •

edited

Loading

eagleoflqj commented Sep 4, 2023

groverlynn commented Sep 5, 2023

follow opencc conversion chain #688

follow opencc conversion chain #688

Conversation

eagleoflqj commented Aug 10, 2023 • edited Loading

Pull request

Issue tracker

Feature

Unit test

Manual test

Code Review

Additional Info

eagleoflqj commented Aug 12, 2023

groverlynn commented Sep 4, 2023 • edited Loading

eagleoflqj commented Sep 4, 2023

groverlynn commented Sep 5, 2023

eagleoflqj commented Aug 10, 2023 •

edited

Loading

groverlynn commented Sep 4, 2023 •

edited

Loading