Optimization of Underthesea Codebase Size #724

Open
3 tasks
rain1024 opened this issue Oct 1, 2023 · 0 comments

rain1024 commented Oct 1, 2023

As of release 6.8.0, the zipped underthesea codebase is 73.9MB, which is too large. The goal is to reduce it to approximately 10MB.
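To see where most of the size comes from, a quick audit of a local checkout can help. The following is a minimal sketch, not part of underthesea: the function names are made up for illustration, and it reports uncompressed on-disk sizes, so the totals will differ from the zipped figure.

```python
import os
from collections import defaultdict


def size_by_top_level_dir(root):
    """Return {top-level entry: total bytes} for all files under root.

    Files directly in root are grouped under the key ".".
    """
    totals = defaultdict(int)
    for dirpath, dirnames, filenames in os.walk(root):
        # Assumption: .git metadata is not part of the release archive,
        # so it is pruned from the walk.
        dirnames[:] = [d for d in dirnames if d != ".git"]
        rel = os.path.relpath(dirpath, root)
        top = "." if rel == "." else rel.split(os.sep)[0]
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                totals[top] += os.path.getsize(path)
    return dict(totals)


def largest_entries(root, limit=10):
    """Return the `limit` largest top-level entries as (name, bytes) pairs."""
    totals = size_by_top_level_dir(root)
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)[:limit]
```

Calling `largest_entries(".")` from the repository root returns the heaviest top-level directories first, which gives a rough ranking of what to move out.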

Proposed Strategies:

  • Sub-repository Allocation for Datasets
    Move each dataset into its own sub-repository so that dataset storage is separated from the main codebase.

  • Eliminate Storage of Binary Models
    Do not store binary models in the codebase. For reference, binary models are currently stored here: models/ws_crf_vlsp2013_20230727

  • Code Refactoring
    Refactor the code to improve its structure, readability, and maintainability, which can also reduce the overall size of the codebase.
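One way to keep binary models out of the codebase is to fetch them on first use and cache them locally. The sketch below only illustrates that idea and is not underthesea's actual mechanism: `MODEL_BASE_URL`, `CACHE_DIR`, and `ensure_model` are hypothetical names, and the URL is a placeholder.

```python
import os
import urllib.request
from pathlib import Path

# Hypothetical values, for illustration only.
MODEL_BASE_URL = "https://example.com/models"
CACHE_DIR = Path.home() / ".example_model_cache"


def ensure_model(name, cache_dir=CACHE_DIR, fetch=urllib.request.urlretrieve):
    """Return the local path of a model file, downloading it if missing.

    `fetch(url, filename)` performs the download; it is a parameter so a
    different downloader (or a stub in tests) can be substituted.
    """
    cache_dir = Path(cache_dir)
    target = cache_dir / name
    if target.exists():
        return target  # already cached, no network access needed
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Download to a temporary name first so an interrupted transfer
    # never leaves a partial file at the final path.
    tmp = target.with_name(target.name + ".part")
    fetch(f"{MODEL_BASE_URL}/{name}", str(tmp))
    os.replace(tmp, target)
    return target
```

With this pattern the repository would ship only the code, and each model is downloaded once, the first time it is requested.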

rain1024 changed the title from "Optimization of Underthesea Codebase Dimension" to "Optimization of Underthesea Codebase Size" on Oct 1, 2023