{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":138050436,"defaultBranch":"main","name":"LASER","ownerLogin":"facebookresearch","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2018-06-20T15:18:09.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/16943930?v=4","public":true,"private":false,"isOrgOwned":true},"refInfo":{"name":"","listCacheKey":"v0:1700494144.0","currentOid":""},"activityList":{"items":[{"before":"982fb46dd51e9a17581b57ba9eb8881ca5fa5779","after":"d7e2544234c1d2a7076280944bdc2637f98ef3c2","ref":"refs/heads/main","pushedAt":"2024-05-02T20:24:47.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"up version to 0.0.2\n\nSigned-off-by: David Dale ","shortMessageHtmlLink":"up version to 0.0.2"}},{"before":"74bc6ae7ffb7cba670fe0f428d0ede9f666fb821","after":"982fb46dd51e9a17581b57ba9eb8881ca5fa5779","ref":"refs/heads/main","pushedAt":"2024-05-02T17:57:33.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Fix join of base url for windows machines (#282)","shortMessageHtmlLink":"Fix join of base url for windows machines (#282)"}},{"before":"3b255b3213a27f28d14dde3d7609d9000fc63951","after":"74bc6ae7ffb7cba670fe0f428d0ede9f666fb821","ref":"refs/heads/main","pushedAt":"2023-12-09T12:37:10.000Z","pushType":"pr_merge","commitsCount":2,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #276 from NIXBLACK11/SentimentAnalysis\n\nSentiment Analysis Tutorial using laser","shortMessageHtmlLink":"Merge pull request #276 from NIXBLACK11/SentimentAnalysis"}},{"before":"dc1c68e670cc25135686452892e5ff0864137c55","after":"3b255b3213a27f28d14dde3d7609d9000fc63951","ref":"refs/heads/main","pushedAt":"2023-12-07T19:22:56.000Z","pushType":"pr_merge","commitsCount":8,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #275 from Paulooh007/laser-clustering\n\nAdd laser clustering example.","shortMessageHtmlLink":"Merge pull request #275 from Paulooh007/laser-clustering"}},{"before":"91f648f281a1a3831e5e784887b575963a2f18af","after":"83c07d3dc054d6a76da572fee654efc3787c73a4","ref":"refs/heads/MLH-dev","pushedAt":"2023-12-07T15:19:01.000Z","pushType":"pr_merge","commitsCount":26,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #274 from NIXBLACK11/Sentiment-analysis-laser\n\nSentiment analysis laser","shortMessageHtmlLink":"Merge pull request #274 from NIXBLACK11/Sentiment-analysis-laser"}},{"before":"995c2f782e85f847d1f6c73d4137d54ef4a22073","after":"dc1c68e670cc25135686452892e5ff0864137c55","ref":"refs/heads/main","pushedAt":"2023-12-01T08:55:52.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Fix outdated Dockerfile and Flask app (#251)\n\n* Add requirements needed for laser_encoders\r\n\r\n* Add script to use laser_encoder\r\n\r\n* Update flask app to use laser_encoders\r\n\r\n* Update Dockerfile to build image for laser_encoder\r\n\r\n* Update README to setup docker\r\n\r\n* Update README\r\n\r\n* style: Format code and sort imports using black and isort\r\n\r\n* Update Dockerfile to include maintainer\r\n\r\n* Update README for docker setup\r\n\r\n* Remove unesssary file in docker directory\r\n\r\n* Enable pip installing laser_encoders from local directory\r\n\r\n* Fix pip install error while building docker container\r\n\r\n* Add error handling for unsupported languages in /vectorize endpoint\r\n\r\n* Add language model download to Docker build process\r\n\r\n* Create cache for encoder to improve subsequent request speed\r\n\r\n* Add build arguments to predownload encoders and tokenizers\r\n\r\n* Update README on usage\r\n\r\n* Update README\r\n\r\n* Change default lang to 2 letter code\r\n\r\n* Update README to indicate language used in default build\r\n\r\n* Update Dockerfile to use toml file instead of requirements file\r\n\r\n* Improve caching for laser2 languages\r\n\r\n* Fix faulty caching logic","shortMessageHtmlLink":"Fix outdated Dockerfile and Flask app (#251)"}},{"before":"a7905b98a93b5a9e564a5c9b2933b408a5d73598","after":"995c2f782e85f847d1f6c73d4137d54ef4a22073","ref":"refs/heads/main","pushedAt":"2023-11-30T18:30:47.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"pxsim release","shortMessageHtmlLink":"pxsim release"}},{"before":"9cde37a43a5c7ea3d1c6d3d9da3efab5eaa2b178","after":"91f648f281a1a3831e5e784887b575963a2f18af","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-21T16:22:40.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Merge branch 'main' into MLH-dev\n\nlet's continue developing in this branch","shortMessageHtmlLink":"Merge branch 'main' into MLH-dev"}},{"before":"5b9820b0a632df9bb4a5195e002024b3b4227a30","after":"a7905b98a93b5a9e564a5c9b2933b408a5d73598","ref":"refs/heads/main","pushedAt":"2023-11-21T15:55:28.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"MLH fellowship contribution: adding the `laser_encoders` module (#249)\n\n* feat: converted SPMapply function to use python script\r\n\r\n* modified laserTokenizer class to have a seperate function for tokenizing a file\r\n\r\n* modified tokenize_file function\r\n\r\n* removed instances of Path\r\n\r\n* created new function for opening files\r\n\r\n* test for LaserTokenizer.tokenize\r\n\r\n* tests for normalisation, descape and lower_case\r\n\r\n* deleted test dir because of relative import error\r\n\r\n* modified test tokenizer function to use the downloaded model before exiting the context manager\r\n\r\n* test for tokenize_file\r\n\r\n* added test for is_printable\r\n\r\n* test for over_write when equal to True and False\r\n\r\n* added some type hints for tests\r\n\r\n* added type hint for log function\r\n\r\n* added header comment\r\n\r\n* feat: make LASER pip installable (#239)\r\n\r\n* feat: make LASER pip installable\r\n\r\n* Added GitHub Actions workflow for tests and linting\r\n\r\n* upgraded python version due to node depreciation error\r\n\r\n* removed updated python version\r\n\r\n* removed poetry\r\n\r\n* bug fixes\r\n\r\n* removed dependencies install\r\n\r\n* updated pyproject and made lint_and_test to install dev and mono dependencies\r\n\r\n* removed isort and black\r\n\r\n* removed mono dependencies\r\n\r\n* removed version from pyproject\r\n\r\n* removed duplicate of classifiers\r\n\r\n* removed description\r\n\r\n* removed dynamic\r\n\r\n* added src-layout to discover only laser_encoder\r\n\r\n* added build backend\r\n\r\n* updated project name\r\n\r\n* changed license to BSD\r\n\r\n* removed src-layout to test\r\n\r\n* added linting to actions\r\n\r\n* updated linting to only check the laser_encoders folder\r\n\r\n* fixed linting issues\r\n\r\n* fixed black linting issues\r\n\r\n* added white-space\r\n\r\n* Refactor embedder (#241)\r\n\r\n* feat: make LASER pip installable\r\n\r\n* Added GitHub Actions workflow for tests and linting\r\n\r\n* upgraded python version due to node depreciation error\r\n\r\n* removed updated python version\r\n\r\n* removed poetry\r\n\r\n* bug fixes\r\n\r\n* removed dependencies install\r\n\r\n* updated pyproject and made lint_and_test to install dev and mono dependencies\r\n\r\n* removed isort and black\r\n\r\n* removed mono dependencies\r\n\r\n* removed version from pyproject\r\n\r\n* removed duplicate of classifiers\r\n\r\n* removed description\r\n\r\n* removed dynamic\r\n\r\n* added src-layout to discover only laser_encoder\r\n\r\n* added build backend\r\n\r\n* updated project name\r\n\r\n* changed license to BSD\r\n\r\n* removed src-layout to test\r\n\r\n* added linting to actions\r\n\r\n* updated linting to only check the laser_encoders folder\r\n\r\n* fixed linting issues\r\n\r\n* fixed black linting issues\r\n\r\n* added white-space\r\n\r\n* refactored emmbeder to work in the laser tokenizer package\r\n\r\n* downgraded numpy version to suit the installled python version\r\n\r\n* added test for sentence encoder\r\n\r\n* added whitespace to test workflow\r\n\r\n* restructured test for sentence encoder\r\n\r\n* restructured test for sentence encoder\r\n\r\n* fixed black issues\r\n\r\n* restructured test for sentence encoder\r\n\r\n* changed python version because of workflow error\r\n\r\n* updated dependencies requirements version\r\n\r\n* removed unneccessary print statement\r\n\r\n* updated python version\r\n\r\n* restructured test_sentence_encoder\r\n\r\n* restructured test_sentence encoder\r\n\r\n* black linting fixes\r\n\r\n* restructure calling of tempile module\r\n\r\n* updated workflow to remove pip cache\r\n\r\n* removed commented code\r\n\r\n* refactored code and added type hints\r\n\r\n* fixed black issues\r\n\r\n* fixed no module found error by adding Laser environment\r\n\r\n* feat: Add Python function to download LASER models (#244)\r\n\r\n* feat: make LASER pip installable\r\n\r\n* Added GitHub Actions workflow for tests and linting\r\n\r\n* upgraded python version due to node depreciation error\r\n\r\n* removed updated python version\r\n\r\n* removed poetry\r\n\r\n* bug fixes\r\n\r\n* removed dependencies install\r\n\r\n* updated pyproject and made lint_and_test to install dev and mono dependencies\r\n\r\n* removed isort and black\r\n\r\n* removed mono dependencies\r\n\r\n* removed version from pyproject\r\n\r\n* removed duplicate of classifiers\r\n\r\n* removed description\r\n\r\n* removed dynamic\r\n\r\n* added src-layout to discover only laser_encoder\r\n\r\n* added build backend\r\n\r\n* updated project name\r\n\r\n* changed license to BSD\r\n\r\n* removed src-layout to test\r\n\r\n* added linting to actions\r\n\r\n* updated linting to only check the laser_encoders folder\r\n\r\n* fixed linting issues\r\n\r\n* fixed black linting issues\r\n\r\n* added white-space\r\n\r\n* refactored emmbeder to work in the laser tokenizer package\r\n\r\n* downgraded numpy version to suit the installled python version\r\n\r\n* added test for sentence encoder\r\n\r\n* added whitespace to test workflow\r\n\r\n* restructured test for sentence encoder\r\n\r\n* restructured test for sentence encoder\r\n\r\n* fixed black issues\r\n\r\n* restructured test for sentence encoder\r\n\r\n* changed python version because of workflow error\r\n\r\n* updated dependencies requirements version\r\n\r\n* removed unneccessary print statement\r\n\r\n* updated python version\r\n\r\n* restructured test_sentence_encoder\r\n\r\n* restructured test_sentence encoder\r\n\r\n* black linting fixes\r\n\r\n* restructure calling of tempile module\r\n\r\n* updated workflow to remove pip cache\r\n\r\n* removed commented code\r\n\r\n* refactored code and added type hints\r\n\r\n* fixed black issues\r\n\r\n* fixed no module found error by adding Laser environment\r\n\r\n* feat:created download function for downloading laser models in python\r\n\r\n* added language list and made some changes to the download models\r\n\r\n* fixed linting issues\r\n\r\n* added type hints\r\n\r\n* fixed linting issues\r\n\r\n* added progress bar for downloading of models\r\n\r\n* fixed black issues\r\n\r\n* updated code to download laser model based on where the language is found\r\n\r\n* fixed black and linting issues\r\n\r\n* fixed black issues\r\n\r\n* fixed bug in sentence encoder\r\n\r\n* black issues and relative import issues\r\n\r\n* removed addition of laser path\r\n\r\n* fixed isort issues\r\n\r\n* refactored the python entrypoint functions\r\n\r\n* fixed black issues\r\n\r\n* updated laguage list with some laser2 and laser3 languages\r\n\r\n* refactor: added option for laser\r\n\r\n* added laser2 language list\r\n\r\n* added laser3 language list\r\n\r\n* fixed black issues\r\n\r\n* updated language list\r\n\r\n* refactoed download function to display total filesize in MB and also made some changes to raise an error when laser is not passed\r\n\r\n* fixed black issues\r\n\r\n* refactored download models to move model_dir to the class\r\n\r\n* fixed black issues\r\n\r\n* refactored laser tokenizer test to use the laser downloader class methods\r\n\r\n* documentation for the laser_encoder\r\n\r\n* added tokenizer part\r\n\r\n* added some docs for tokenize file and download models\r\n\r\n* updated readme to include supported flore200 langs\r\n\r\n* corrected readme path and license\r\n\r\n* added requirements for laser_encoder\r\n\r\n* added __main__.py file for running download command easily\r\n\r\n* black and isort fixes, updated docs to effect changes due to creation of __main__.py file\r\n\r\n* added contributors section\r\n\r\n* Revert \"added requirements for laser_encoder\"\r\n\r\nThis reverts commit 431780e1f133623a5c721313c372942054691d08.\r\n\r\nreverting back\r\n\r\n* reverting creation of main.py\r\n\r\n* fixed isort and black issues\r\n\r\n* removed irrelevant comment\r\n\r\n* moved pyproject to laser direcory and adjust contributors name\r\n\r\n* workflow issues due to removal of pyproject\r\n\r\n* pointed workflow to laser_encoders dir\r\n\r\n* fixed EOF error\r\n\r\n* fixed EOF error\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* debuging\r\n\r\n* bug fixes and new implementation of convert_tokens_to_id function\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* bug fix\r\n\r\n* reverting back because of workflow error\r\n\r\n* reverting back because of workflow error\r\n\r\n* some extra adjustment\r\n\r\n* changed ibo to igbo\r\n\r\n* updated doc to effect the ibo to igbo change\r\n\r\n* refactore: modified the sentence encoder to tokenize a text before encodingit\r\n\r\n* debugging failed test\r\n\r\n* added a call method to seperately handle the tokenization before encodding\r\n\r\n* added value error for when there is no spm_model\r\n\r\n* documentation for the new __call__ method for tokenization with encoder\r\n\r\n* docs: Update docs to include reference to laserembeddings (#254)\r\n\r\n* Handle Interrupted Model Weight Downloads (#253)\r\n\r\n* fix: Fix interrupted downloads issue\r\n\r\n* style: Format code using black\r\n\r\n* Update download method to use tempfile\r\n\r\n* style: Remove unnecessary space\r\n\r\n* Fix OSError by using shutil.move for cross-filesystem moves\r\n\r\nUsing os.rename caused an OSError when trying to move files across different filesystems (e.g., from /tmp to another directory).\r\nBy using shutil.move, we gracefully handle such situations,\r\nensuring files are moved correctly regardless of the source and destination filesystems.\r\n\r\n* Refactor `initialize_encoder` to `LaserEncoderPipeline` (#256)\r\n\r\n* Remove 'tokenize' argument from initialize_encoder function\r\n\r\n* Add LaserEncoderPipeline for streamlined tokenization and encoding\r\n\r\n* docs: Update README to show use of LaserEncoderPipeline\r\n\r\n* style: Reformat code using black\r\n\r\n* refactor: move encoder and tokenizer initialization into repective files\r\n\r\n* style: run black\r\n\r\n* test: Add test for LaserEncoderPipeline\r\n\r\n* test to validate languages\r\n\r\n* test to validate languages\r\n\r\n* Delete flores directory\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update .gitignore\r\n\r\n* added pytest to validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py using mock downloader\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Extend Tokenizer to Support Single Strings and Lists of Strings (#258)\r\n\r\n* Handle case for both str and list in tokenizer\r\n\r\n* test: Add test for tokenizer call method\r\n\r\n* Rename 'sentences' argument to 'text_or_batch' for clarity\r\n\r\n* Handle string input in call method\r\n\r\n* Update validate_models.py\r\n\r\n* Update download_models.py according to 1.\r\n\r\n* Update download_models.py\r\n\r\n* Update download_models.py\r\n\r\n* Update download_models.py\r\n\r\n* Enhance LaserTokenizer with Perl Parity, Optional Punctuation Normalization, and Embedding Normalization (#262)\r\n\r\n* Introduce pearl compability flag\r\n\r\n* Add argument `normalize_punct` to `LaserTokenizer`\r\n\r\n* Add normalize_embeddings option to encode_sentences\r\n\r\n* Update README on normalize_embeddings option\r\n\r\n* style: Run black and isort\r\n\r\n* test: Add tests for normalize_embeddings flag in sentence encoder\r\n\r\n* style: Run black\r\n\r\n* Update validate_models.py\r\n\r\n* Update models.py\r\n\r\n* Update laser_tokenizer.py\r\n\r\n* Update download_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Added slow and fast tests to validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Create test_validate_models.py\r\n\r\n* Rename test_validate_models.py to test_models_initialization.py\r\n\r\n* Update test_models_initialization.py\r\n\r\n* Update test_models_initialization.py\r\n\r\n* Update download_models.py\r\n\r\n* Update test_models_initialization.py\r\n\r\n* Update test_models_initialization.py\r\n\r\n* Update download_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update validate_models.py\r\n\r\n* Update README.md\r\n\r\n* Update README.md\r\n\r\n* Decrease versions of numpy and torch required by laser-encoders (#264)\r\n\r\n* Update requirements to follow fairseq\r\n\r\n* Update README\r\n\r\n* Update dependencies in toml file\r\n\r\n* Remove requirements.txt\r\n\r\n* Update laser_encoders README\r\n\r\n* resolve parity with MOSES-4.0 release\r\n\r\n* update test\r\n\r\n* Update the main README file with a mention of `laser_encoders` (#266)\r\n\r\n* update the main readme file\r\n\r\n* wording changes\r\n\r\n* update the example in the readme\r\n\r\n* fix readme text\r\n\r\n* Update language_list.py (#269)\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Updated laser encoder pipeline\r\n\r\n* Update models.py\r\n\r\n* Update models.py\r\n\r\n* Added warning for using laser2 with a language\r\n\r\n* add tests to test_laser_tokenizer.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update models.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n---------\r\n\r\nCo-authored-by: CaptainVee \r\nCo-authored-by: Victor Joseph <53542380+CaptainVee@users.noreply.github.com>\r\nCo-authored-by: Kevin Heffernan <73017975+heffernankevin@users.noreply.github.com>\r\nCo-authored-by: Okewunmi Paul \r\nCo-authored-by: NIXBLACK11 \r\nCo-authored-by: Siddharth Singh Rana <91743459+NIXBLACK11@users.noreply.github.com>\r\nCo-authored-by: Kevin Heffernan ","shortMessageHtmlLink":"MLH fellowship contribution: adding the laser_encoders module (#249)"}},{"before":"b4aed58b817b1d41551d340307d798482523aac4","after":"9cde37a43a5c7ea3d1c6d3d9da3efab5eaa2b178","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-21T15:44:47.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Update language_list.py (#269)\n\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Updated laser encoder pipeline\r\n\r\n* Update models.py\r\n\r\n* Update models.py\r\n\r\n* Added warning for using laser2 with a language\r\n\r\n* add tests to test_laser_tokenizer.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update models.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update test_laser_tokenizer.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py\r\n\r\n* Update language_list.py","shortMessageHtmlLink":"Update language_list.py (#269)"}},{"before":"77bf7fb0d23106f255645e6ae90c3d6ed6f7cf09","after":null,"ref":"refs/heads/fix-parity","pushedAt":"2023-11-20T15:29:04.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"}},{"before":"90db293f537d6744b4544325481b6a9688b26012","after":"b4aed58b817b1d41551d340307d798482523aac4","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-20T15:26:28.000Z","pushType":"pr_merge","commitsCount":3,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #268 from facebookresearch/fix-parity\n\nEnsure `laser_encoders` has parity with existing LASER inference code for release","shortMessageHtmlLink":"Merge pull request #268 from facebookresearch/fix-parity"}},{"before":"3f270aaf89090d642649a1680368f329a2a84099","after":null,"ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-17T13:31:13.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"}},{"before":null,"after":"3f270aaf89090d642649a1680368f329a2a84099","ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-17T13:31:08.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"fix readme text","shortMessageHtmlLink":"fix readme text"}},{"before":"3f270aaf89090d642649a1680368f329a2a84099","after":null,"ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-17T13:30:31.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"}},{"before":"cd6118efba533e95b6c94c36aa61dc7152f73d5e","after":"90db293f537d6744b4544325481b6a9688b26012","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-17T13:30:25.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Update the main README file with a mention of `laser_encoders` (#266)\n\n* update the main readme file\r\n\r\n* wording changes\r\n\r\n* update the example in the readme\r\n\r\n* fix readme text","shortMessageHtmlLink":"Update the main README file with a mention of laser_encoders (#266)"}},{"before":"93bbbada1c5c60e7b71413f6537d1b73262d39d5","after":"3f270aaf89090d642649a1680368f329a2a84099","ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-17T13:24:28.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"fix readme text","shortMessageHtmlLink":"fix readme text"}},{"before":"ea7691cab801743670218048245141d86db0536b","after":"77bf7fb0d23106f255645e6ae90c3d6ed6f7cf09","ref":"refs/heads/fix-parity","pushedAt":"2023-11-17T12:25:38.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"update test","shortMessageHtmlLink":"update test"}},{"before":"cd6118efba533e95b6c94c36aa61dc7152f73d5e","after":"ea7691cab801743670218048245141d86db0536b","ref":"refs/heads/fix-parity","pushedAt":"2023-11-17T12:06:04.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"resolve parity with MOSES-4.0 release","shortMessageHtmlLink":"resolve parity with MOSES-4.0 release"}},{"before":null,"after":"cd6118efba533e95b6c94c36aa61dc7152f73d5e","ref":"refs/heads/fix-parity","pushedAt":"2023-11-17T09:03:07.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Decrease versions of numpy and torch required by laser-encoders (#264)\n\n* Update requirements to follow fairseq\r\n\r\n* Update README\r\n\r\n* Update dependencies in toml file\r\n\r\n* Remove requirements.txt\r\n\r\n* Update laser_encoders README","shortMessageHtmlLink":"Decrease versions of numpy and torch required by laser-encoders (#264)"}},{"before":"1f5b2e50aaa9907b312e7554cedbfd45506e5a85","after":"93bbbada1c5c60e7b71413f6537d1b73262d39d5","ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-16T10:29:22.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"update the example in the readme","shortMessageHtmlLink":"update the example in the readme"}},{"before":"b943033763a36c7b43d320eeeb12089a1c5a06e1","after":"1f5b2e50aaa9907b312e7554cedbfd45506e5a85","ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-16T10:23:00.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"wording changes","shortMessageHtmlLink":"wording changes"}},{"before":null,"after":"b943033763a36c7b43d320eeeb12089a1c5a06e1","ref":"refs/heads/update-laser-dependencies","pushedAt":"2023-11-16T10:19:21.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"update the main readme file","shortMessageHtmlLink":"update the main readme file"}},{"before":"63606271d9f4c460a59ba0628e6a32dcea318790","after":"cd6118efba533e95b6c94c36aa61dc7152f73d5e","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-15T14:27:08.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Decrease versions of numpy and torch required by laser-encoders (#264)\n\n* Update requirements to follow fairseq\r\n\r\n* Update README\r\n\r\n* Update dependencies in toml file\r\n\r\n* Remove requirements.txt\r\n\r\n* Update laser_encoders README","shortMessageHtmlLink":"Decrease versions of numpy and torch required by laser-encoders (#264)"}},{"before":"b0131d905211e6084e8df5b78f293b0faca11b06","after":"63606271d9f4c460a59ba0628e6a32dcea318790","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-15T10:33:03.000Z","pushType":"pr_merge","commitsCount":3,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #265 from NIXBLACK11/Laser_readme_update\n\nAdded Contributers in the readme.","shortMessageHtmlLink":"Merge pull request #265 from NIXBLACK11/Laser_readme_update"}},{"before":"3c5f5edff5f5e926f855d392bee4cc5d55df7570","after":"b0131d905211e6084e8df5b78f293b0faca11b06","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-14T14:57:04.000Z","pushType":"pr_merge","commitsCount":47,"pusher":{"login":"heffernankevin","name":"Kevin Heffernan","path":"/heffernankevin","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/73017975?s=80&v=4"},"commit":{"message":"Merge pull request #257 from NIXBLACK11/Language_model_validation\n\nAdding Language Validation Test","shortMessageHtmlLink":"Merge pull request #257 from NIXBLACK11/Language_model_validation"}},{"before":"c5b6f6028400c428f88be059f8e5b3c8fc12102d","after":"3c5f5edff5f5e926f855d392bee4cc5d55df7570","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-08T10:25:05.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Enhance LaserTokenizer with Perl Parity, Optional Punctuation Normalization, and Embedding Normalization (#262)\n\n* Introduce pearl compability flag\r\n\r\n* Add argument `normalize_punct` to `LaserTokenizer`\r\n\r\n* Add normalize_embeddings option to encode_sentences\r\n\r\n* Update README on normalize_embeddings option\r\n\r\n* style: Run black and isort\r\n\r\n* test: Add tests for normalize_embeddings flag in sentence encoder\r\n\r\n* style: Run black","shortMessageHtmlLink":"Enhance LaserTokenizer with Perl Parity, Optional Punctuation Normali…"}},{"before":"e6f4805e95612a0b35a83042fc48e8da970b399b","after":"c5b6f6028400c428f88be059f8e5b3c8fc12102d","ref":"refs/heads/MLH-dev","pushedAt":"2023-11-07T10:28:32.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Extend Tokenizer to Support Single Strings and Lists of Strings (#258)\n\n* Handle case for both str and list in tokenizer\r\n\r\n* test: Add test for tokenizer call method\r\n\r\n* Rename 'sentences' argument to 'text_or_batch' for clarity\r\n\r\n* Handle string input in call method","shortMessageHtmlLink":"Extend Tokenizer to Support Single Strings and Lists of Strings (#258)"}},{"before":"e3257c1b01fb9ed67be709d0b4405e91323928be","after":"e6f4805e95612a0b35a83042fc48e8da970b399b","ref":"refs/heads/MLH-dev","pushedAt":"2023-10-31T11:03:29.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Refactor `initialize_encoder` to `LaserEncoderPipeline` (#256)\n\n* Remove 'tokenize' argument from initialize_encoder function\r\n\r\n* Add LaserEncoderPipeline for streamlined tokenization and encoding\r\n\r\n* docs: Update README to show use of LaserEncoderPipeline\r\n\r\n* style: Reformat code using black\r\n\r\n* refactor: move encoder and tokenizer initialization into repective files\r\n\r\n* style: run black\r\n\r\n* test: Add test for LaserEncoderPipeline","shortMessageHtmlLink":"Refactor initialize_encoder to LaserEncoderPipeline (#256)"}},{"before":"0976ee84a9f404f0dd7b105d603da52a551bfe21","after":"e3257c1b01fb9ed67be709d0b4405e91323928be","ref":"refs/heads/MLH-dev","pushedAt":"2023-10-13T08:33:37.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"avidale","name":"David Dale","path":"/avidale","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/8642136?s=80&v=4"},"commit":{"message":"Handle Interrupted Model Weight Downloads (#253)\n\n* fix: Fix interrupted downloads issue\r\n\r\n* style: Format code using black\r\n\r\n* Update download method to use tempfile\r\n\r\n* style: Remove unnecessary space\r\n\r\n* Fix OSError by using shutil.move for cross-filesystem moves\r\n\r\nUsing os.rename caused an OSError when trying to move files across different filesystems (e.g., from /tmp to another directory).\r\nBy using shutil.move, we gracefully handle such situations,\r\nensuring files are moved correctly regardless of the source and destination filesystems.","shortMessageHtmlLink":"Handle Interrupted Model Weight Downloads (#253)"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEQC_S7gA","startCursor":null,"endCursor":null}},"title":"Activity · facebookresearch/LASER"}