Skip to content
View lpla's full-sized avatar

Highlights

  • Pro

Organizations

@paracrawl @bitextor @macocu @multiscore @Grupo-Enercoop
Block or Report

Block or report lpla

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. bitextor/bitextor bitextor/bitextor Public

    Bitextor generates translation memories from multilingual websites

    Python 278 43

  2. bitextor/bicleaner bitextor/bicleaner Public

    Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

    Python 145 21

  3. bitextor/bicleaner-ai bitextor/bicleaner-ai Public

    Bicleaner fork that uses neural networks

    Python 31 4

  4. bitextor/warc2text bitextor/warc2text Public

    Extracts plain text, language identification and more metadata from WARC records

    C++ 17 5