@nlpaueb

NLP AUEB

The NLP Group of Athens University of Economics and Business.

Popular repositories

  1. edgar-crawler Public

    The only open-source toolkit that can download EDGAR financial reports and extract textual data from specific item sections into nice and clean JSON files.

    Python, 236 stars, 65 forks

  2. greek-bert Public

    A Greek edition of the BERT pre-trained language model

    Python, 137 stars, 10 forks

  3. deep-relevance-ranking Public

    Deep Relevance Ranking Using Enhanced Document-Query Interactions

    Python, 113 stars, 24 forks

  4. bio_image_caption Public

    Biomedical Image Captioning

    Python, 51 stars, 16 forks

  5. finer Public

    FiNER: Financial Numeric Entity Recognition for XBRL Tagging

    Python, 50 stars, 6 forks

  6. gr-nlp-toolkit Public

    A Transformer-based natural language processing toolkit for (modern) Greek.

    Python, 47 stars, 5 forks

Repositories

Showing 10 of 18 repositories
  • edgar-crawler Public

    The only open-source toolkit that can download EDGAR financial reports and extract textual data from specific item sections into nice and clean JSON files.

    Python, 236 stars, GPL-3.0 license, 65 forks, 2 open issues, 2 pull requests, updated Jun 1, 2024
  • dmmcs Public

    Distance from Median Maximum Cosine Similarity

    Jupyter Notebook, 0 stars, MIT license, 0 forks, 5 open issues, 1 pull request, updated May 30, 2024
  • greeklish Public

    Greeklish to Greek

    Jupyter Notebook, 0 stars, Apache-2.0 license, 0 forks, 3 open issues, 0 pull requests, updated May 23, 2024
  • multiple-choice-mutation Public

    Multiple Choice Mutation (MCM) is a technique for generating good quality domain-specific synthetic data with an LLM.

    Jupyter Notebook, 0 stars, 0 forks, 0 open issues, 0 pull requests, updated Mar 26, 2024
  • Python 1 1 0 0 Updated Feb 1, 2024
  • SumQE Public

    SUM-QE, a BERT-based Summary Quality Estimation Model

    Python, 21 stars, MIT license, 3 forks, 0 open issues, 5 pull requests, updated Jul 22, 2023
  • bioCaption Public

    Diagnostic Captioning

    Python, 15 stars, Apache-2.0 license, 1 fork, 5 open issues, 4 pull requests, updated Dec 8, 2022
  • bio_image_caption Public

    Biomedical Image Captioning

    Python, 51 stars, MIT license, 16 forks, 1 open issue, 4 pull requests, updated Dec 8, 2022
  • aueb-bioasq7 Public

    AUEB at BioASQ 7: Document and Snippet Retrieval

    C, 6 stars, 0 forks, 1 open issue, 2 pull requests, updated Jun 21, 2022
  • multi-eurlex Public

    MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer

    Python, 30 stars, 3 forks, 0 open issues, 0 pull requests, updated Jun 7, 2022
