imvladikon

Follow

Vladimir Gurevich imvladikon

Follow

56 followers · 709 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Highlights

Pro

Block or Report

Block or report imvladikon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

imvladikon/README.md

Hi

I'm Vladimir Gurevich, ML/NLP Engineer (IR tasks, such as Semantic Search, Information Extraction tasks, such as NER, Relation Extraction, etc.).

I am also interested in Speech Recognition and in LLMs.

Works:

jupyter-notebook-viewer - Jupyter Notebook Viewer for local files *.ipynb in browser without Jupyter Notebook installation.
wav2vec2-hebrew - package for speech recognition in Hebrew language using wav2vec2 models that were trained on Hebrew datasets (check out the datasets below).
distiller - distillation TextClassification and TokenClassification models using transformers library with different distillation methods.
spacy-trankit - spacy wrapper for Trankit (NLP pipeline for tokenization+dependency parsing+lemmatization, etc.)

Models:

t5-english-ner - NER model that based on T5 encoder that was trained on extremely small dataset.
sentence-transformers-alephbert - Sentence Transformers model that based on AlephBERT model for sentence similarity tasks.
het5_small_summarization - mt5-small based summarization model for Hebrew

Speech Recognition:

Datasets:

Contacts

Pinned

jupyter-notebook-viewer jupyter-notebook-viewer Public

chrome extension for viewing Jupyter Notebooks in the browser without Jupyter Server

JavaScript 23 4
huawei-nlpcourse-project huawei-nlpcourse-project Public

Topic modeling and classification news on Hebrew with Neural Text Summarizer model

Python 1
distiller distiller Public

knowledge distillations for bert (classification, token classification models)

Python 1
wav2vec2-hebrew wav2vec2-hebrew Public

Speech Recognition for Hebrew (using wav2vec2 models)

Python 2 1

duckdb + huggingface datasets

1

#!/usr/bin/env python3

2

# -*- coding: utf-8 -*-

3

import duckdb

4

import pyarrow as pa

5

from datasets import Dataset

fuzzy_grouper.py

1

#!/usr/bin/env python3

2

# -*- coding: utf-8 -*-

3

"""

4

Simple fuzzy grouping of the list of the dictionaries using any string field and string similarities functions

5

Dependencies: