lambdaofgod/README.md

About me

  • I have worked as a data scientist since 2017.
  • 🏢 My work experience can be found on my LinkedIn profile (Jakub Bartczuk).
  • 🎓 I studied Theoretical Mathematics (BSc) and Data Science (MSc) at the University of Wrocław.
  • I am mostly focused on deep learning, especially for NLP and Computer Vision. I enjoy problems that go beyond straightforward supervised learning. Extending standard search engine methods is one such problem.
  • In my free time I like to tinker with open source.
  • The languages I enjoy the most are Elixir, Lisps (Elixir is kinda a Lisp), Rust and Python.
  • 📖 When I need a rest from sitting at the computer, I train martial arts and like to read about mathematics, linguistics, Buddhism and psychology.

Projects

  • At deepsense.ai I worked on TrelBERT, a Polish Twitter BERT.
  • In findkit I put together wrappers that make information retrieval with vector data easier.
  • 📰 NewsBERT is an RSS feed information retrieval app that uses the zero-shot learning feature of huggingface transformers.
  • niph makes searching podcasts easier. The inspiration was Karpathy's transcription of the Lex Fridman Podcast. It is currently tested only with the Lex Fridman podcast, but it will also work on transcriptions with a similar format.

Neural networks for searching github repositories - MSc thesis

With over 500 starred repositories, searching through them became cumbersome. I did a small project for retrieval on starred repositories that looked promising, but it is hard to gauge how useful such a solution would be in practice.

In the thesis I use PapersWithCode data for information retrieval.

PapersWithCode contains links between papers and repositories that implement them. Most repositories are tagged with at least one task like "unsupervised segmentation" or "semantic parsing".

I proposed and built a system that, among other things, uses zero-shot learning and features extracted with Graph Neural Networks from the dependency (call) graph of Python files and functions.
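To give a rough idea of what a function-level call graph looks like, here is a minimal sketch that builds one from Python source using only the standard library `ast` module. It is an illustration under my own assumptions, not the code from the thesis: the helper name `build_call_graph` and the toy `example` source are hypothetical, and calls are matched by bare name only, with no import or scope resolution.

```python
import ast
from collections import defaultdict
from typing import Dict, Set


def build_call_graph(source: str) -> Dict[str, Set[str]]:
    """Map each function defined in `source` to the names it calls.

    Hypothetical helper for illustration: calls are recorded by bare name
    (`foo()` -> "foo", `obj.bar()` -> "bar"), and calls made inside nested
    functions are also attributed to the enclosing function.
    """
    tree = ast.parse(source)
    graph: Dict[str, Set[str]] = defaultdict(set)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            graph[node.name]  # create the node even if it calls nothing
            for child in ast.walk(node):
                if isinstance(child, ast.Call):
                    func = child.func
                    if isinstance(func, ast.Name):
                        graph[node.name].add(func.id)
                    elif isinstance(func, ast.Attribute):
                        graph[node.name].add(func.attr)
    return dict(graph)


# Toy input, made up for this example.
example = '''
def load(path):
    return open(path).read()

def tokenize(text):
    return text.split()

def pipeline(path):
    return tokenize(load(path))
'''

call_graph = build_call_graph(example)
for caller, callees in sorted(call_graph.items()):
    print(caller, "->", sorted(callees))
```

A graph like this (functions as nodes, calls as edges) is the kind of structure a Graph Neural Network could take as input; how node features are built and how the network is trained is beyond this sketch.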

🤗 huggingface spaces

Small projects

Stackexchange

cross-validated

Other 👀

Pinned

  1. github_wordcloud (Public)

    Generate wordclouds from GitHub repository information

    Python, 2 stars

  2. examples-counterexamples (Public)

    Machine learning examples and counterexamples

    Jupyter Notebook, 19 stars, 5 forks

  3. livebooks (Public)

    Elixir

  4. uhackathons (Public)

    microhackathon ideas

    Jupyter Notebook