These notebooks should be run sequentially, using the Docker containers listed below.
1. The first notebook fetches the data and builds the dataset.
2. The second notebook vectorizes the code sequences and description sequences and trains three seq2seq models:
   - function tokens -> docstring
   - API sequence -> docstring
   - method name -> docstring
3. The third notebook trains an AWD LSTM language model on docstrings, using fastai's implementation.
4. The fourth notebook trains the final joint embedder from code to docstring vectors.
5. The fifth notebook builds a search engine that uses the trained networks to return query results.
6. The sixth notebook evaluates the model.
To run these notebooks (1-6), we highly suggest using these Docker containers:
- `hamelsmu/ml-gpu`: use this container for any GPU-bound parts.
- `hamelsmu/ml-cpu`: use this container for any CPU-bound parts.
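A typical way to launch the containers might look like the following. This is a sketch, not part of the original instructions: the image tags, the `/ds` mount point, the Jupyter port, and the use of the NVIDIA container runtime are all assumptions you may need to adjust for your setup.

```shell
# CPU-bound steps (e.g., fetching and building the dataset in notebook 1).
# Mounts the current repo at /ds (assumed path) and exposes Jupyter's
# default port 8888.
docker run --rm -it -p 8888:8888 -v "$(pwd)":/ds hamelsmu/ml-cpu bash

# GPU-bound steps (e.g., training the seq2seq models, the AWD LSTM, and
# the joint embedder). Requires the NVIDIA container runtime on the host.
docker run --rm -it --runtime=nvidia -p 8888:8888 -v "$(pwd)":/ds hamelsmu/ml-gpu bash
```

Once inside the container, start Jupyter and open the notebooks in order.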