Repo2GPT is a Python application that clones a GitHub repository and generates a tree diagram of the repository's file structure and a consolidated text file containing all the code files in the repository. This utility can help in quickly understanding the structure of a repository and reviewing its code. This simplifies the process of iterative development using LLMs such as GPT-4. Another usecase for it is analyzing a give repo for malicious code using ChatGPT.
Repo2GPT can also be used on a local folder containing code if no repo is available for the project you are trying to consolidate.
...
With the virtual environment activated (optional), install the packages listed in requirements.txt
:
pip install -r requirements.txt
...
With everything set up, you can now use Repo2GPT:
python main.py <GitHub repository URL or local directory path>
Replace with the URL of the repository you want to clone and analyze, or a local directory path you want to analyze.
Repo2GPT will generate a tree diagram of the repository's structure and a consolidated text file containing all the code in the repository. The files will be named tree_diagram.txt and consolidated_code.txt, respectively.
- Add ASM Traversal and mapping similar to ctags.
- Web version or VS Code Extension.
- Account for ignoring more types of irrelevant metadata files.
- Fix some of the encoding errors when consolidating certain readme.md files.
- Better identification of code files and non-code files.
Repo2GPT is licensed under the terms of the MIT license. See LICENSE for more details.