Skip to content

roverbird/chatgpt_corpus

Repository files navigation

ChatGPT Corpus

A small collection of replies that were generated by ChatGPT and downloaded from the web. Used to analyse word frequency distribution in AI-generated texts.

ChatGPT detector demo: https://textvisualization.app/chatgpt-detector/

Details, findings, explanations in this post: The Intricate Tapestry of ChatGPT Posts: Why LLM overuses some words at the expense of others?

Word frequency data for Project Gutenberg books is collected from Wikipedia, project-gutenberg list

ChatGPT

SEO spam detection

Fake texts