
Collection and Exploration of Radiohead lyrics

Overview

This project builds a dataset of Radiohead lyrics and performs exploratory data analysis (EDA) on it.

1) Data Collection

This Python script collects the lyrics of Radiohead's albums from Genius.com through the Genius API. It saves the lyrics of each album to its own JSON file.
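The one-file-per-album layout could be sketched roughly as follows. This is not the repository's actual script: `save_albums`, `slugify`, the `fetch_album` callable, and the JSON layout are all hypothetical names chosen for illustration. `fetch_album` stands in for whatever Genius client call returns the songs of an album.

```python
import json
from pathlib import Path


def slugify(name):
    """Turn an album title into a safe, lowercase file name stem."""
    chars = [c.lower() if c.isalnum() else "_" for c in name]
    return "_".join(part for part in "".join(chars).split("_") if part)


def save_albums(album_names, fetch_album, out_dir="lyrics"):
    """Write one JSON file per album and return the paths written.

    fetch_album is a hypothetical callable: it takes an album title and
    returns a list of {"title": ..., "lyrics": ...} dictionaries.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for name in album_names:
        songs = fetch_album(name)
        path = out / f"{slugify(name)}.json"
        with path.open("w", encoding="utf-8") as fh:
            # ensure_ascii=False keeps accented characters readable.
            json.dump({"album": name, "songs": songs}, fh,
                      ensure_ascii=False, indent=2)
        written.append(path)
    return written
```

Passing the fetch step in as a callable keeps the file-writing logic independent of the API client, so it can be exercised without an API token.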

2) Data Cleaning

This script contains several text preprocessing functions, written in Python, for cleaning the collected lyrics. It uses regular expressions to clean the data.
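A minimal sketch of what such a regex-based cleaning function might look like is shown here. The specific steps (dropping bracketed section tags such as `[Chorus]`, lowercasing, stripping punctuation, collapsing whitespace) are assumptions for illustration, and `clean_lyrics` is a hypothetical name, not necessarily one used in the repository.

```python
import re

SECTION_TAG = re.compile(r"\[[^\]]*\]")    # bracketed tags, e.g. [Chorus]
NON_WORD = re.compile(r"[^a-z0-9'\s]")     # anything but letters, digits, ', space
WHITESPACE = re.compile(r"\s+")            # runs of spaces, tabs, newlines


def clean_lyrics(text):
    """Return a lowercase, punctuation-free, single-spaced version of text."""
    text = SECTION_TAG.sub(" ", text)      # remove section markers first
    text = text.lower()
    text = NON_WORD.sub(" ", text)         # replace punctuation with spaces
    return WHITESPACE.sub(" ", text).strip()
```

For example, `clean_lyrics("[Verse 1]\nHello, World! It's   fine.")` returns `"hello world it's fine"`.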

3) EDA

This script performs basic EDA on the collected data.

It uses a word cloud to show which words are most frequently used by Radiohead in their songs.

[Screenshot: word cloud of the most frequent words]
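A word cloud is driven by word frequencies, which could be computed along these lines. This is a sketch under assumptions: `word_frequencies` is a hypothetical helper, the stop-word list is a tiny illustrative sample, and the input is assumed to be already-cleaned lyric strings.

```python
from collections import Counter

# Illustrative stop words only; a real analysis would use a fuller list.
STOPWORDS = {"the", "a", "and", "i", "you", "to", "of", "in", "it", "is"}


def word_frequencies(cleaned_songs, stopwords=STOPWORDS):
    """Count how often each non-stop word appears across all songs."""
    counts = Counter()
    for lyrics in cleaned_songs:
        counts.update(w for w in lyrics.split() if w not in stopwords)
    return counts
```

The resulting counter can be passed to a plotting or word-cloud library, and `counts.most_common(20)` gives the top words directly.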

It also uses a bar graph to show the average number of words per song in each album.

[Screenshot: bar graph of average words per song by album]
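The values behind such a bar graph could be computed as in the following sketch. It assumes the data is held as a mapping from album title to a list of lyric strings. Both that layout and the name `average_words_per_song` are hypothetical.

```python
def average_words_per_song(albums):
    """Map each album title to its mean word count per song.

    albums: dict of album title -> list of lyric strings.
    Albums with no songs are skipped to avoid dividing by zero.
    """
    averages = {}
    for album, songs in albums.items():
        if not songs:
            continue
        word_counts = [len(lyrics.split()) for lyrics in songs]
        averages[album] = sum(word_counts) / len(word_counts)
    return averages
```

The dictionary's keys and values map directly onto the bar labels and bar heights of a chart.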

Cloning the Repository

To clone this project locally, run the following command:

git clone https://github.com/MeetShah9/Collection-and-exploration-of-radiohead-lyrics.git

To install the required Python libraries, run the following command in your terminal:

pip install -r requirements.txt

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Contributing

Contributions are welcome! Please feel free to open an issue or submit a pull request.

About

Collected Lyrics of all Radiohead songs using LyricsGenius
