Scraping Best Sellers (Top 100) on Audible.com with Python and Selenium on Google Colaboratory
In this notebook, I scrape the Best Sellers section of Audible.com, where the Top 100 audio books are listed, using Python and Selenium. The scraped data is then processed and stored in a CSV file.
The link to scrape: https://www.audible.com/adblbestsellers
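As a minimal sketch of how this link could be opened with Selenium in a Colab-like environment, the snippet below starts a headless Chrome session and returns the page source. The names CHROME_FLAGS, make_driver and fetch_page_source are hypothetical helpers, and it is an assumption that Chrome and a matching driver are already installed in the runtime; this is not necessarily how the notebook itself sets up the browser.

```python
# Assumed setup: Chrome and a matching driver are available in the runtime.
BEST_SELLERS_URL = "https://www.audible.com/adblbestsellers"

# Flags commonly needed for Chrome in a container without a display.
CHROME_FLAGS = [
    "--headless=new",           # run without a visible browser window
    "--no-sandbox",             # often required when running as root in a container
    "--disable-dev-shm-usage",  # avoid crashes caused by a small /dev/shm
]


def make_driver():
    """Create a headless Chrome driver (hypothetical helper)."""
    # Imported here so the module can be loaded even without Selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for flag in CHROME_FLAGS:
        options.add_argument(flag)
    return webdriver.Chrome(options=options)


def fetch_page_source(url=BEST_SELLERS_URL):
    """Open the URL and return the rendered HTML, always closing the browser."""
    driver = make_driver()
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()
```

Calling fetch_page_source() would return the HTML of the Best Sellers page as a string, which can then be parsed for the individual fields.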
The site's robots.txt has been checked, and scraping the above link is allowed.
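A check like this can also be done programmatically. The sketch below uses Python's standard urllib.robotparser on a made-up robots.txt; SAMPLE_ROBOTS and is_allowed are illustrative names, and the sample rules are not Audible's actual rules.

```python
from urllib.robotparser import RobotFileParser

# Made-up rules for illustration only; NOT the real Audible robots.txt.
SAMPLE_ROBOTS = """User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt text permits fetching the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


print(is_allowed(SAMPLE_ROBOTS, "https://www.example.com/adblbestsellers"))
print(is_allowed(SAMPLE_ROBOTS, "https://www.example.com/private/page"))
```

To check a live site instead, RobotFileParser also offers set_url() and read(), which download and parse the robots.txt file before can_fetch() is called.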
For each audio book, the following fields are scraped:
- Book title
- Authors
- Narrators
- Length
- Release date
- Language
- Ratings
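One way to hold these seven fields in Python is a small dataclass. This is only a sketch: the class name AudioBook, the attribute names, and the choice to keep every field as a plain string are assumptions, and the example values are placeholders rather than scraped data.

```python
from dataclasses import dataclass, asdict


@dataclass
class AudioBook:
    """One row of the Best Sellers list (hypothetical structure)."""
    title: str
    authors: str
    narrators: str
    length: str
    release_date: str
    language: str
    ratings: str


# Placeholder values for illustration, not real scraped data.
example = AudioBook(
    title="Example Title",
    authors="Example Author",
    narrators="Example Narrator",
    length="example length",
    release_date="example date",
    language="English",
    ratings="example rating",
)

# asdict() turns the record into a dictionary, which is convenient for CSV export.
print(asdict(example))
```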
On the website, at the time of scraping in March 2023, the Best Sellers section looked like this: Best Sellers (Top 100) on Audible.com
In PDF format: Best Sellers (Top 100) on Audible.com - PDF file
My scraping code scrapes the data, applies the necessary processing, and stores the top 100 Audible audio books in a CSV file like this: Top 100 Audible audio books (CSV file)
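The final storage step can be sketched with Python's standard csv module. The column names in FIELDS follow the field list above, while write_books_csv, the whitespace stripping, and the sample row are assumptions made for illustration; the notebook's actual processing may differ.

```python
import csv
import io

# Column order follows the field list in this document.
FIELDS = ["Book title", "Authors", "Narrators", "Length",
          "Release date", "Language", "Ratings"]


def write_books_csv(rows, out_file):
    """Write dictionaries keyed by FIELDS to an open text file as CSV."""
    writer = csv.DictWriter(out_file, fieldnames=FIELDS)
    writer.writeheader()
    for row in rows:
        # Minimal processing step (assumed): trim surrounding whitespace.
        writer.writerow({key: str(row[key]).strip() for key in FIELDS})


# Placeholder row for illustration, not real scraped data.
sample_rows = [{
    "Book title": "  Example Title ",
    "Authors": "Example Author",
    "Narrators": "Example Narrator",
    "Length": "example length",
    "Release date": "example date",
    "Language": "English",
    "Ratings": "example rating",
}]

buffer = io.StringIO(newline="")
write_books_csv(sample_rows, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To write to disk instead of an in-memory buffer, pass a file opened with open(path, "w", newline="", encoding="utf-8"), where path is whatever output filename is wanted.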